<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML"
         xmlns:xlink="http://www.w3.org/1999/xlink"
         article-type="methods-article"
         dtd-version="1.2"
         xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">Digital Twin</journal-id>
            <journal-title-group>
                <journal-title>Digital Twin</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2752-5783</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/digitaltwin.17467.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Method Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Digital twin data: methods and key technologies</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 4 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Zhang</surname>
                        <given-names>Meng</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2253-4025</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Tao</surname>
                        <given-names>Fei</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-9020-0633</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Huang</surname>
                        <given-names>Biqing</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Liu</surname>
                        <given-names>Ang</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-9353-0948</uri>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Wang</surname>
                        <given-names>Lihui</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-8679-8049</uri>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Anwer</surname>
                        <given-names>Nabil</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Nee</surname>
                        <given-names>A. Y. C.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a6">6</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Automation, Tsinghua University, Beijing, 100084, China</aff>
                <aff id="a2">
                    <label>2</label>School of Automation Science and Electrical Engineering, Beihang University, Beijing, 100191, China</aff>
                <aff id="a3">
                    <label>3</label>School of Mechanical and Manufacturing Engineering, University of New South Wales, Sydney, NSW, 2052, Australia</aff>
                <aff id="a4">
                    <label>4</label>Department of Production Engineering, KTH Royal Institute of Technology, Stockholm, SE-10044, Sweden</aff>
                <aff id="a5">
                    <label>5</label>Automated Production Research Laboratory, Paris-Saclay University, ENS Paris-Saclay, LURPA, 91190, Gif-sur-Yvette, France</aff>
                <aff id="a6">
                    <label>6</label>Department of Mechanical Engineering, National University of Singapore, Singapore, 117576, Singapore</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:ftao@buaa.edu.cn">ftao@buaa.edu.cn</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>2</month>
            <year>2022</year>
            </pub-date>
            <pub-date pub-type="collection">
            <year>2021</year>
            </pub-date>
         <volume>1</volume>
            <elocation-id>2</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>28</day>
                    <month>1</month>
               <year>2022</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Zhang M et al.</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://digitaltwin1.org/articles/1-2/pdf"/>
            <abstract>
                <p>As a promising technology to converge the traditional industry with the digital economy, digital twin (DT) is being investigated by researchers and practitioners across many different fields. The importance of data to DT cannot be overstated. Data plays critical roles in constructing virtual models, building cyber-physical connections, and executing intelligent operations. The unique characteristics of DT put forward a set of new requirements on data. Against this background, this paper discusses the emerging requirements on DT-related data with respect to data gathering, interaction, universality, mining, fusion, iterative optimization, and on-demand usage. A new notion, namely digital twin data (DTD), is introduced. This paper explores some basic principles and methods for DTD gathering, interaction, storage, association, fusion, evolution and servitization, as well as the key enabling technologies. Based on the theoretical underpinning provided in this paper, it is expected that more DT researchers and practitioners can incorporate DTD into their DT development process.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>digital twin (DT)</kwd>
                <kwd>digital twin data (DTD)</kwd>
                <kwd>principles</kwd>
                <kwd>methods</kwd>
                <kwd>key technologies</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/501100005089">
                    <funding-source>Beijing Municipal Natural Science Foundation</funding-source>
                    <award-id>JQ19011</award-id>
                </award-group>
                <award-group id="fund-2">
                    <funding-source>National Key Research and Development Program of China</funding-source>
                    <award-id>2020YFB1708400</award-id>
                </award-group>
                <funding-statement>This work is supported by the National Key Research and Development Program of China (2020YFB1708400) and Beijing Municipal Natural Science Foundation (JQ19011). The grants were assigned to Prof. Fei Tao.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>In the revised version, more technical details are added in the section &#x201c;Key enabling technologies for digital twin data&#x201d; to better support the DTD applications. Then more explanations for some concepts (e.g. digital twin data, connection data, fusion data) are provided by adding related examples or definitions. DTD challenges are discussed in the last section. Figure 2, Figure 3 and Figure 4 are further refined in quality to better match the text in the corresponding sections. Besides, some well-cited references are added to supplement the work.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec>
            <title>From data to digital twin data</title>
            <p>With the rapid development of new information technologies (e.g., Internet of Things, cloud computing, big data, and artificial intelligence), the digital economy continues to flourish on a global scale. As an innovative way to converge the traditional industry and digital economy, digital twin (DT) is attracting continued attention from different fields
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>, such as aerospace
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>,
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>, automotive
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>,
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>, shipping
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>, smart grid
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>,
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>, and smart city
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>. Especially in the field of smart manufacturing, DT has been widely applied in control and management for shop-floor
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>,
                    <xref ref-type="bibr" rid="ref-11">11</xref>
                </sup>, rapid configuration of production line
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>, product lifecycle management
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>, intelligent logistics
                <sup>
                    <xref ref-type="bibr" rid="ref-14">14</xref>
                </sup>, dynamic scheduling
                <sup>
                    <xref ref-type="bibr" rid="ref-15">15</xref>,
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>, robot operation optimization
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup>, product quality assurance
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>,
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup>, machine tool maintenance
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>
                </sup>, and human-robot interaction
                <sup>
                    <xref ref-type="bibr" rid="ref-21">21</xref>,
                    <xref ref-type="bibr" rid="ref-22">22</xref>
                </sup>. Practical applications of digital twin involve a set of key enabling technologies
                <sup>
                    <xref ref-type="bibr" rid="ref-23">23</xref>
                </sup>, concerning the construction and verification of virtual models, the construction and management of intelligent services, the real-time sensing and control of physical entities, cyber-physical interaction and integration, multi-modal data association and fusion, and so forth. Since data is indispensable in empowering all these technologies, it can be argued that the success of DT lies in the availability of high-quality data source.</p>
            <p>Early-stage data acquisition is typically conducted by manual means, which suffers from low efficiency and high cost. Since the collected data is less in quantity and poor in real-time performance, they could only reflect features of a physical entity (e.g., machine tool and process) for a limited period of time with inevitable delays. Benefiting from the emergence of new information technologies in recent years, a huge volume of data can be collected by sensors, IoT devices, mobile devices, and wearable devices in real time and processed through an integrated computing infrastructure (e.g., cloud computing, edge computing, and fog computing). As a result, it is possible to obtain a complete data record and conduct timely analysis throughout the whole lifecycle of a physical entity. On the other hand, the growth of DT-related research and application puts forward new requirements on data.</p>
            <sec>
                <title>(1) Requirements on comprehensive data gathering</title>
                <p>Comprehensive data is required to enhance the accuracy, efficiency, and adaptability of DT-based services (e.g., performance prediction, process optimization, and quality assurance). Comprehensive data refers to a wide spectrum of data including, for example, data on both normal state and abnormal state, data on both common event and rare event, data on both certain scenario and uncertain scenario, and so forth. DT applications developed upon incomprehensive data suffer from various challenges. On the one hand, some existing work focuses mainly on data gathered from the physical world, for which, it is difficult to include low-probability data (e.g., failure data and extreme environment data) and data that could not be measured directly (e.g., deformation data, stress distribution, and airflow distribution)
                    <sup>
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup>. On the other hand, some other work focuses primarily on data gathered from virtual models, for which, it is difficult to accurately simulate disturbance data caused by abrupt disturbances and time-varying data with high uncertainties
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup>. A successful DT solution should be powered by comprehensive data obtained from both the physical and virtual worlds.</p>
            </sec>
            <sec>
                <title>(2) Requirements on real-time data interaction</title>
                <p>Real-time data interaction is required to enable coordinated operations. Firstly, real-time data from the physical entity can be used to dynamically update parameters of the virtual models, whereas simulation data from the virtual models is fed back to the physical entity in order to align its behaviors with a simulation plan. Secondly, data from DT-based services should be communicated with the physical entity for timely diagnosis, maintenance, and control, whereas real-time data from the physical entity can be used to update the services and make them more adaptable to the changing physical realities. Thirdly, since service availability should be verified before execution, real-time simulation results by the virtual models should be communicated with service providers to illustrate deficiencies, whereas service data can be used to calibrate the virtual models and improve their accuracy.</p>
            </sec>
            <sec>
                <title>(3) Requirements on data universality</title>
                <p>Low universality of data is a main obstacle for DT applications. It is difficult to transfer DT across different application scenarios, which enclose different requirements and constraints on data acquisition. The difficulty of data exchange and parsing is affected by various facets of a certain application scenario, such as different physical entities (e.g., robot, machine tool, and autonomous vehicle), data interfaces and communication protocols. Against different application scenarios (e.g., design, production, and maintenance), it is difficult to achieve smooth data integration and sharing due to various data formats. To cope with these difficulties, it is necessary to unify data transformation towards high universality of data.</p>
            </sec>
            <sec>
                <title>(4) Requirements on knowledge mining</title>
                <p>To build insightful virtual models that can reflect a physical entity&#x2019;s internal mechanisms and rules, it is necessary to derive knowledge behind raw data through data mining. The in-depth mining of massive data (e.g., physical entity data, virtual model data, and information system data) towards new knowledge remains a challenge. On the one hand, not every data is equally useful for extracting information and knowledge, in particular, concerning the irrelevant data, abnormal data, and redundant data. On the other hand, it is difficult to fully mine the in-depth knowledge hidden behind data.</p>
            </sec>
            <sec>
                <title>(5) Requirements on data fusion</title>
                <p>Since DT-related data comes from multiple sources (e.g., physical entity, virtual model, and service), there exists data noise, inconsistency, and conflict. For the data collected from a physical entity, various factors such as sensor malfunction, environmental fluctuation, and human interference would affect information entropy (a higher value indicates higher data uncertainty). For the data simulated by virtual models, deviations from the physical reality due to unsatisfactory model effectiveness would reduce the data reliability. In addition, neither the collected data nor the simulated data is sufficient to derive global perspectives. Data fusion is therefore required, through which, data obtained from diversified sources is integrated synthetically. It would bring lots of benefits, such as reducing information entropy of the sensor data (i.e. reducing data uncertainty), reducing root mean squared error (RMSE) between the simulated data and measured data (i.e. improving data accuracy), and improving relevancy between the data and a certain target indicator, such as product quality, remaining useful life of a key component, and equipment healthy condition (i.e. increasing the amount of information related to the target indicator). By doing so, data can be verified, corrected, and supplemented by each other, hence leading to more accurate and consistent information extraction.</p>
            </sec>
            <sec>
                <title>(6) Requirements on iterative optimization</title>
                <p>In the paper, iterative optimization refers to a cyclic process of &#x201c;data increase - data fusion - information increase&#x201d;, through which, new data is fused with historical data to generate new information continuously. In the cyclic process, each optimization, triggered by the injection of new data, aims at adding appropriate data into the fusion to increase the amount of valuable information carried by the fusion data, while being subject to possible constraints on the data to be merged, such as keeping the data redundancy lower than an upper bound. The iterative optimization aims at realizing the continuous information increase.</p>
            </sec>
            <sec>
                <title>(7) Requirements on convenient data usage</title>
                <p>Since DT users play different roles and undertake different responsibilities, they tend to demand different types of data. For examples, field operators demand on-site operational data (e.g., assembly sequence, maintenance steps, and control order); technicians demand process data (e.g., real-time condition monitoring data, equipment performance data, and diagnosis data); senior managers demand market data (e.g., material cost, market dynamics, and product benchmarking)
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. It is difficult to implement generic data operations (e.g., searching, matching, combination, invocation and visualization) in consideration of significant user discrepancies in terms of demand, professional skill, knowledge level, 
                    <italic toggle="yes">etc.</italic> To address this issue, it is necessary to encapsulate data provision as on-demand services.</p>
                <p>Considering the above-mentioned requirements, the authors proposed a new notion, namely digital twin data (DTD), which constitutes an important part of the five-dimension DT, in the previous work
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. Based on this, this paper aims to provide the theoretical underpinning of DTD with respect to its structure and processing. The remainder of this paper is organized as follows. Section 2, &#x2018;Composition of digital twin data&#x2019;, introduces the composition of DTD, followed by a set of DTD principles in Section 3, &#x2018;Principles for digital twin data&#x2019;. Section 4, &#x2018;Methodology for digital twin data&#x2019;, presents a structured methodology for DTD gathering, interaction, storage, association, fusion, evolution and servitization. Section 5, &#x2018;Key enabling technologies for digital twin data&#x2019; elaborates a set of key technologies concerning seven aspects. The final section concludes this work and outlines future work.</p>
            </sec>
        </sec>
        <sec>
            <title>Composition of digital twin data</title>
            <p>DTD refers to a wide spectrum of data closely related to the DT. It focuses equally on data collected from the physical space (e.g. sensor data from equipment, materials, and workers) and data produced by the virtual space (e.g. data mimicked by simulation models, data generated by algorithms embedded in services, and data deduced based on knowledge), and tends to make the data from the two spaces corrected and supplemented by each other through data fusion, to achieve more accurate and comprehensive information for the DT-related applications. According to previous studies
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>, DTD can be classified into six parts, i.e., physical entity-related data, virtual model-related data, service-related data, fusion data, connection data, and domain knowledge. 
                <xref ref-type="fig" rid="f1">Figure 1</xref> illustrates how these parts are interrelated towards a whole DTD.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>Composition of digital twin data (DTD).</title>
                </caption>
                <graphic orientation="portrait" position="float"
                     xlink:href="https://digitaltwin1-files.f1000.com/manuscripts/18840/6d62e342-fe68-4f23-a6c7-bf137704547f_figure1.gif"/>
            </fig>
            <p>Physical entity refers to an object existing in the physical world with specific functions, behaviors, and structures
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. According to ISO 23247-3 series (Digital Twin manufacturing framework -Part 3: Digital representation of manufacturing elements), physical entity-related data can be grouped in two categories: the first describing the static information concerning the physical entity (e.g. identification, characteristics, schedule) and the second reflecting the dynamic states (e.g. status, location, relationship). These data can help to represent the physical entity in a digital way.</p>
            <p>Virtual models reproduce and describe a physical entity in the digital space with respect to different aspects, such as geometric property, physical parameter, dynamic behavior, operation and maintenance rules, and so forth
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. Virtual model-related data include model parameters and simulation data (e.g., simulation conditions, simulation process data, and simulation results). Virtual model-related data is naturally coupled with the physical entity-related data, as virtual models are essentially built upon a physical entity.</p>
            <p>DT services can be classified into application services and functional services
                <sup>
                    <xref ref-type="bibr" rid="ref-27">27</xref>
                </sup>. Application services are provided, based on the physical entity-related data and virtual model-related data, to directly solve problems in a certain application scenario, such as equipment prognosis, resources scheduling, and product quality assurance. Therefore, the service-related data mainly includes performance data, scheduling data, and quality data. On the other hand, functional services are provided to support normal operations of DT by realizing relevant functions such as model management, data processing, data connection, 
                <italic toggle="yes">etc</italic>. Related data therefore includes model configuration data, algorithm configuration data and service encapsulation data, 
                <italic toggle="yes">etc.</italic>
            </p>
            <p>Domain knowledge includes common knowledge such as expert experience, predefined rules, and industry standards
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. Besides, domain knowledge also involves new knowledge being generated by data mining. Domain knowledge can be used to guide model construction, service optimization, data processing, 
                <italic toggle="yes">etc.</italic>
            </p>
            <p>Fusion data can be considered as a result of merging the physical entity-related data, virtual model-related data, service-related data and knowledge. Through data fusion, multi-source data with underlying relations are made complementary with each other, towards forming a unified description
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. Compared with single-source data, fusion data can combine data from multiple perspectives, thus, including more abundant information. For example, the healthy condition of an equipment can be reflected by various sensor data such as the vibrations, speeds, forces and pressures, as well as the simulated data such as deformation, stress distribution, and airflow distribution. The fusion data is a merging result of the above data, which is obtained by processing the data from different sources via data fusion algorithms, such as the weighting method
                <sup>
                    <xref ref-type="bibr" rid="ref-28">28</xref>
                </sup>, neural network
                <sup>
                    <xref ref-type="bibr" rid="ref-29">29</xref>
                </sup>, and Bayesian method
                <sup>
                    <xref ref-type="bibr" rid="ref-30">30</xref>
                </sup>. As a result, the fusion data would be more informative for the condition monitoring.</p>
            <p>Connection data refers to data derived from and transferred among the physical entity-related data, virtual model-related data, service-related data, fusion data and domain knowledge
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. Connection data makes it possible to compare data from different parts of DTD towards high consistency. For example, the real temperature collected by sensors and the expected temperature simulated by virtual models can be selected as the connection data between the physical entity-related data and virtual entity-related data. If their deviation exceeds a predefined threshold, perturbing factors (e.g. manual interferences, environment changes) causing the inconsistency would be found and eliminated, which helps to restore the data consistency.</p>
        </sec>
        <sec>
            <title>Principles for digital twin data</title>
            <p>In this section, a set of basic principles of gathering and processing DTD are proposed by the authors. They are developed in order to fulfil the requirements raised in Section 1 (&#x2018;From data to digital twin data&#x2019;), i.e., comprehensive data gathering, real-time data interaction, data universality, knowledge mining, data fusion, iterative optimization, and convenient data usage. As in 
                <xref ref-type="fig" rid="f2">Figure 2</xref>, the outer layer shows the data requirements in seven aspects as raised in Section 1. While, the corresponding principles, which should be observed for the sake of better fulfilling the requirements, are given one by one in the inner layer.</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>Principles for digital twin data (DTD).</title>
                </caption>
                <graphic orientation="portrait" position="float"
                     xlink:href="https://digitaltwin1-files.f1000.com/manuscripts/18840/6d62e342-fe68-4f23-a6c7-bf137704547f_figure2.gif"/>
            </fig>
            <sec>
                <title>(1) Complementary principle</title>
                <p>The complementary principle of DTD corresponds to the requirement of comprehensive data gathering. It emphasizes the simultaneous gathering of data from both the physical and virtual worlds, which supplement each other and make up respective deficiencies. On one hand, the physical entity-related data can truly reflect the dynamic changes of physical reality, including uncertainty, fuzziness, and complexity, which are difficult to simulate. On the other hand, the virtual models can generate a great variety of rare event data, unmeasurable data, and multi-physics coupling data through simulation at low cost, which cannot be directly collected from the physical world.</p>
            </sec>
            <sec>
                <title>(2) Timeliness principle</title>
                <p>The timeliness principle emphasizes that the connection data between any two parts of DTD should be transferred and adjusted in a timely manner. It aims at ensuring the rapid twinning process, namely the process of synchronizing the virtual and physical states, thus to enable parts of the DT to act synchronously
                    <sup>
                        <xref ref-type="bibr" rid="ref-31">31</xref>
                    </sup>. Besides, when the connection data between two parts is discovered to be inconsistent (i.e., the discrepancy exceeds a predefined value), timely adjustment is triggered to diagnose inconsistency and restore consistency.</p>
            </sec>
            <sec>
                <title>(3) Standardized principle</title>
                <p>As much as possible, DTD obtained from various objects, models, conditions, and scenes should be transformed into standardized data with the unified format, interface and protocol. A template that describes the data with respect to different aspects would be built, to help identify their differences with the standard version. The standardized principle is intended for DTD to fulfil the requirement on data universality, thus to support data exchange between different application objects and data sharing against different application scenarios.</p>
            </sec>
            <sec>
                <title>(4) Association principle</title>
                <p>The association principle corresponds to the requirement on knowledge mining. It emphasizes extracting various association relations among different parts of DTD, such as the causal, analogous, and complementary relations among the physical entity-related data, virtual model-related data, service-related data, and domain knowledge. The extracted associations can reflect valuable information such as causality among events, correlation of performances, and dependency of data variables. Underlying knowledge can be further induced based on such information.</p>
            </sec>
            <sec>
                <title>(5) Fusion principle</title>
                <p>The fusion principle corresponds to the requirement on data fusion. It emphasizes realizing the data fusion by fully merging data with different relations. For examples, by fusing analogous data from the physical and virtual worlds, which could be verified by each other, it can minimize the uncertainty, randomness, and fuzziness of the physical entity-related data or improve the reliability of the virtual model-related data. Besides, by fusing complementary data from different sources, more comprehensive data can be obtained to achieve richer information.</p>
            </sec>
            <sec>
                <title>(6) Information growth principle</title>
                <p>The information growth principle corresponds to the requirement on iterative optimization. It means that, with the continuous accumulation of data, DTD should be optimized iteratively through continued data fusion between the new data with the historical data. After each fusion, the increase or decrease in information quantity can be evaluated to decide whether accept the fusion or not, as a way, to ensure the lasting growth of information.</p>
            </sec>
            <sec>
                <title>(7) Servitization principle</title>
                <p>The servitization principle means encapsulating DTD-related resources (e.g., data, data models, data processing algorithms, and data visualization methods) into on-demand services, which can be treated as black boxes with respect to the corresponding input, output, control, and mechanism. When a user proposes a demand on data, which is decomposed into sub-demands, suitable services are searched, matched, and combined to provide the required data. This principle is intended to lower the requirements on a user&#x2019;s professional skills in virtue of those on-demand data services.</p>
            </sec>
        </sec>
        <sec sec-type="methods">
            <title>Methodology for digital twin data</title>
            <p>Based on the above principles, a methodology for DTD gathering, interaction, storage, association, fusion, evolution, and servitization is developed by authors, as shown in 
                <xref ref-type="fig" rid="f3">Figure 3</xref>. According to the complementary principle, with respect to data gathering, data from both the physical and virtual worlds should be collected simultaneously to ensure data comprehensiveness. The real-time interaction is realized through timely transmission and adjustment for the connection data based on the timeliness principle. According to the standardized principle, with respect to data storage, data with different formats, interfaces and protocols should be transformed towards standard shapes. According to the association principle, various relations between different DT components should be extracted to support further knowledge mining. Based on the fusion principle, fusion data is generated by merging similar data and complementary data for mutual correction and supplementation. The information growth principle is followed to interpret the progressive data evolution. Based on the on-demand principle, the data-related resources are encapsulated into services for different users. The relevant methods are elaborated as follows.</p>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>Methodology for digital twin data (DTD).</title>
                </caption>
                <graphic orientation="portrait" position="float"
                     xlink:href="https://digitaltwin1-files.f1000.com/manuscripts/18840/6d62e342-fe68-4f23-a6c7-bf137704547f_figure3.gif"/>
            </fig>
            <sec>
                <title>Digital twin data gathering</title>
                <p>DTD gathering refers to the gathering of physical entity-related data, virtual model-related data, service-related data, and domain knowledge. These four kinds of data constitute the foundation of DTD, upon which, the fusion data and connection data can be derived. For the physical entity-related data, the dynamic data (e.g., entity state, environment parameters, and abrupt disturbances) can be collected in real time by means of sensors, embedded systems, radio frequency identification (RFID), and so forth; while the static data (e.g., entity attribute, identification and characteristics) can be obtained through off-line measurement and sampling inspection. The virtual model-related data is typically generated by the process of modeling, simulating, and verifying those geometric, physical, behavioral, and rule models. The data can be gathered through manual books, simulation logs, and real-time simulation outputs. In particular, since the physical model-based simulation, e.g., computational fluid dynamics (CFD) simulation, tends to be time-consuming and even inaccurate, a hybrid method that combines physical models and data-driven approaches can be adopted to improve the simulation efficiency and accuracy
                    <sup>
                        <xref ref-type="bibr" rid="ref-19">19</xref>
                    </sup>. Service-related data is generated based on application services and functional services. Such data can be obtained throughout the service construction, operation, and maintenance. Domain knowledge is typically collected from experts, crowdsourced, knowledge bases, and historical data, 
                    <italic toggle="yes">etc</italic>. In addition, the DTD would also extract essential information from the existing IT systems, such as product lifecycle management (PLM) system, product data management (PDM) system, manufacturing execution system (MES) and supervisory control and data acquisition (SCADA) system, which store large amounts of data across the lifecycle of the physical entity
                    <sup>
                        <xref ref-type="bibr" rid="ref-32">32</xref>
                    </sup>.</p>
            </sec>
            <sec>
                <title>Digital twin data interaction</title>
                <p>Connection data between any two parts of DTD supports the real-time interactions. First, it is necessary to select the suitable data to support the message transmission between any pair of two parts. In the interest of data transmission, the data is further processed by filtering algorithms, dimensionality reduction algorithms, and relevance analysis algorithms, which are intended to remove data noise and redundancy. Then the data can be transferred between the DTD parts through some types of data transmission protocols and formats suggested by the ISO 23247-4 (Digital Twin manufacturing framework- Part 4: Information exchange). During the transmission, the consistency of connection data can be evaluated by calculating the data distances. To better illustrate the interaction, take an autoclave (a key equipment for composites curing) as an example. During curing, temperature distribution of the mold within the autoclave is an important quality indicator, since it largely affects the deformation degree of the composite components
                    <sup>
                        <xref ref-type="bibr" rid="ref-19">19</xref>
                    </sup>. To monitor the component quality, the real mold temperatures are collected by thermocouples. While, a virtual model can also be built to mimic the expected mold temperature distribution by using the CFD simulation. The data interaction between the physical equipment and the virtual model is used to ensure that the actual curing is executed as simulated. In this situation, a major feature for reflecting the mold temperature distribution, i.e. the difference between the maximum and minimum temperatures on the mold upper surface, can be selected as the connection data, leaving out other redundant or unimportant features. Then the collected and simulated mold temperature differences can be exchanged and compared. If their deviation is always within a predefined threshold, the expected composite quality would be achieved. Otherwise, perturbing factors causing the inconsistency need to be found, such as uneven airflow field in the autoclave, deteriorating insulation of the autoclave wall, and abrupt changes of curing conditions. These factors should be eliminated by adjusting parameters of the physical autoclave (e.g. heating speed, air speed, pressure) or by maintaining the equipment, which aims at reducing negative effects caused by the disturbances, to keep the collected data close with the simulated ones.</p>
            </sec>
            <sec>
                <title>Digital twin data storage</title>
                <p>The physical entity-related data, virtual model-related data, and service-related data collected from different application objects and scenarios should be transformed into a unified mode for sharing. Towards this goal, data should be represented in a standardized fashion first, which involves the description of data properties, such as content, format, sampling frequency, historical data accumulation, interface, communication protocol, 
                    <italic toggle="yes">etc</italic>. Next, necessary constraints should be enclosed on the properties. For example, constraints on historical data accumulation can help to filter those unqualified application conditions, where DT is inapplicable due to the lack of adequate data. If certain constraints cannot be complied with, it indicates that the corresponding application conditions are not qualified. For the qualified ones, on one hand, data with various formats can be converted into a standardized format, e.g. eXtensible markup language (XML), for the sake of easier data sharing. On the other hand, since data from different objects would be transmitted by different interfaces and communication protocols, a middleware is needed to support the unified data access to the database, shielding the heterogeneity
                    <sup>
                        <xref ref-type="bibr" rid="ref-33">33</xref>
                    </sup>. In addition, there are a wide variety of modelling languages and methods used in the literature to handle data and information modeling for products and systems, such as unified modeling language (UML)
                    <sup>
                        <xref ref-type="bibr" rid="ref-34">34</xref>
                    </sup>, systems modeling language (SysML)
                    <sup>
                        <xref ref-type="bibr" rid="ref-35">35</xref>
                    </sup>, ontology language
                    <sup>
                        <xref ref-type="bibr" rid="ref-36">36</xref>
                    </sup>, 
                    <italic toggle="yes">etc</italic>. New sound mathematical approaches based on Category theory
                    <sup>
                        <xref ref-type="bibr" rid="ref-37">37</xref>
                    </sup> could provide a new way to create a comprehensive foundation for modeling, interoperability and integration.</p>
            </sec>
            <sec>
                <title>Digital twin data association</title>
                <p>Association relations among DTD are mined to support knowledge discovery. Firstly, data obtained from the physical entity, virtual model, and service is preprocessed through data filtering, data reduction, and feature extraction in order to remove irrelevant and useless data. Secondly, temporal and spatial alignments are conducted. For example, the least squares method can be used to synchronize data in time, and transform data into the same coordinate system in space. Next, the relations (e.g., causality, similarity, and complementation) among data are mined by means of Pearson correlation analysis
                    <sup>
                        <xref ref-type="bibr" rid="ref-38">38</xref>
                    </sup>, K-means
                    <sup>
                        <xref ref-type="bibr" rid="ref-39">39</xref>
                    </sup>, Apriori algorithm
                    <sup>
                        <xref ref-type="bibr" rid="ref-40">40</xref>
                    </sup>, 
                    <italic toggle="yes">etc.</italic> Based on this, further knowledge (e.g. behavioral patterns of the physical entity, simulation mechanisms of virtual models, and behavior-performance mappings between different parts of DT) can be deduced through knowledge reasoning based on logic rules, distributed representations, and neural network, 
                    <italic toggle="yes">etc.</italic>
                    <sup>
                        <xref ref-type="bibr" rid="ref-41">41</xref>
                    </sup>, and the deduced knowledge can be expressed in the form of knowledge graph. To support the subsequent data fusion, two kinds of data associations are especially important, i.e., the similar relation and complementary relation. The former refers to the relation between data with similar attributes, values or changing trends, whereas the latter refers to the relation between multi-modal data from multiple sources, which can describe the same attribute or behavior from different perspectives.</p>
            </sec>
            <sec>
                <title>Digital twin data fusion</title>
                <p>Existing work on data fusion focuses primarily on the merging of data from the physical world (e.g., fusing sensor data with manual data), while few efforts have been devoted to fusing data from both the physical and virtual worlds
                    <sup>
                        <xref ref-type="bibr" rid="ref-42">42</xref>,
                        <xref ref-type="bibr" rid="ref-43">43</xref>
                    </sup>. In contrast, DTD fusion involves a holistic merging of physical entity-related data, virtual model-related data, service-related data and domain knowledge. DTD fusion includes the following aspects. In the case that the physical entity-related data is disturbed by environment perturbations, sensor failures, and human interferences, methods such as the weighted average method
                    <sup>
                        <xref ref-type="bibr" rid="ref-28">28</xref>
                    </sup>, Dempster&#x2013;Shafer theory
                    <sup>
                        <xref ref-type="bibr" rid="ref-44">44</xref>
                    </sup>, and Kalman filter
                    <sup>
                        <xref ref-type="bibr" rid="ref-45">45</xref>
                    </sup> can be used to fuse the physical entity-related data with similar virtual model-related data and service-related data to reduce the information entropy. By doing so, the data uncertainty, randomness, and fuzziness can be reduced. Likewise, in the case that the virtual model-related data and service-related data deviate from the reality, methods such as the neural network
                    <sup>
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup> and Bayesian method
                    <sup>
                        <xref ref-type="bibr" rid="ref-30">30</xref>
                    </sup> can be used to merge the data with similar physical entity-related data, as a way, to improve accuracy and reliability. For the complementary multi-modal data from different parts of DTD, methods such as weighted average method
                    <sup>
                        <xref ref-type="bibr" rid="ref-28">28</xref>
                    </sup> and neural network
                    <sup>
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup>, can be used to increase information diversity.</p>
            </sec>
            <sec>
                <title>Digital twin data evolution</title>
                <p>DTD evolution is characterized by a dynamic process, through which, new data is continuously added, processed, and then fused with historical data. This process can be described by the data relation network built based on the complex network. When new data is incorporated into the network, new association relations are built between new data and historical data through preprocessing new data, aligning new data with historical data, and then extracting association relations from aligned data by using data mining algorithms, following by fusing the new data with similar or complementary relations for data correction and supplement. This process can be guided by an automatic fusion mechanism to reduce the dependency on manual labor. Information entropy is employed to evaluate the effectiveness of data evolution. By comparing the information entropy before and after the fusion, the gain or loss of information can be evaluated. Meanwhile, the structure change of the network after each fusion can be analyzed with respect to degree, edge betweenness, clustering coefficient, 
                    <italic toggle="yes">etc</italic>. In light of the evaluation results over time, certain DTD evolution rules can be mined, such as the rules for information transfer and the relations between network structure and information distribution. In turn, such rules can be used to guide the subsequent data fusions with respect to the selection of fusion data,  and methods, to ensure iterative optimization and information growth.</p>
            </sec>
            <sec>
                <title>Digital twin data servitization</title>
                <p>DTD servitization aims to enable users to access data through on-demand services, including multiple aspects as follows
                    <sup>
                        <xref ref-type="bibr" rid="ref-46">46</xref>
                    </sup>. Firstly, various digital resources are encapsulated into services, which have functions (e.g. data searching, processing, and virtualization), inputs (e.g. data type, volume, and feature), outputs (e.g. processed data and visual diagram), quality (e.g. time, cost, and reliability), and states (e.g. working and idle). Given a user demand (e.g., the query of equipment status data, part remaining life data, and maintenance guidance data), relevant services can be prescribed for data extraction, processing, and visualization, etc., respectively. The separate services can be then combined, in compliance with relevant constraints (e.g., time and cost), towards an integrated service solution. Related methods include demand decomposition, similarity matching, multi-objective optimization, and so forth
                    <sup>
                        <xref ref-type="bibr" rid="ref-46">46</xref>
                    </sup>. With respect to the DTD visualization (i.e., a particular form of DTD service), on top of the traditional visual diagrams, virtual reality (VR)
                    <sup>
                        <xref ref-type="bibr" rid="ref-47">47</xref>
                    </sup> and augmented reality (AR)
                    <sup>
                        <xref ref-type="bibr" rid="ref-48">48</xref>
                    </sup> can be exploited to visualize the mapping relations between the physical entity and virtual models. Finally, the integrated services provide demanded results to users.</p>
            </sec>
        </sec>
        <sec>
            <title>Key enabling technologies for digital twin data</title>
            <p>As illustrated in 
                <xref ref-type="fig" rid="f4">Figure 4</xref>, the key enabling technologies for DTD are explored based on the enabling technologies for DT in general
                <sup>
                    <xref ref-type="bibr" rid="ref-23">23</xref>
                </sup> as well as the enabling technologies for digital twin shop-floor (DTS) in particular
                <sup>
                    <xref ref-type="bibr" rid="ref-49">49</xref>
                </sup>. Some main technologies are explained as follows.</p>
            <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                <label>Figure 4. </label>
                <caption>
                    <title>Key enabling technologies for digital twin data (DTD).</title>
                    <p>AR, augmented reality; VR, virtual reality.</p>
                </caption>
                <graphic orientation="portrait" position="float"
                     xlink:href="https://digitaltwin1-files.f1000.com/manuscripts/18840/6d62e342-fe68-4f23-a6c7-bf137704547f_figure4.gif"/>
            </fig>
            <p>(1)&#x00a0;&#x00a0;&#x00a0;DTD gathering deals with rare-event data, extreme environment data, multi-physics coupling data, 
                <italic toggle="yes">etc.</italic> Since such data is difficult to be measured directly, data generation based on simulation plays a crucial role. First, various modeling and simulation tools that can help to generate data concerning geometry, physics, behaviors, and rules are needed, such as AutoCAD, UG, ANSYS, COMSOL, Matlab, Pajek, 
                <italic toggle="yes">etc</italic>.
                <sup>
                    <xref ref-type="bibr" rid="ref-23">23</xref>
                </sup>. Then a set of activities of verification, validation and accreditation (VV&amp;A) can be applied to ensure the model accuracy, such as raw modeling data credibility assessment, model transformation process checking (e.g. from conceptual model to formal model), and model completeness checking
                <sup>
                    <xref ref-type="bibr" rid="ref-50">50</xref>
                </sup>. For the scenario that the modeling data is insufficient, transfer learning technology can be employed to enable the modeling based on small sample data, since this technology can use data from already known processes to supplement data of the similar process with limited data availability. The high-efficiency simulation technology can be used to improve the efficiency of data generation. For example, metamodel can reduce the computation burdens of simulation, since it is a relatively simple approximation of the input-output function defined by the complex and time-consuming simulation
                <sup>
                    <xref ref-type="bibr" rid="ref-51">51</xref>
                </sup>. </p>
            <p>(2)&#x00a0;&#x00a0;&#x00a0;With respect to data interaction, data transmission technology is required to exchange connection data among different parts of DTD. ISO 23247-4 provides some options for data transmission protocols and formats. For example, the virtual entity can access data of the physical entity through protocols such as MTConnect, OPC-UA, HTTP and CoAP; and the virtual entity can be connected with the service by using application interfaces (i.e. open API) or by employing shared memory. The suggested data formats include XML, JavaScript object notation (JSON), quality information framework (QIF) and so forth. The asset administration shell, which virtualizes the physical asset and enables the access to asset-related data via application interfaces, also provides a way for data exchange
                <sup>
                    <xref ref-type="bibr" rid="ref-52">52</xref>
                </sup>. Then to reduce the redundancy in the exchanged data, measures such as Pearson correlation coefficient
                <sup>
                    <xref ref-type="bibr" rid="ref-38">38</xref>
                </sup> and mutual information
                <sup>
                    <xref ref-type="bibr" rid="ref-53">53</xref>
                </sup> can be employed to judge the overlapping of the data. Besides, Euclidean distance, Manhattan distance and dynamic time warping
                <sup>
                    <xref ref-type="bibr" rid="ref-54">54</xref>
                </sup>, 
                <italic toggle="yes">etc</italic>. can be used to evaluate the discrepancy between data from different parts of the DT to evaluate their consistency.</p>
            <p>(3)&#x00a0;&#x00a0;&#x00a0;DTD storage deals with data originating from different sources with various formats. First, database management technology performs basic data functions such as adding, deleting, changing, and checking. To represent the data in a unified form for easier sharing, many researches explored the data formats transformation methods. For example, 
                <xref ref-type="bibr" rid="ref-55">55</xref> converted the unstructured data to semi-structured data in XML format, and then to structured data through parsing XML file; and 
                <xref ref-type="bibr" rid="ref-56">56</xref> transformed the heterogeneous big data to a semantically-enriched format called resource description framework (RDF). Data middleware technology completes the transformations for data interfaces and communication protocols. For example, the Industrial Internet-of-Things Hub (II-hub) was designed to transform various data interfaces (e.g. RS232, RS484, CAN) and protocols (e.g. Profibus, Modbus) into the unified Wi-Fi interface and CoAP protocol, respectively
                <sup>
                    <xref ref-type="bibr" rid="ref-33">33</xref>
                </sup>. Besides, for data storage, the security would always be a concerned problem. To deal with it, related activities such as encryption, proof of ownership, authentication and authorization need to be conducted
                <sup>
                    <xref ref-type="bibr" rid="ref-57">57</xref>
                </sup>.</p>
            <p>(4)&#x00a0;&#x00a0;&#x00a0;With respect to data association, related technologies include spatial-temporal data alignment, data mining, knowledge reasoning, knowledge representation, 
                <italic toggle="yes">etc</italic>. Spatial-temporal data alignment algorithms, including the least square method, maximum likelihood method and Kalman filter method, 
                <italic toggle="yes">etc</italic>., makes DTD synchronous in time and in the same coordinate system in space
                <sup>
                    <xref ref-type="bibr" rid="ref-58">58</xref>
                </sup>. Data mining algorithms (e.g. K-means
                <sup>
                    <xref ref-type="bibr" rid="ref-39">39</xref>
                </sup>, Apriori algorithm
                <sup>
                    <xref ref-type="bibr" rid="ref-40">40</xref>
                </sup>, algorithm based on conditional mutual information
                <sup>
                    <xref ref-type="bibr" rid="ref-59">59</xref>
                </sup>) can make visible clustering groups and association relations among DTD. For example, the conditional mutual information is a measure to identify the complementary relation between two variables, which means that the appearance of one variable would make the other more relevant with a certain target indicator
                <sup>
                    <xref ref-type="bibr" rid="ref-59">59</xref>
                </sup>. Then knowledge reasoning technology based on logic rules, distributed representations, neural network, 
                <italic toggle="yes">etc</italic>.
                <sup>
                    <xref ref-type="bibr" rid="ref-41">41</xref>
                </sup>, can be used to extract new knowledge from the extracted relations and groups, or from existing knowledge. Knowledge representation technology (e.g., knowledge graph) can visually describe the knowledge, knowledge carriers, and knowledge relations.</p>
            <p>(5)&#x00a0;&#x00a0;&#x00a0;With respect to data fusion, related technologies include anomaly detection, granularity transformation, heterogeneous data fusion, fault-tolerant technology, 
                <italic toggle="yes">etc</italic>.
                <sup>
                    <xref ref-type="bibr" rid="ref-58">58</xref>
                </sup>. Anomaly detection technology can be employed to remove abnormal data before fusion. The basic idea is training a model that describes the normal pattern of the target object based on historical data, to detect the anomaly of the newly collected data
                <sup>
                    <xref ref-type="bibr" rid="ref-60">60</xref>
                </sup>. Granularity transformation technology can be used to transform data with different granularities (e.g., raw data and extracted features, sparse data and dense data, a large collection of text and refined knowledge)
                <sup>
                    <xref ref-type="bibr" rid="ref-58">58</xref>
                </sup> into the same level. Related methods include the feature extraction algorithms (e.g. Fourier transformation and wavelet analysis) for raw data processing, the neural network for building mapping relations between the sparse and dense data
                <sup>
                    <xref ref-type="bibr" rid="ref-61">61</xref>
                </sup>, as well as the knowledge extraction for text
                <sup>
                    <xref ref-type="bibr" rid="ref-62">62</xref>
                </sup>, 
                <italic toggle="yes">etc</italic>. Then data fusion algorithms (e.g. weighting method
                <sup>
                    <xref ref-type="bibr" rid="ref-28">28</xref>
                </sup>, Dempster-Shafer theory
                <sup>
                    <xref ref-type="bibr" rid="ref-44">44</xref>
                </sup>, and Kalman filter
                <sup>
                    <xref ref-type="bibr" rid="ref-45">45</xref>
                </sup>) can converge the data for getting richer information. In addition, adopting the distributed fusion framework introduced in 
                <xref ref-type="bibr" rid="ref-58">58</xref> can achieve higher efficiency for data fusion, since the local processors in the framework share the computation tasks.</p>
            <p>(6)&#x00a0;&#x00a0;&#x00a0;Data evolution deals with the iterative process of data fusion. Complex network modeling technology can be used to build the data network, in which various data variables are regarded as nodes, and data association relations (e.g. similarity and complementarity) are treated as edges. The network virtualization and analysis tools are required, such as Pajek, EChart, and Gephi. Statistical characteristics of the network with respect to degree, weighted degree, edge betweenness, and clustering coefficient, 
                <italic toggle="yes">etc</italic>., can be used to identify roles of different nodes, which could help to decide what data to be fused. Once new data is added, the data fusion will merge the new data with the previous ones for richer information. Then the information before and after the fusion can be measured by calculating the information entropy, based on which, to evaluate the efficiency of data evolution. </p>
            <p>(7)&#x00a0;&#x00a0;&#x00a0;With respect to data servitization, related technologies include resource encapsulation, demand decomposition, multi-objective combinational optimization, data visualization based on VR and AR, 
                <italic toggle="yes">etc</italic>. Resource encapsulation technology can transform digital resources into services, by using semantic web, manufacturing service description language and ontology modeling
                <sup>
                    <xref ref-type="bibr" rid="ref-63">63</xref>
                </sup>, 
                <italic toggle="yes">etc</italic>. Demand decomposition technology can be used to decompose a complicated user demand into specific sub-demands, which can facilitate the search of a specific service. The hierarchical task network (HTN), which supports to separate a goal task into more specific tasks, is applied in this field
                <sup>
                    <xref ref-type="bibr" rid="ref-64">64</xref>
                </sup>. Multi-objective combinational optimization technology can be used to integrate multiple services in light of conflicting objectives (e.g., time, cost, and reliability). The commonly used optimization algorithms include genetic algorithm, ant colony algorithm, and particle swarm optimization algorithm, 
                <italic toggle="yes">etc</italic>. Data visualization based on VR and AR can be used to present the relations among data, virtual models, and physical entities in a more intuitive manner.</p>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions and future work</title>
            <p>Data is a core driver for DT. This paper focuses on the DTD that can be classified into physical entity-related data, virtual model-related data, service-related data, domain knowledge, fusion data, and connection data. The advent of DT puts forward some new requirements on data in terms of data gathering, interaction, universality, mining, fusion, iterative optimization, and on-demand usage. Triggered by these requirements, seven basic principles are proposed to support the DTD organization and processing. Guided by these principles, related methods for DTD gathering, interaction, storage, association, fusion, evolution, and servitization are discussed. Finally, the key enabling technologies are discussed.</p>
            <p>This paper provides some theoretical underpinnings for DTD, which are imperative for the further promotion and application of DT, to support more DT researchers to incorporate the DTD into their DT development process. When it comes to a certain DT application scenario, the researchers are expected to analyze the data requirements according to Section 1 (&#x2018;From data to digital twin data&#x2019;) first. Then to satisfy the requirements, related principles for DTD proposed in Section 3 (&#x2018;Principles for digital twin data&#x2019;) should be observed. Under the guideline of the principles, researchers can employ the corresponding methods and key enabling technologies explored in Section 4 (&#x2018;Methodology for digital twin data&#x2019;) and Section 5 (&#x2018;Key enabling technologies for digital twin data&#x2019;) for DTD gathering and processing. The above processes facilitate achieving more comprehensive and consistent data, more abundant and deeper information, as well as more convenient and standard data usage, which could bring countless benefits to DT applications. For example, the DTD-based prognosis would have higher accuracy in virtue of the comprehensive data support. The DTD-based production control can better align the practical process with the simulated plan through the real-time data interaction. The DTD-based product design could have higher efficiency due to the efficient knowledge mining of key information for designers.</p>
            <p>For the DTD, challenges ahead are mainly concerning simulation data generation and cyber-physical data fusion. Firstly, since DTD emphasizes on the supplementary role of simulation data, how to build virtual models to generate the simulation data accurately remains a fundamental challenge. The commonly used models include data model and physical model. However, the former relies heavily on the modeling data and it usually has poor interpretability. Whereas, although the latter is established based on physical mechanisms and constraints, it tends to be over-simplistic sometimes and thus leads to non-ignorable deviations with the practice. The hybrid modeling method that merges the data model and the physical model would be a promising way for addressing this challenge. Secondly, even though cyber-physical integration is a distinguishing feature of DT, cyber-physical data fusion remains to be a challenge. As data from the physical and virtual spaces may have their own limitations and flaws, how to ensure that the fusion result would always be superior to data from either space is a concerned problem.</p>
            <p>In the future, this work will continue to be enriched by (1) applying the generic principles and methods to guide DTD gathering and processing in practical applications, (2) refining the principles and methods based on lessons learned from practical applications, for instance, further improving related processing algorithms to better adapt to DTD, refining data generation and fusion methods, and exploring the hierarchical organization way for the DTD and (3) introducing big data-related tools and platforms to support large-scale DT applications, since in this case, the DTD, which is closely related to real-time dynamic DT, also has the &#x201c;4V&#x201d; character of big data (i.e. volume, variety, value and velocity).</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>No data are associated with this article.</p>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Qi</surname>
                            <given-names>Q</given-names>
                        </name>
					          </person-group>:
                    <article-title>Make more digital twins.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					          </source>
               <year>2019</year>;<volume>573</volume>(<issue>7775</issue>):<fpage>490</fpage>&#x2013;<lpage>491</lpage>.
                    <pub-id pub-id-type="pmid">31554984</pub-id>
                    <pub-id pub-id-type="doi">10.1038/d41586-019-02849-1</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin satellite: concept, key technologies and applications.</article-title>
                    <source>
						
                        <italic toggle="yes">Comput Integr Manuf Syst.</italic>
					          </source>
               <year>2020</year>;<volume>26</volume>(<issue>3</issue>):<fpage>565</fpage>&#x2013;<lpage>588</lpage>.
                    <pub-id pub-id-type="doi">10.13196/j.cims.2020.03.001</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Mandolla</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Petruzzell</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Percoco</surname>
                            <given-names>G</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Building a digital twin for additive manufacturing through the exploitation of blockchain: A case analysis of the aircraft industry.</article-title>
                    <source>
						
                        <italic toggle="yes">Comput Ind.</italic>
					          </source>
               <year>2019</year>;<volume>109</volume>:<fpage>134</fpage>&#x2013;<lpage>152</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.compind.2019.04.011</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Rajesh</surname>
                            <given-names>PK</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Manikandan</surname>
                            <given-names>N</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ramshankar</surname>
                            <given-names>CS</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin of an automotive brake Pad for predictive maintenance.</article-title>
                    <source>
						
                        <italic toggle="yes">Procedia Comput Sci.</italic>
					          </source>
               <year>2019</year>;<volume>165</volume>:<fpage>18</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procs.2020.01.061</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Zheng</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>L</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lu</surname>
                            <given-names>X</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin for geometric feature online inspection system of car body-in-white.</article-title>
                    <source>
						
                        <italic toggle="yes">Int J Comput Integr Manuf.</italic>
					          </source>
               <year>2020</year>;<volume>34</volume>(<issue>7&#x2013;8</issue>):<fpage>752</fpage>&#x2013;<lpage>763</lpage>.
                    <pub-id pub-id-type="doi">10.1080/0951192X.2020.1736637</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Coraddu</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Oneto</surname>
                            <given-names>L</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Baldi</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Data-driven ship digital twin for estimating the speed loss caused by the marine fouling.</article-title>
                    <source>
						
                        <italic toggle="yes">Ocean Eng.</italic>
					          </source>
               <year>2019</year>;<volume>186</volume>:<fpage>106063</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.oceaneng.2019.05.045</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Yan</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Feng</surname>
                            <given-names>D</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twin framework and its application to power grid online analysis.</article-title>
                    <source>
						
                        <italic toggle="yes">CSEE J Power Energy Syst.</italic>
					          </source>
               <year>2019</year>;<volume>5</volume>(<issue>3</issue>):<fpage>391</fpage>&#x2013;<lpage>398</lpage>.
                    <pub-id pub-id-type="doi">10.17775/CSEEJPES.2018.01460</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhao</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>H</given-names>
                        </name>
					          </person-group>:
                    <article-title>A digital twin based estimation method for health indicators of DC-DC Converters.</article-title>
                    <source>
						
                        <italic toggle="yes">IEEE Trans Power Electron.</italic>
					          </source>
               <year>2021</year>;<volume>36</volume>(<issue>2</issue>):<fpage>2105</fpage>&#x2013;<lpage>2118</lpage>.
                    <pub-id pub-id-type="doi">10.1109/TPEL.2020.3009600</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Deng</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Shen</surname>
                            <given-names>Z</given-names>
                        </name>
					          </person-group>:
                    <article-title>A systematic review of a digital twin city: A new pattern of urban governance toward smart cities.</article-title>
                    <source>
						
                        <italic toggle="yes">J Manag Sci Eng.</italic>
					          </source>
               <year>2021</year>;<volume>6</volume>(<issue>2</issue>):<fpage>125</fpage>&#x2013;<lpage>134</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jmse.2021.03.003</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twin shop-floor: a new shop-floor paradigm towards smart manufacturing.</article-title>
                    <source>
						
                        <italic toggle="yes">IEEE Access.</italic>
					          </source>
               <year>2017</year>;<volume>5</volume>:<fpage>20418</fpage>&#x2013;<lpage>20427</lpage>.
                    <pub-id pub-id-type="doi">10.1109/ACCESS.2017.2756069</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Nee</surname>
                            <given-names>AYC</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twin driven smart manufacturing</article-title>. Elsevier,<year>2019</year>.
                    <ext-link ext-link-type="uri"
                         xlink:href="https://www.elsevier.com/books/digital-twin-driven-smart-manufacturing/tao/978-0-12-817630-6">Reference Source</ext-link>
            </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Leng</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Q</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ye</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin-driven rapid reconfiguration of the automated manufacturing system via an open architecture model.</article-title>
                    <source>
						
                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
					          </source>
               <year>2020</year>;<volume>63</volume>:<fpage>101895</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2019.101895</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Qi</surname>
                            <given-names>Q</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin-driven product design, manufacturing and service with big data.</article-title>
                    <source>
						
                        <italic toggle="yes">Int J Adv Manuf Technol.</italic>
					          </source>
               <year>2018</year>;<volume>94</volume>(<issue>9&#x2013;12</issue>):<fpage>3563</fpage>&#x2013;<lpage>3576</lpage>.
                    <pub-id pub-id-type="doi">10.1007/S00170-017-0233-1</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>L</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Just-in-time material distribution method for satellite assembly digital twin shop-floor.</article-title>
                    <source>
						
                        <italic toggle="yes">Computer Integrated Manufacturing System.</italic>
					          </source>
               <year>2020</year>;<volume>26</volume>(<issue>11</issue>):<fpage>2897</fpage>&#x2013;<lpage>2914</lpage>.</mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Nee</surname>
                            <given-names>AYC</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twin enhanced dynamic job-shop scheduling.</article-title>
                    <source>
						
                        <italic toggle="yes">J Manuf Syst.</italic>
					          </source>
               <year>2021</year>;<volume>58</volume>(<issue>Part B)</issue>:<fpage>146</fpage>&#x2013;<lpage>156</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jmsy.2020.04.008</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Fang</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lou</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital-Twin-based job shop scheduling toward smart manufacturing.</article-title>
                    <source>
						
                        <italic toggle="yes">IEEE Trans Industr Inform.</italic>
					          </source>
               <year>2019</year>;<volume>15</volume>(<issue>12</issue>):<fpage>6425</fpage>&#x2013;<lpage>6435</lpage>.
                    <pub-id pub-id-type="doi">10.1109/TII.2019.2938572</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Xu</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cui</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>L</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin-based industrial cloud robotics: framework, control approach and implementation.</article-title>
                    <source>
						
                        <italic toggle="yes">J Manuf Syst.</italic>
					          </source>
               <year>2021</year>;<volume>58</volume>(<issue>Part B</issue>):<fpage>196</fpage>&#x2013;<lpage>209</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jmsy.2020.07.013</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>S&#x00f6;derberg</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>W&#x00e4;rmefjord</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Carlson</surname>
                            <given-names>JS</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Toward a digital twin for real-time geometry assurance in individualized production.</article-title>
                    <source>
						
                        <italic toggle="yes">CIRP Ann Manuf Technol.</italic>
					          </source>
               <year>2017</year>;<volume>66</volume>(<issue>1</issue>):<fpage>137</fpage>&#x2013;<lpage>140</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.cirp.2017.04.038</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Huang</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>A physical model and data-driven hybrid prediction method towards quality assurance for composite components.</article-title>
                    <source>
						
                        <italic toggle="yes">CIRP Ann Manuf Technol.</italic>
					          </source>
               <year>2021</year>;<volume>70</volume>(<issue>1</issue>):<fpage>115</fpage>&#x2013;<lpage>118</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.cirp.2021.04.062</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Luo</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ye</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>A hybrid predictive maintenance approach for CNC machine tool driven by digital twin.</article-title>
                    <source>
						
                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
					          </source>
               <year>2020</year>;<volume>65</volume>:<fpage>101974</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2020.101974</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Bilberg</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Malik</surname>
                            <given-names>AA</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twin driven human-robot collaborative assembly.</article-title>
                    <source>
						
                        <italic toggle="yes">CIRP Ann Manuf Technol.</italic>
					          </source>
               <year>2019</year>;<volume>68</volume>(<issue>1</issue>):<fpage>499</fpage>&#x2013;<lpage>502</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.cirp.2019.04.011</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Malik</surname>
                            <given-names>AA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Brem</surname>
                            <given-names>A</given-names>
                        </name>
					          </person-group>:
                    <article-title>Digital twins for collaborative robots: A case study in human-robot interaction.</article-title>
                    <source>
						
                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
					          </source>
               <year>2021</year>;<volume>68</volume>:<fpage>102092</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2020.102092</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Qi</surname>
                            <given-names>Q</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Enabling technologies and tools for digital twin.</article-title>
                    <source>
						
                        <italic toggle="yes">J Manuf Syst.</italic>
					          </source>
               <year>2019</year>;<volume>58</volume>(<issue>Part B</issue>):<fpage>3</fpage>&#x2013;<lpage>21</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jmsy.2019.10.001</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Y</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Digital twin enhanced fault prediction for the autoclave with insufficient data.</article-title>
                    <source>
						
                        <italic toggle="yes">J Manuf Syst.</italic>
					          </source>
               <year>2021</year>;<volume>60</volume>:<fpage>350</fpage>&#x2013;<lpage>359</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jmsy.2021.05.015</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Uhlmann</surname>
                            <given-names>E</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Barth</surname>
                            <given-names>E</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Seifarth</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <etal/>
					          </person-group>:
                    <article-title>Simulation of metal cutting with cutting fluid using the Finite-Pointset-Method.</article-title>
                    <source>
						
                        <italic toggle="yes">Procedia CIRP.</italic>
					          </source>
               <year>2021</year>;<volume>101</volume>:<fpage>98</fpage>&#x2013;<lpage>101</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procir.2021.02.013</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Digital twin driven prognostics and health management for complex equipment.</article-title>
                    <source>

                        <italic toggle="yes">CIRP Ann Manuf Technol.</italic>
               </source>
               <year>2018</year>;<volume>67</volume>(<issue>1</issue>):<fpage>169</fpage>&#x2013;<lpage>172</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.cirp.2018.04.055</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>
               </person-group>:
                    <article-title>Five-dimension digital twin model and its ten applications.</article-title>
                    <source>

                        <italic toggle="yes">Computer Integrated Manufacturing System.</italic>
               </source>
               <year>2019</year>;<volume>25</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>18</lpage>.
                    <pub-id pub-id-type="doi">10.13196/j.cims.2019.01.001</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Guo</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tie</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Weighted hybrid fusion with rank consistency.</article-title>
                    <source>

                        <italic toggle="yes">Pattern Recognit Lett.</italic>
               </source>
               <year>2020</year>;<volume>138</volume>:<fpage>329</fpage>&#x2013;<lpage>335</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.patrec.2020.07.037</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Guan</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cao</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection.</article-title>
                    <source>

                        <italic toggle="yes">Information Fusion.</italic>
               </source>
               <year>2019</year>;<volume>50</volume>:<fpage>148</fpage>&#x2013;<lpage>157</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.inffus.2018.11.017</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taylor</surname>
                            <given-names>CN</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bishop</surname>
                            <given-names>AN</given-names>
                        </name>
               </person-group>:
                    <article-title>Homogeneous functionals and Bayesian data fusion with unknown correlation.</article-title>
                    <source>

                        <italic toggle="yes">Information Fusion.</italic>
               </source>
               <year>2019</year>;<volume>45</volume>:<fpage>179</fpage>&#x2013;<lpage>189</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.inffus.2018.02.002</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jones</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Snider</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nassehi</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Characterising the digital twin: a systematic literature review.</article-title>
                    <source>

                        <italic toggle="yes">CIRP J Manuf Sci Technol.</italic>
               </source>
               <year>2020</year>;<volume>29</volume>(<issue>Part A</issue>):<fpage>36</fpage>&#x2013;<lpage>52</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.cirpj.2020.02.002</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Boschert</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rosen</surname>
                            <given-names>R</given-names>
                        </name>
               </person-group>:
                    <article-title>Digital twin-the simulation aspect</article-title>. In
                    <italic toggle="yes">Mechatronic Futures.</italic>Springer,<year>2016</year>;<fpage>59</fpage>&#x2013;<lpage>74</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-3-319-32156-1_5</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Qi</surname>
                            <given-names>Q</given-names>
                        </name>
               </person-group>:
                    <article-title>IIHub: An industrial Internet-of-Things hub toward smart manufacturing based on cyber-physical system.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Trans Industr Inform.</italic>
               </source>
               <year>2018</year>;<volume>14</volume>(<issue>5</issue>):<fpage>2271</fpage>&#x2013;<lpage>2280</lpage>.
                    <pub-id pub-id-type="doi">10.1109/TII.2017.2759178</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Uke</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thool</surname>
                            <given-names>R</given-names>
                        </name>
               </person-group>:
                    <article-title>UML Based modeling for data aggregation in secured wireless sensor network.</article-title>
                    <source>

                        <italic toggle="yes">Procedia Comput Sci.</italic>
               </source>
               <year>2016</year>;<volume>78</volume>:<fpage>706</fpage>&#x2013;<lpage>713</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procs.2016.02.120</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brahmi</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hammadi</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aifaoui</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Interoperability of CAD models and SysML specifications for the automated checking of design requirements.</article-title>
                    <source>

                        <italic toggle="yes">Procedia CIRP.</italic>
               </source>
               <year>2021</year>;<volume>100</volume>:<fpage>259</fpage>&#x2013;<lpage>264</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procir.2021.05.064</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Manaa</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Akaichi</surname>
                            <given-names>J</given-names>
                        </name>
               </person-group>:
                    <article-title>Ontology-based modeling and querying of trajectory data.</article-title>
                    <source>

                        <italic toggle="yes">Data &amp; Knowledge Engineering.</italic>
               </source>
               <year>2017</year>;<volume>111</volume>:<fpage>58</fpage>&#x2013;<lpage>72</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.datak.2017.06.005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-37">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Legatiuk</surname>
                            <given-names>D</given-names>
                        </name>
               </person-group>:
                    <article-title>Mathematical modelling by help of category theory: models and relations between them.</article-title>
                    <source>

                        <italic toggle="yes">Mathematics.</italic>
               </source>
               <year>2021</year>;<volume>9</volume>(<issue>16</issue>):<fpage>1946</fpage>.
                    <pub-id pub-id-type="doi">10.3390/math9161946</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jebli</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Belouadha</surname>
                            <given-names>FZ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kabbaj</surname>
                            <given-names>MI</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Prediction of solar energy guided by pearson correlation using machine learning.</article-title>
                    <source>

                        <italic toggle="yes">Energy.</italic>
               </source>
               <year>2021</year>;<volume>224</volume>:<fpage>120109</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.energy.2021.120109</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-39">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sheng</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>An adaptive and opposite 
                        <italic toggle="yes">K</italic>-means operation based memetic algorithm for data clustering.</article-title>
                    <source>

                        <italic toggle="yes">Neurocomputing.</italic>
               </source>
               <year>2021</year>;<volume>437</volume>:<fpage>131</fpage>&#x2013;<lpage>142</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.neucom.2021.01.056</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-40">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tian</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Guo</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Data dependence analysis for defects data of relay protection devices based on Apriori algorithm</article-title>. IEEE Access.<year>2020</year>;<volume>8</volume>:<fpage>120647</fpage>&#x2013;<lpage>120653</lpage>.
                    <pub-id pub-id-type="doi">10.1109/ACCESS.2020.3006345</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-41">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jia</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Xiang</surname>
                            <given-names>Y</given-names>
                        </name>
               </person-group>:
                    <article-title>A review: knowledge reasoning over knowledge graph.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst Appl.</italic>
               </source>
               <year>2020</year>;<volume>141</volume>:<fpage>112948</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2019.112948</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Segreto</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Caggiano</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Teti</surname>
                            <given-names>R</given-names>
                        </name>
               </person-group>:
                    <article-title>Neuro-fuzzy system implementation in multiple sensor monitoring for Ni-Ti alloy machinability evaluation.</article-title>
                    <source>

                        <italic toggle="yes">Procedia CIRP.</italic>
               </source>
               <year>2015</year>;<volume>37</volume>:<fpage>193</fpage>&#x2013;<lpage>198</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procir.2015.08.020</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-43">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Er</surname>
                            <given-names>PV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>An RBF neural network approach towards precision motion system with selective sensor fusion.</article-title>
                    <source>

                        <italic toggle="yes">Neurocomputing.</italic>
               </source>
               <year>2016</year>;<volume>199</volume>:<fpage>31</fpage>&#x2013;<lpage>39</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.neucom.2016.01.093</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-44">
                <label>44</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mourtzis</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vlachou</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Doukas</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Cloud-based adaptive shop-floor scheduling considering machine tool availability</article-title>.
                    <italic toggle="yes">ASME 2015 International Mechanical Engineering Congress and Exposition</italic>. November 13&#x2013;19, Houston, Texas, USA 2015, 57588: V015T19A017.<year>2015</year>.
                    <pub-id pub-id-type="doi">10.1115/IMECE2015-53025</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-45">
                <label>45</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zheng</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Qiu</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Data fusion based multi-rate Kalman filtering with unknown input for on-line estimation of dynamic displacements.</article-title>
                    <source>

                        <italic toggle="yes">Measurement.</italic>
               </source>
               <year>2019</year>;<volume>131</volume>:<fpage>211</fpage>&#x2013;<lpage>218</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.measurement.2018.08.057</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-46">
                <label>46</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>SDMSim: A manufacturing service supply-demand matching simulator under cloud environment.</article-title>
                    <source>

                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
               </source>
               <year>2017</year>;<volume>45</volume>:<fpage>34</fpage>&#x2013;<lpage>46</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2016.07.001</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-47">
                <label>47</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nee</surname>
                            <given-names>AYC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ong</surname>
                            <given-names>SK</given-names>
                        </name>
               </person-group>:
                    <article-title>Virtual and Augmented Reality Applications in Manufacturing.</article-title>
                    <source>

                        <italic toggle="yes">IFAC Proceedings Volumes.</italic>
               </source>
               <year>2013</year>;<volume>46</volume>(<issue>9</issue>):<fpage>15</fpage>&#x2013;<lpage>26</lpage>.
                    <pub-id pub-id-type="doi">10.3182/20130619-3-RU-3018.00637</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-48">
                <label>48</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ong</surname>
                            <given-names>SK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yew</surname>
                            <given-names>AWW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thanigaivel</surname>
                            <given-names>NK</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Augmented reality-assisted robot programming system for industrial applications.</article-title>
                    <source>

                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
               </source>
               <year>2020</year>;<volume>61</volume>:<fpage>101820</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2019.101820</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-49">
                <label>49</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Digital twin workshop: a new paradigm for future workshop</article-title>. Computer Integrated Manufacturing System,<year>2017</year>;<volume>23</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="doi">10.13196/j.cims.2017.01.001</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-50">
                <label>50</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lehmann</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>Z</given-names>
                        </name>
               </person-group>:
                    <article-title>Verification, validation, and accreditation (VV&amp;A)&#x2014;requirements, standards, and trends</article-title>. In
                    <italic toggle="yes">Model Engineering for Simulation.</italic>Academic Press,<year>2019</year>;<fpage>101</fpage>&#x2013;<lpage>121</lpage>.
                    <pub-id pub-id-type="doi">10.1016/B978-0-12-813543-3.00006-8</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-51">
                <label>51</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kleijnen</surname>
                            <given-names>J</given-names>
                        </name>
               </person-group>:
                    <article-title>Regression and Kriging metamodels with their experimental designs in simulation: review.</article-title>
                    <source>

                        <italic toggle="yes">Eur J Oper Res.</italic>
               </source>
               <year>2017</year>;<volume>256</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>16</lpage>.
                    <pub-id pub-id-type="doi">10.2139/ssrn.2627131</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-52">
                <label>52</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wagner</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grothoff</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Epple</surname>
                            <given-names>U</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>The role of the Industry 4.0 asset administration shell and the digital twin during the life cycle of a plant.</article-title>
                    <source>

                        <italic toggle="yes">2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation.</italic>
               </source>September 12&#x2013;15, 2017, Limassol, Cyprus,<year>2017</year>.
                    <pub-id pub-id-type="doi">10.1109/ETFA.2017.8247583</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-53">
                <label>53</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cover</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thomas</surname>
                            <given-names>J</given-names>
                        </name>
               </person-group>:
                    <article-title>Elements of information theory</article-title>. Wiley,<year>2006</year>.
                    <ext-link ext-link-type="uri"
                         xlink:href="https://www.wiley.com/en-us/Elements+of+Information+Theory,+2nd+Edition-p-9780471241959">Reference Source</ext-link>
            </mixed-citation>
            </ref>
            <ref id="ref-54">
                <label>54</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bank&#x00f3;</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abonyi</surname>
                            <given-names>J</given-names>
                        </name>
               </person-group>:
                    <article-title>Correlation based dynamic time warping of multivariate time series.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst Appl.</italic>
               </source>
               <year>2012</year>;<volume>39</volume>(<issue>17</issue>):<fpage>12814</fpage>&#x2013;<lpage>12823</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2012.05.012</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-55">
                <label>55</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>S</given-names>
                        </name>
               </person-group>:
                    <article-title>Method for unstructured data transformation based on XML technology.</article-title>
                    <source>

                        <italic toggle="yes">Comput Sci.</italic>
               </source>
               <year>2017</year>;<volume>44</volume>(<issue>S2</issue>):<fpage>414</fpage>&#x2013;<lpage>417</lpage>.</mixed-citation>
            </ref>
            <ref id="ref-56">
                <label>56</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Malik</surname>
                            <given-names>KR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ahmad</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Farhan</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Big-data: transformation from heterogeneous data to semantically-enriched simplified data.</article-title>
                    <source>

                        <italic toggle="yes">Multimed Tools Appl.</italic>
               </source>
               <year>2016</year>;<volume>75</volume>:<fpage>12727</fpage>&#x2013;<lpage>12747</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s11042-015-2918-5</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-57">
                <label>57</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wen</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Big Data Storage Security</article-title>. In
                    <italic toggle="yes">Big Data Concepts, Theories, and Applications.</italic>Springer,<year>2016</year>.</mixed-citation>
            </ref>
            <ref id="ref-58">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Han</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhu</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Duan</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Multi-source information fusion.</article-title>Tsinghua University Press, 2010.</mixed-citation>
            </ref>
            <ref id="ref-59">
                <label>59</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Meyer</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schretter</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bontempi</surname>
                            <given-names>G</given-names>
                        </name>
               </person-group>:
                    <article-title>Information-theoretic feature selection in microarray data using variable complementarity.</article-title>
                    <source>

                        <italic toggle="yes">IEEE J Sel Top Signal Process.</italic>
               </source>
               <year>2008</year>;<volume>2</volume>(<issue>3</issue>):<fpage>261</fpage>&#x2013;<lpage>274</lpage>.
                    <pub-id pub-id-type="doi">10.1109/JSTSP.2008.923858</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-60">
                <label>60</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Agrawal</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Agrawal</surname>
                            <given-names>J</given-names>
                        </name>
               </person-group>:
                    <article-title>Survey on anomaly detection using data mining techniques.</article-title>
                    <source>

                        <italic toggle="yes">Procedia Comput Sci.</italic>
               </source>
               <year>2015</year>;<volume>60</volume>:<fpage>708</fpage>&#x2013;<lpage>713</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.procs.2015.08.220</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-61">
                <label>61</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gao</surname>
                            <given-names>Q</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Feng</surname>
                            <given-names>Z</given-names>
                        </name>
               </person-group>:
                    <article-title>InteractionNN: a neural network for learning hidden features in sparse prediction.</article-title>
                    <source>

                        <italic toggle="yes">Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence.</italic>
               </source>August 10&#x2013;16, 2019, Macao, China,<year>2019</year>;<fpage>4334</fpage>&#x2013;<lpage>4340</lpage>.
                    <ext-link ext-link-type="uri"
                         xlink:href="https://www.ijcai.org/proceedings/2019/0602.pdf">Reference Source</ext-link>
            </mixed-citation>
            </ref>
            <ref id="ref-62">
                <label>62</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hur</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Knowledge extraction and visualization of digital design process.</article-title>
                    <source>

                        <italic toggle="yes">Expert Syst Appl.</italic>
               </source>
               <year>2018</year>;<volume>92</volume>:<fpage>206</fpage>&#x2013;<lpage>215</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.eswa.2017.09.002</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-63">
                <label>63</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Research on services encapsulation and virtualization access model of machine for cloud manufacturing.</article-title>
                    <source>

                        <italic toggle="yes">J Intell Manuf.</italic>
               </source>
               <year>2017</year>;<volume>28</volume>(<issue>5</issue>):<fpage>1109</fpage>&#x2013;<lpage>1123</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10845-015-1064-2</pub-id>
            </mixed-citation>
            </ref>
            <ref id="ref-64">
                <label>64</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tao</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhao</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
               </person-group>:
                    <article-title>Modeling of manufacturing service supply&#x2013;demand matching hypernetwork in service-oriented manufacturing systems.</article-title>
                    <source>

                        <italic toggle="yes">Robot Comput Integr Manuf.</italic>
               </source>
               <year>2017</year>;<volume>45</volume>:<fpage>59</fpage>&#x2013;<lpage>72</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.rcim.2016.05.007</pub-id>
            </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report26887">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18840.r26887</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Zheng</surname>
                        <given-names>Pai</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26887a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2329-8634</uri>
                </contrib>
                <aff id="r26887a1">
                    <label>1</label>Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>21</day>
                <month>3</month>
            <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Zheng P</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26887"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have fully addressed my concerns and I have no further comments to make.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Smart manufacturing system, product-service systems, engineering design</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26886">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18840.r26886</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Zhang</surname>
                        <given-names>Yicha</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26886a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r26886a1">
                    <label>1</label>Mechanical Engineering and Design, G&#x00e9;nie M&#x00e9;canique et Conception, Campus de Sevenans, Universite de Technologie de Belfort-Montbeliard, Sevenans, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>3</day>
                <month>3</month>
            <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Zhang Y</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26886"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The revised version well responded the concerns in the first round reviewing.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Digital design, planning &amp; manufacturing, additive manufacturing, product-service-system engineering</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26888">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18840.r26888</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Nassehi</surname>
                        <given-names>Aydin</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26888a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3417-3391</uri>
                </contrib>
                <aff id="r26888a1">
                    <label>1</label>Department of Mechanical Engineering, University of Bristol, Bristol, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>9</day>
                <month>2</month>
            <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Nassehi A</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26888"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>With the revisions the paper presents a good overview of digital twin data challenges and solutions.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>AI in Manufacturing, Agent Based Modelling of Distributed System</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26810">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18742.r26810</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Nassehi</surname>
                        <given-names>Aydin</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26810a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3417-3391</uri>
                </contrib>
                <aff id="r26810a1">
                    <label>1</label>Department of Mechanical Engineering, University of Bristol, Bristol, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>22</day>
                <month>10</month>
            <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Nassehi A</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26810"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The paper presents an excellent overview of challenges and potential solutions to the issues related to digital twin data. The authors should consider including references to the 
                <ext-link ext-link-type="uri" xlink:href="https://www.iso.org/standard/75066.html">ISO23247 standard</ext-link> which would complement the presented work very nicely. In particular, Part 3 of the document provides a standardised framework for digital representation of the elements in a digital twin and Part 4 provides the framework to underpin information exchange in Digital Twins.</p>
            <p> </p>
            <p> There are a few other well-cited references which contain digital twin data issues that should be used in the paper to position the work better. These include: 
                <list list-type="bullet">
                    <list-item>
                        <p>Wagner 
                            <italic>et al.</italic> (2017)
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-26810-1">1</xref>
                            </sup> -&#x00a0;Provides an overview of the Asset Administration Shell as an interoperability wrapper that enables data transfer between the physical and digital counterparts in a manner that virtualises the asset and allows various twins to communicate in a similar manner.</p>
                    </list-item>
                    <list-item>
                        <p>Jones 
                            <italic>et al.</italic> (2020)
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-26810-2">2</xref>
                            </sup> - Provides a contextual framework for Virtual-Physical and Physical-Virtual twinning to underpin the various purposes of data transfer and storage in Digital Twins (an article of mine).</p>
                    </list-item>
                    <list-item>
                        <p>Boschert et al. (2016)
                            <sup>
                                <xref ref-type="bibr" rid="rep-ref-26810-3">3</xref>
                            </sup> -&#x00a0;Provides a view of the Digital Twin and its data across the lifecycle of a product and its production system that provides scope for the topic of discussion in the presented paper.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>AI in Manufacturing, Agent Based Modelling of Distributed System</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-26810-1">
                    <label>1</label>
                    <mixed-citation>
                        <person-group person-group-type="author"/>:
                        <article-title>The role of the Industry 4.0 asset administration shell and the digital twin during the life cycle of a plant</article-title>.
                        <source>
                            <italic>2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation (ETFA)</italic>
                        </source>.<year>2017</year>;
                        <elocation-id>10.1109/ETFA.2017.8247583</elocation-id>
                        <pub-id pub-id-type="doi">10.1109/ETFA.2017.8247583</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-26810-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Characterising the Digital Twin: A systematic literature review</article-title>.
                        <source>
                            <italic>CIRP Journal of Manufacturing Science and Technology</italic>
                        </source>.<year>2020</year>;<volume>29</volume>:
                        <elocation-id>10.1016/j.cirpj.2020.02.002</elocation-id>
                  <fpage>36</fpage>-<lpage>52</lpage>
                        <pub-id pub-id-type="doi">10.1016/j.cirpj.2020.02.002</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-26810-3">
                    <label>3</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Digital Twin&#x2014;The Simulation Aspect</article-title>.<year>2016</year>;
                        <elocation-id>10.1007/978-3-319-32156-1_5</elocation-id>
                  <fpage>59</fpage>-<lpage>74</lpage>
                        <pub-id pub-id-type="doi">10.1007/978-3-319-32156-1_5</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment3326-26810">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Zhang</surname>
                            <given-names>Meng</given-names>
                        </name>
                        <aff>Department of Automation, Tsinghua University, China</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>1</month>
               <year>2022</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 4</bold>
                </p>
                <p> The paper presents an excellent overview of challenges and potential solutions to the issues related to digital twin data.</p>
                <p> </p>
                <p> 
                    <bold>Question 1: </bold>The authors should consider including references to the ISO23247 standard which would complement the presented work very nicely. In particular, Part 3 of the document provides a standardised framework for digital representation of the elements in a digital twin and Part 4 provides the framework to underpin information exchange in Digital Twins.</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. The reference to the ISO 23247 has been added in the revised version. The information classification for the manufacturing elements in Part 3 is referenced in the section &#x201c;Composition of digital twin data&#x201d; as follows:</p>
                <p> </p>
                <p> Section &#x201c;Composition of digital twin data&#x201d;, Para. 2&#x00a0;</p>
                <p> 
                    <italic>According to ISO 23247-3 series (Digital Twin manufacturing framework-Part 3: Digital representation of manufacturing elements), physical entity-related data can be grouped in two categories: the first describing the static information concerning the physical entity (e.g. identification, characteristics, schedule) and the second reflecting the dynamic states (e.g. status, location, relationship). These data can help to represent the physical entity in a digital way.</italic>
                </p>
                <p> </p>
                <p> The suggested data formats and protocols for data exchange in Part 4 are cited in the section &#x201c;Digital twin data interaction&#x201d;, and the section &#x201c;Key enabling technologies for digital twin data&#x201d; as follows:</p>
                <p> </p>
                <p> Section &#x201c;Digital twin data interaction&#x201d; 
                    <italic>Then the data can be transferred between the DTD parts through some types of data transmission protocols and formats suggested by the ISO 23247-4 (Digital Twin manufacturing framework- Part 4: Information exchange). &#x00a0;</italic>
                </p>
                <p> </p>
                <p> Section &#x201c;Key enabling technologies for digital twin data&#x201d;, Para. 3 
                    <italic>ISO 23247-4 provides some options for data transmission protocols and formats. For example, the virtual entity can access data of the physical entity by using protocols such as MTConnect, OPC-UA, HTTP and CoAP; and the virtual entity can be connected with the service by using an application interface (i.e. open API) or by employing shared memory. The suggested data formats include XML, JavaScript object notation (JSON), quality information framework (QIF) and so forth.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Question 2: </bold>There are a few other well-cited references which contain digital twin data issues that should be used in the paper to position the work better. These include: Wagner et al. (2017) - Provides an overview of the Asset Administration Shell as an interoperability wrapper that enables data transfer between the physical and digital counterparts in a manner that virtualises the asset and allows various twins to communicate in a similar manner. Jones et al. (2020) - Provides a contextual framework for Virtual-Physical and Physical-Virtual twinning to underpin the various purposes of data transfer and storage in Digital Twins (an article of mine). Boschert et al. (2016) - Provides a view of the Digital Twin and its data across the lifecycle of a product and its production system that provides scope for the topic of discussion in the presented paper.</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. These references have been added in the sections of &#x201c;Principles for digital twin data&#x201d;, &#x201c;Digital twin data gathering&#x201d;, and &#x201c;Key enabling technologies for digital twin data&#x201d; as follows:</p>
                <p> </p>
                <p> Section &#x201c;Principles for digital twin data&#x201d;, Para. 3 
                    <italic>It aims at ensuring the rapid twinning process, namely the process of synchronizing the virtual and physical states, thus to enable parts of the DT to act synchronously 
                        <sup>31</sup>.</italic> Section &#x201c;Digital twin data gathering&#x201d; 
                    <italic>In addition, the DTD would also extract essential information from the existing IT systems, such as product lifecycle management (PLM) system, product data management (PDM) system, manufacturing execution system (MES) and supervisory control and data acquisition (SCADA) system, which store large amounts of data across the lifecycle of the physical entity 
                        <sup>32</sup>.</italic>
                </p>
                <p> </p>
                <p> Section &#x201c;Key enabling technologies for digital twin data&#x201d;, Para. 3 
                    <italic>In addition, the asset administration shell, which virtualizes the asset and enables the access to asset-related data via application interfaces, also provides a way for data exchange 
                        <sup>52</sup>.</italic>
                </p>
                <p> </p>
                <p> 
                    <italic>[31] Jones D, Snider C, Nassehi A, et al.: Characterising the digital twin: a systematic literature review. CIRP Journal of Manufacturing Science and Technology. 2020; 29, Part A: 36-52.</italic>
                </p>
                <p> 
                    <italic>[32] Boschert S, Rosen R: Digital twin-the simulation aspect. In Mechatronic Futures. Springer, 2016.</italic>
                </p>
                <p> 
                    <italic>[52] Wagner C, Grothoff J, Epple U, et al.: The role of the Industry 4.0 asset administration shell and the digital twin during the life cycle of a plant. 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation, September 12-15, 2017, Limassol, Cyprus, 2017, DOI: 10.1109/ETFA.2017.8247583.</italic>
                </p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26809">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18742.r26809</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Zhang</surname>
                        <given-names>Yicha</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26809a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r26809a1">
                    <label>1</label>Mechanical Engineering and Design, G&#x00e9;nie M&#x00e9;canique et Conception, Campus de Sevenans, Universite de Technologie de Belfort-Montbeliard, Sevenans, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>18</day>
                <month>10</month>
            <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Zhang Y</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26809"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This paper tries to propose a new concept, DTD, to describe the related requirements, principles and enabling techs. It is interesting in the attempt to draw a new DT data processing framework or guideline. However, from reviewer's perspective on operation in practice, there are still some places that are unclear and hard to follow.</p>
            <p> </p>
            <p> 1. There is no clear definition about DTD, but more about data classification in the context of DT. It is possible to give a new definition to show the difference of data in the context of DT? The definition difference would focus on more about contents or format or application?</p>
            <p> </p>
            <p> 2. Is there an exact logic for the listing of DTD requirements? Mining is before or after fusion, or both directions are possible?</p>
            <p> </p>
            <p> 3. It is hard to understand 'data interaction'. It is more similar to service as we often encountered in IOT or CPS systems.</p>
            <p> </p>
            <p> 4. What exactly is 'iterative optimization'? As you mentioned in the paper: iterative optimization refers to a cyclic process of &#x201c;data increase - data fusion - information increase&#x201d;, through which, new data is fused with historical data to generate new information continuously. This means to generate new data sets. It is more about data evolution? However, as you also mentioned in the later section of the paper, 'optimization of DTD'. So, how to optimize DTD? What are the objective, process and constraints?</p>
            <p> </p>
            <p> 5. The requirement of data universality seems an ideal solution in every domain. However, DT is usually customized system. Hence, is there any conflict between this requirement with the application? What does it mean by universality, contents or format?</p>
            <p> </p>
            <p> 6. What is fusion data, data of data (well structured and associated) or 'green data' (intermediate data)?</p>
            <p> </p>
            <p> 7. Are the components of DTD organized in a hierarchical way or not?</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Digital design, planning &amp; manufacturing, additive manufacturing, product-service-system engineering</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment3325-26809">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Zhang</surname>
                            <given-names>Meng</given-names>
                        </name>
                        <aff>Department of Automation, Tsinghua University, China</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>1</month>
               <year>2022</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 3</bold>
                </p>
                <p> This paper tries to propose a new concept, DTD, to describe the related requirements, principles and enabling techs. It is interesting in the attempt to draw a new DT data processing framework or guideline. However, from reviewer's perspective on operation in practice, there are still some places that are unclear and hard to follow.</p>
                <p> </p>
                <p> 
                    <bold>Question 1: </bold>There is no clear definition about DTD, but more about data classification in the context of DT. It is possible to give a new definition to show the difference of data in the context of DT? The definition difference would focus on more about contents or format or application?</p>
                <p> 
                    <bold>Response:</bold> Thanks for the comments. A definition about DTD is added in the first paragraph of the section &#x201c;Composition of digital twin data&#x201d; as follows:</p>
                <p> </p>
                <p> As cyber-physical integration is a distinguishing feature of DT, cyber-physical data fusion is a major characteristic of the DTD. It could be reflected on the data contents, as the DTD involves data from both the physical and virtual spaces, and tends to merge these data to generate fusion data for more accurate and comprehensive information. 
                    <italic>DTD refers to a wide spectrum of data closely related to the DT. It focuses equally on data collected from the physical space (e.g. sensor data from equipment, materials, and workers) and data produced by the virtual space (e.g. data mimicked by simulation models, data generated by algorithms embedded in services, and data deduced based on knowledge), and tends to make the data from the two spaces corrected and supplemented by each other through data fusion, to achieve more accurate and comprehensive information for the DT-related applications.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Question 2: </bold>Is there an exact logic for the listing of DTD requirements? Mining is before or after fusion, or both directions are possible?</p>
                <p> 
                    <bold>Response: </bold>The listing of DTD requirements is corresponding to the logic of data gathering, interaction, storage, association, fusion, evolution and usage. There is a one to one correspondence between the DTD requirements and the principles (Fig.2), as well as the principles and methods (Fig.3). As data mining is the basis for data fusion, mining is before fusion. Data fusion is developed based on data with different relations, such as fusing similar data for data correction and fusing complementary data for getting supplementary information. While, data association relations (e.g. causality, similarity, and complementation) are extracted based data mining algorithms, such as Apriori algorithm, K-means, and Pearson correlation analysis, etc.</p>
                <p> </p>
                <p> 
                    <bold>Question 3:</bold>
                    <italic> </italic>It is hard to understand 'data interaction'. It is more similar to service as we often encountered in IOT or CPS systems</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. In the paper, 'data interaction' refers to the transmission and comparison of connection data from any two parts of the DT. It aims at making the data from different sources consistent with each other to enable the coordinated operations. In the section &#x201c;Digital twin data interaction&#x201d;, an example about the data interaction is added as follows:&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>To better illustrate the interaction, take an autoclave (a key equipment for composites curing) as an example. During curing, temperature distribution of the mold within the autoclave is an important quality indicator, since it largely affects the deformation degree of the composite components 
                        <sup>19</sup>. To monitor the component quality, the real mold temperatures are collected by thermocouples. While, a virtual model can also be built to mimic the expected mold temperature distribution by using the CFD simulation. The data interaction between the physical equipment and the virtual model is used to ensure that the actual curing is executed as simulated. In this situation, a major feature for reflecting the mold temperature distribution, i.e. the difference between the maximum and minimum temperatures on the mold upper surface, can be selected as the connection data, leaving out other redundant or unimportant features. Then the collected and simulated mold temperature differences can be exchanged and compared. If their deviation is always within a predefined threshold, the expected composite quality would be achieved. Otherwise, perturbing factors causing the inconsistency need to be found, such as uneven airflow field in the autoclave, deteriorating insulation of the autoclave wall, and abrupt changes of curing conditions. These factors should be eliminated by adjusting parameters of the physical autoclave (e.g. heating speed, air speed, pressure) or by maintaining the equipment, which aims at reducing negative effects caused by the disturbances, to keep the collected data close with the simulated ones.</italic>
                </p>
                <p> </p>
                <p> In the section &#x201c;Key enabling technologies for digital twin data&#x201d;, more technical details for data interaction are also added in the part (2). Besides, as the service is a relatively broad concept, I think the &#x201c;data interaction&#x201d; could also be treated as a kind of service, which exchanges and compares data from different sources to ensure the data consistency. Perhaps it can be encapsulated as a service and then invocated to realize the data exchange and comparison when necessary.</p>
                <p> </p>
                <p> 
                    <italic>
                        <bold>Question 4:</bold> 
               </italic>What exactly is 'iterative optimization'? As you mentioned in the paper: iterative optimization refers to a cyclic process of &#x201c;data increase - data fusion - information increase&#x201d;, through which, new data is fused with historical data to generate new information continuously. This means to generate new data sets. It is more about data evolution? However, as you also mentioned in the later section of the paper, 'optimization of DTD'. So, how to optimize DTD? What are the objective, process and constraints?</p>
                <p> 
                    <bold>Response:</bold> Thanks for the comments. Before the &#x201c;iterative optimization&#x201d;, &#x201c;optimization of DTD&#x201d; is first explained. &#x201c;Optimization of DTD&#x201d; mainly aims at making the data more informative through data fusion. In different situations, the objective, process and constraints would be different. For example, assume that there are data of several variables related to the remaining useful life (RUL) of an equipment component, such as vibrations, speeds and forces collected by sensors, as well as deformation and stress distribution simulated by models. We can fuse the data by using the neural network as the fusion algorithm to obtain more informative data. Inputs of the network are all or part of the RUL-related variables and the output is the merging result, i.e. the fusion data, which would be more relevant with the RUL than any other single variable. The relevance between the fusion data and the RUL can be measured by the mutual information (a measure to quantify the shared information between two variables). Higher relevance means that the data carry more RUL-related information. In this situation, the optimization objective would be maximizing the mutual information between the fusion data and the RUL. Since selecting different subsets of the RUL-related variables as inputs of the neural network would lead to different fusion data, the optimization process mainly needs to search out the best subset from the candidate ones to make the objective optimal. Some constraints would be imposed on the variables to be selected. For example, the relevance between each variable and the RUL would be larger than a lower limit, to eliminate the less correlated ones; and the relevance between any two selected variables would be lower than an upper limit, to reduce unnecessary redundancy.&#x00a0;The iterative optimization refers to the repeated process of the above optimization. When new data from different parts such as the physical entity, virtual model and service are injected into the DTD, a new optimization would begin to decide what new data to be added to the fusion to make the objective more optimal than before. It means that the newly generated fusion data would carry more information. Besides, the &#x201c;iterative optimization&#x201d; is closely related with the data evolution. The logic in the paper is as follows. &#x201c;Iterative optimization&#x201d; is a requirement on DTD. To meet the requirement, the &#x201c;information growth principle&#x201d; proposed in the later section should be observed, to ensure that the &#x201c;data evolution&#x201d; step in the section &#x201c;Methodology for digital twin data&#x201d; would keep the fusion data generated and information carried by the fusion data increased. To better illustrate the iterative optimization, some explanations are added in the &#x201c;Requirements on iterative optimization&#x201d; in Section 1 as follows:&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>In the cyclic process, each optimization, triggered by the injection of new data, aims at adding appropriate data into the fusion to increase the amount of valuable information carried by the fusion data, while being subject to possible constraints on the data to be merged, such as keeping the data redundancy lower than an upper bound.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Question 5: </bold>The requirement of data universality seems an ideal solution in every domain. However, DT is usually customized system. Hence, is there any conflict between this requirement with the application? What does it mean by universality, contents or format?</p>
                <p> 
                    <bold>Response: </bold>It could not be conflicted. As the DT system is customized, it would develop different sensors, models and services for different applications, which bring a wide variety of data with different types and contents. Although it is hard to make the data unified in every aspect, we can still make them transformed into some unified formats (e.g. XML) for data storing and sharing, and transmitted by some universal interfaces and communication protocols for easier communication. In the paper, universality mainly refers to unified data format, interface and communication protocol.</p>
                <p> </p>
                <p> 
                    <bold>Question 6: </bold>What is fusion data, data of data (well structured and associated) or 'green data' (intermediate data)?</p>
                <p> 
                    <bold>Response: </bold>Fusion data is a merging result of data from different sources by using data fusion algorithms. Take the healthy monitoring for an equipment as an example. From the physical space, data such as the vibrations, speeds, forces and pressures can be collected by sensors. While, from the virtual space, data such as deformation, stress distribution, and airflow distribution can be simulated by virtual models. Since all the data would be relevant with the equipment conditions, a merging result of these data (i.e. the fusion data) can be used to support a more accurate monitoring, as it involves more comprehensive information, compared with data from a single source. The fusion data is obtained through processing the original data from the sensors, simulations or other sources via data fusion algorithms, such as the weighting method, neural network, Bayesian method, etc. To clarify it, some explanations are added in the second to last paragraph in the section &#x201c;Composition of digital twin data&#x201d; as follows:&#x00a0;</p>
                <p> </p>
                <p> 
                    <italic>For example, the healthy condition of an equipment can be reflected by various sensor data such as the vibrations, speeds, forces and pressures, as well as the simulated data such as deformation, stress distribution, and airflow distribution. The fusion data is a merging result of the above data, which is obtained by processing the data from different sources via data fusion algorithms, such as the weighting method 
                        <sup>28</sup>, neural network 
                        <sup>29</sup>, and Bayesian method 
                        <sup>30</sup>. As a result, the fusion data would be more informative for the condition monitoring.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Question 7: </bold>Are the components of DTD organized in a hierarchical way or not?</p>
                <p> 
                    <bold>Response: </bold>The components of DTD can be organized in a hierarchical way. In the previous work of our team [1], the hierarchical way for data organization has been preliminarily discussed, combining with the edge computing, fog computing and cloud computing. According to it, the original data from the physical entity, virtual entity and service belongs to the edge level. In this level, the incoming data can be preprocessed (e.g. filtering, redundancy removing, missing data filling) and then used to identify some potential threat patterns (e.g. equipment fault). The edge level can provide timely responses to the threats, as it generally has low latency. Then the preprocessed data would be transferred to the fog level, where more complex computing tasks are enabled to support the data feature extraction, data relation mining and fusion data generation. In this level, the fusion data can be employed to provide more comprehensive information for monitoring, prediction and optimization. The cloud level stores amounts of historical data from the edge and fog levels and supports the large-scale computation. It helps to mine more domain knowledge from the mass data to support long-term prediction and planning. In the paper, the hierarchical organization way for the DTD is not particularly discussed. More technical details about the DTD organization would be explored in our future work.</p>
                <p> </p>
                <p> [1] Qi Q., Zhao D., Liao T. W., Tao F., Modeling of cyber-physical systems and digital twin based on edge computing, fog computing and cloud computing towards smart manufacturing, ASME 2018 13
                    <sup>th</sup> International Manufacturing Science and Engineering Conference, Texas, US, 2018, V001T05A018.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26808">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18742.r26808</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Zheng</surname>
                        <given-names>Pai</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26808a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2329-8634</uri>
                </contrib>
                <aff id="r26808a1">
                    <label>1</label>Department of Industrial and Systems Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>11</day>
                <month>10</month>
            <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Zheng P</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26808"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this paper, the authors provide a future prospect and couple of requirements of data applications in the digital twin domain from structure, principle, and methodology views. With the illustration, the potential coming research directions are summarized and pointed out. This topic and corresponding learning areas are promising and worth investigating deeper. &#x00a0;</p>
            <p> </p>
            <p> Here are some minor issues that may be considered for improving this paper: 
                <list list-type="order">
                    <list-item>
                        <p>In the third requirement of digital twin data, could you explain more on the seamless mean? Such requirement is too blurry and what kind of performance do you want to achieve?</p>
                    </list-item>
                    <list-item>
                        <p>In the composition of digital twin data section, the explanation of the connection data is provided theoretically. However, it lacks the intuitive example demonstration, therefore, could more cases or concrete technologies be provided to help better understanding?</p>
                    </list-item>
                    <list-item>
                        <p>In the principle part, could more explanation of figure 2 be provided? What&#x2019;s the serial numbers and arrows in the figure mean? What do the different levels stand for in the figure?</p>
                    </list-item>
                    <list-item>
                        <p>In figure 3, the quality of figures and expression way may be refined.</p>
                    </list-item>
                    <list-item>
                        <p>In key enabling technologies, some technologies concepts are adopted to enhance the development of DTD. However, could the authors provide more technical/algorithm details to achieve the corresponding goal/proposed performances? &#x00a0;</p>
                    </list-item>
                </list>
            </p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Partly</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Smart manufacturing system, product-service systems, engineering design</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment3324-26808">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Zhang</surname>
                            <given-names>Meng</given-names>
                        </name>
                        <aff>Department of Automation, Tsinghua University, China</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>1</month>
               <year>2022</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 2</bold>
                </p>
                <p> In this paper, the authors provide a future prospect and couple of requirements of data applications in the digital twin domain from structure, principle, and methodology views. With the illustration, the potential coming research directions are summarized and pointed out. This topic and corresponding learning areas are promising and worth investigating deeper. Here are some minor issues that may be considered for improving this paper:</p>
                <p> </p>
                <p> 
                    <bold>Question 1: </bold>In the third requirement of digital twin data, could you explain more on the seamless mean? Such requirement is too blurry and what kind of performance do you want to achieve?</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. The &#x201c;seamless&#x201d; is used to express that data carrying different information and coming from different sources are fused to maximize improving the data quality, such as improving data accuracy, reducing data uncertainty, and increasing the amount of information carried by data. For example, to improve accuracy of the simulated data, the fusion would aim at reducing the root mean squared error (RSME) between the simulated and measured data. For reducing data uncertainty, it would minimize the information entropy of the data. To improve the amount of information carried by data, taking product quality-related information as an example, the fusion will aim at improving the relevancy between the data and the quality indicator.</p>
                <p> </p>
                <p> To clarify it, some words are added to this requirement (the fifth requirement in the revised version) as follows:</p>
                <p> </p>
                <p> It would bring lots of benefits, such as reducing information entropy of the sensor data (i.e. reducing data uncertainty), reducing root mean squared error (RMSE) between the simulated data and measured data (i.e. improving data accuracy), and improving relevancy between the data and a certain target indicator, such as product quality, remaining useful life of a key component, and equipment healthy condition (i.e. increasing the amount of information related to the indicator).</p>
                <p> </p>
                <p> Besides, as the &#x201c;seamless&#x201d; may seem blurry to express the above content, this word is deleted in the revised version.</p>
                <p> </p>
                <p> 
                    <bold>Question 2: </bold>In the composition of digital twin data section, the explanation of the connection data is provided theoretically. However, it lacks the intuitive example demonstration, therefore, could more cases or concrete technologies be provided to help better understanding?</p>
                <p> 
                    <bold>Response:</bold> Thanks for the comments. In the section &#x201c;Composition of digital twin data&#x201d;, a simple example is added in the last paragraph as follows:</p>
                <p> </p>
                <p> For example, the real temperature collected by sensors and the expected temperature simulated by virtual models can be selected as the connection data between the physical entity-related data and virtual entity-related data. If their deviation exceeds a predefined threshold, perturbing factors (e.g. manual interferences, environment changes) causing the inconsistency would be found and eliminated, which helps to restore the data consistency.</p>
                <p> </p>
                <p> Then a more detailed example about the connection data of an autoclave is provided in the section &#x201c;Digital twin data interaction&#x201d; as follows:</p>
                <p> </p>
                <p> To better illustrate the interaction, take an autoclave (a key equipment for composites curing) as an example. During curing, temperature distribution of the mold within the autoclave is an important quality indicator, since it largely affects the deformation degree of the composite components 
                    <sup>19</sup>. To monitor the component quality, the real mold temperatures are collected by thermocouples. While, a virtual model can also be built to mimic the expected mold temperature distribution by using the CFD simulation. The data interaction between the physical equipment and the virtual model is used to ensure that the actual curing is executed as simulated. In this situation, a major feature for reflecting the mold temperature distribution, i.e. the difference between the maximum and minimum temperatures on the mold upper surface, can be selected as the connection data, leaving out other redundant or unimportant features. Then the collected and simulated mold temperature differences can be exchanged and compared. If their deviation is always within a predefined threshold, the expected composite quality would be achieved. Otherwise, perturbing factors causing the inconsistency need to be found, such as uneven airflow field in the autoclave, deteriorating insulation of the autoclave wall, and abrupt changes of curing conditions. These factors should be eliminated by adjusting parameters of the physical autoclave (e.g. heating speed, air speed, pressure) or by maintaining the equipment, which aims at reducing negative effects caused by the disturbances, to keep the collected data close with the simulated ones.</p>
                <p> </p>
                <p> In addition, more related technical details are added in the part (2) of the section &#x201c;Key enabling technologies for digital twin data&#x201d; as follows:</p>
                <p> </p>
                <p> (2) With respect to data interaction, data transmission technology is required to exchange connection data among different parts of DTD. ISO 23247-4 provides some options for data transmission protocols and formats. For example, the virtual entity can access data of the physical entity through protocols such as MTConnect, OPC-UA, HTTP and CoAP; and the virtual entity can be connected with the service by using application interfaces (i.e. open API) or by employing shared memory. The suggested data formats include XML, JavaScript object notation (JSON), quality information framework (QIF) and so forth. The asset administration shell, which virtualizes the asset and enables the access to asset-related data via application interfaces, also provides a way for data exchange 
                    <sup>52</sup>. Then to reduce the redundancy in the exchanged data, measures such as Pearson correlation coefficient 
                    <sup>38</sup> and mutual information 
                    <sup>53</sup> can be employed to judge the overlapping of the data. Besides, Euclidean distance, Manhattan distance and dynamic time warping 
                    <sup>54</sup>,
                    <italic> etc</italic>. can be used to evaluate the discrepancy between data from different parts of the DT to evaluate their consistency.</p>
                <p> </p>
                <p> 
                    <bold>Question 3: </bold>In the principle part, could more explanation of figure 2 be provided? What&#x2019;s the serial numbers and arrows in the figure mean? What do the different levels stand for in the figure?</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. More explanation for Fig. 2 is provided in first paragraph of the section &#x201c;Principles for digital twin data&#x201d; as follows, which explains the different levels in the figure. As in Figure 2, the outer layer shows the data requirements in seven aspects as raised in Section 1. While, the corresponding principles, which should be observed for the sake of better fulfilling the requirements, are given one by one in the inner layer. As arrows in the figure have no practical significance, they are deleted. In addition, serial numbers in the figure are used to distinguish the principles, since in the previous version, these principles are mentioned as &#x201c;Principle (1), Principle (2)&#x2026;&#x201d; in Fig.3. But as in this version, such statements are deleted, the serial numbers in this figure are deleted accordingly.</p>
                <p> </p>
                <p> 
                    <bold>Question 4:</bold> In figure 3, the quality of figures and expression way may be refined.</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. The Fig.3 has been refined.</p>
                <p> </p>
                <p> 
                    <bold>Question 5: </bold>In key enabling technologies, some technologies concepts are adopted to enhance the development of DTD. However, could the authors provide more technical/algorithm details to achieve the corresponding goal/proposed performances?</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. More technical and algorithm details have been added in the section &#x201c;Key enabling technologies for digital twin data&#x201d; in the revised version.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report26807">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/digitaltwin.18742.r26807</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sun</surname>
                        <given-names>Huibin</given-names>
                    </name>
                    <xref ref-type="aff" rid="r26807a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-6767-1912</uri>
                </contrib>
                <aff id="r26807a1">
                    <label>1</label>School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an, China</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>10</month>
            <year>2021</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Sun H</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport26807"
                          related-article-type="peer-reviewed-article"
                          xlink:href="10.12688/digitaltwin.17467.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>Data plays critical roles in constructing virtual models, building cyber-physical connections, and executing intelligent operations in digital twins. This paper explores some basic principles and methods for digital twin data gathering, storage, interaction, association, fusion, evolution and servitization, as well as the key enabling technologies. The theoretical underpinnings for DTD proposed in this paper, are imperative for the further promotion and application of DT.</p>
            <p> </p>
            <p> 1. The barriers or challenges to digital twin data could be addressed.</p>
            <p> </p>
            <p> 2. If possible, an example or a scenario could be used to enhance the paper&#x2019;s contributions.</p>
            <p>Is the rationale for developing the new method (or application) clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the method technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions about the method and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>If any results are presented, are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>No source data required</p>
            <p>Are sufficient details provided to allow replication of the method development and its use by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Digital twin</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
        <sub-article article-type="response" id="comment3323-26807">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Zhang</surname>
                            <given-names>Meng</given-names>
                        </name>
                        <aff>Department of Automation, Tsinghua University, China</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>28</day>
                    <month>1</month>
               <year>2022</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer 1</bold>
                </p>
                <p> Data plays critical roles in constructing virtual models, building cyber-physical connections, and executing intelligent operations in digital twins. This paper explores some basic principles and methods for digital twin data gathering, storage, interaction, association, fusion, evolution and servitization, as well as the key enabling technologies. The theoretical underpinnings for DTD proposed in this paper, are imperative for the further promotion and application of DT.</p>
                <p> </p>
                <p> 
                    <bold>Question 1: </bold>The barriers or challenges to digital twin data could be addressed.</p>
                <p> 
                    <bold>Response:</bold> Thanks for the comments. The challenges to digital twin data are discussed in the last section as follows:</p>
                <p> </p>
                <p> 
                    <italic>For the DTD, challenges ahead are mainly concerning simulation data generation and cyber-physical data fusion. Firstly, since DTD emphasizes on the supplementary role of simulation data, how to build virtual models to generate the simulation data accurately remains a fundamental challenge. The commonly used models include data model and physical model. However, the former relies heavily on the modeling data and it usually has poor interpretability. Whereas, although the latter is established based on physical mechanisms and constraints, it tends to be over-simplistic sometimes and thus leads to non-ignorable deviations with the practice. The hybrid modeling method that merges the data model and the physical model would be a promising way for addressing this challenge. Secondly, even though cyber-physical integration is a distinguishing feature of DT, cyber-physical data fusion remains to be a challenge. As data from the physical and virtual spaces may have their own limitations and flaws, how to ensure that the fusion result would always be superior to data from either space is a concerned problem.</italic>
                </p>
                <p> </p>
                <p> 
                    <bold>Question 2: </bold>If possible, an example or a scenario could be used to enhance the paper&#x2019;s contributions.</p>
                <p> 
                    <bold>Response: </bold>Thanks for the comments. This paper mainly provides a theoretical method for DTD gathering and processing, aiming at helping more DT researchers to incorporate DTD into their works. The related research is still at the primary stage. Based on this, more applications examples and scenarios would be provided in our future works, such as examples of DTD applications in product design optimization, product quality control and equipment prognosis. They will further validate and refine the proposed methods.</p>
            </body>
        </sub-article>
    </sub-article>
</article>