TechRxiv
paper_MHI22.pdf (6.27 MB)
Download file

Scenario-Driven Data Generation with Experimentable Digital Twins

Download (6.27 MB)
preprint
posted on 24.05.2022, 20:58 authored by Osama MaqboolOsama Maqbool
Synthetic data is an indispensable supplement to the difficult-to-acquire real data in order to meet the substantial demand by machine learning based systems. Data playing the key role in machine learning models, its objective and maintainable quality metrics are vital for quality assurance of the whole system. This paper introduces a systematic and domain-neutral methodology based on formalized scenario variation and experimental digital twins for the generation of synthetic data. The methodology uses human-readable scenarios and semantically meaningful parameter variations to describe possible entities, actions and events to be simulated, whereas experimental digital twins bring the scenarios to life by the integration of various domains of a system such as mechanics, sensors, actuators and communication under one platform that can be simulated as a whole. The scenario description and digital twin simulation is carried out iteratively to derive the optimal distribution of synthetic data. Thus scenarios and experimentable digital twins can together serve as mediums to systematically cover diverse application scenarios, test dangerous situations and find faults within a system.

History

Email Address of Submitting Author

maqbool@mmi.rwth-aachen.de

Submitting Author's Institution

Institute for Man-Machine-Interaction

Submitting Author's Country

Germany

Usage metrics

Licence

Exports