Essential Maintenance: All Authorea-powered sites will be offline 9am-10am EDT Tuesday 28 May
and 11pm-1am EDT Tuesday 28-Wednesday 29 May. We apologise for any inconvenience.

loading page

C3-TGAN- Controllable Tabular Data Synthesis with Explicit Correlations and Property Constraints
  • +4
  • Peiyi Han ,
  • Wen Xu ,
  • Wanyu Lin ,
  • Jiahao Cao ,
  • Chuanyi Liu ,
  • Shaoming Duan ,
  • Haifeng Zhu
Peiyi Han
Harbin Institute of Technology (Shenzhen)

Corresponding Author:[email protected]

Author Profile
Wanyu Lin
Author Profile
Jiahao Cao
Author Profile
Chuanyi Liu
Author Profile
Shaoming Duan
Author Profile
Haifeng Zhu
Author Profile


GAN-based tabular synthesis methods have made important progress in generating sophisticated synthetic data for privacypreserving data publishing. However, existing methods do not consider explicit attribute correlations and property constraints on tabular data synthesis, which may lead to inaccurate data analysis results. In this paper, we propose a Controllable tabular data synthesis framework with explicit Correlations and property Constraints, namely C3-TGAN. It leverages Bayesian networks to learn explicit correlations among attributes and model them as control vectors. Such control vectors can guide C3-TGAN to generate synthetic data with complicated property constraints. By conducting comprehensive experiments on 14 publicly available benchmark datasets, we showcase C3-TGAN's remarkable performance advantage over state-of-the-art methods for synthesizing tabular data.