Generation and Evaluation of Tabular Data in Different Domains Using Gans

Marecha, Persevearance and Ye, Lu (2023) Generation and Evaluation of Tabular Data in Different Domains Using Gans. Asian Journal of Research in Computer Science, 16 (1). pp. 15-27. ISSN 2581-8260

[thumbnail of Marecha+and+Ye1612023AJRCOS99992.pdf] Text
Marecha+and+Ye1612023AJRCOS99992.pdf - Published Version

Download (502kB)

Abstract

Deep learning techniques like Generative Adversarial Networks (GANs) provide solutions in many domains where real data needs to be kept private. Synthesizing tabular data is difficult because of its high complexity. Tabular data usually contains a mixture of discrete and continuous data, which is not an easy model to build. The contributions made in this paper include training and generating data with the original Vanilla Gan, then CGan and WGan-Gp and WCGan-Gp which performs better than the former. The Adult Income Census dataset mainly focuses on predicting whether income exceeds 50,000 per year based on census data, then comparing the accuracy of machine learning models and calculating the F1 scores. Then the use of TimeGan on the stock dataset, comparing synthetic data vs real data. This paper will explore the use of GANs for generating and evaluating tabular data in different domains.

Item Type: Article
Subjects: ScienceOpen Library > Computer Science
Depositing User: Managing Editor
Date Deposited: 25 May 2023 12:34
Last Modified: 24 Oct 2024 03:47
URI: http://scholar.researcherseuropeans.com/id/eprint/1366

Actions (login required)

View Item
View Item