Marecha, Persevearance and Ye, Lu (2023) Generation and Evaluation of Tabular Data in Different Domains Using Gans. Asian Journal of Research in Computer Science, 16 (1). pp. 15-27. ISSN 2581-8260
Marecha+and+Ye1612023AJRCOS99992.pdf - Published Version
Download (502kB)
Abstract
Deep learning techniques like Generative Adversarial Networks (GANs) provide solutions in many domains where real data needs to be kept private. Synthesizing tabular data is difficult because of its high complexity. Tabular data usually contains a mixture of discrete and continuous data, which is not an easy model to build. The contributions made in this paper include training and generating data with the original Vanilla Gan, then CGan and WGan-Gp and WCGan-Gp which performs better than the former. The Adult Income Census dataset mainly focuses on predicting whether income exceeds 50,000 per year based on census data, then comparing the accuracy of machine learning models and calculating the F1 scores. Then the use of TimeGan on the stock dataset, comparing synthetic data vs real data. This paper will explore the use of GANs for generating and evaluating tabular data in different domains.
Item Type: | Article |
---|---|
Subjects: | ScienceOpen Library > Computer Science |
Depositing User: | Managing Editor |
Date Deposited: | 25 May 2023 12:34 |
Last Modified: | 24 Oct 2024 03:47 |
URI: | http://scholar.researcherseuropeans.com/id/eprint/1366 |