- Turkish Journal of Science and Technology
- Vol: 17 Issue: 1
- Efficient Text Classification with Deep Learning on Imbalanced Data Improved with Better Distributio...
Efficient Text Classification with Deep Learning on Imbalanced Data Improved with Better Distribution
Authors : Beytullah Yildiz
Pages : 89-98
Doi:10.55525/tjst.1068940
View : 17 | Download : 8
Publication Date : 2022-03-20
Article Type : Research
Abstract :Technological developments and the widespread use of the internet cause the data produced on a daily basis to increase exponentially. An important part of this deluge of data is text data from applications such as social media, communication tools, customer service. The processing of this large amount of text data needs automation. Significant successes have been achieved in text processing recently. Especially with deep learning applications, text classification performance has become quite satisfactory. In this study, we proposed an innovative data distribution algorithm that reduces the data imbalance problem to further increase the text classification success. Experiment results show that there is an improvement of approximately 3.5% in classification accuracy and over 3 in F1 score with the algorithm that optimizes the data distribution.Keywords : Text classification, Data Imbalance, Data Distribution, Deep learning, Word Embedding.