ANALISA DAN PREDIKSI IKLAN LOWONGAN KERJA PALSU DENGAN METODE NATURAL LANGUAGE PROGRAMING DAN MACHINE LEARNING

Hary Sabita, Fitria Fitria, Riko Herwanto

Abstract


This research was conducted using the data provided by Kaggle. This data contains features that describe job vacancies. This study used location-based data in the US, which covered 60% of all data. Job vacancies that are posted are categorized as real or fake. This research was conducted by following five stages, namely: defining the problem, collecting data, cleaning data (exploration and pre-processing) and modeling. The evaluation and validation models use Naïve Bayes as a baseline model and Small Group Discussion as end model. For the Naïve Bayes model, an accuracy value of 0.971 and an F1-score of 0.743 is obtained. While the Stochastic Gradient Descent obtained an accuracy value of 0.977 and an F1-score of 0.81. These final results indicate that SGD performs slightly better than Naïve Bayes.

KeywordsNLP, Machine Learning, Naïve Bayes, SGD, Fake Jobs


Full Text:

PDF


DOI: https://doi.org/10.30873/ji.v21i1.2865


 Jurnal Informatika is abstracting and indexing in the following databases:

 


JURNAL INFORMATIKA

 

Dikelola Oleh: Lembaga Penelitian dan Pengabdian kepada Masyarakat (LPPM)

Diterbitkan Oleh: Institut Informatika dan Bisnis Darmajaya
Alamat: Jl. Z.A. Pagar Alam No. 93 Gedong Meneng, Bandar Lampung Lampung
Website: jurnal.darmajaya.ac.id

Email: lp4mjurin@gmail.com


View Jurnal Informatika StatCounter

Creative Commons License

Jurnal Informatika is licensed under a Creative Commons Attribution 4.0 International License.