Analisis Active learning SVM berbasis Margin Sampling pada Sentimen YouTube MBG

Authors

  • Minarwati STMIK El Rahma Author
  • Alvian Putra Hardiadi STMIK El Rahma Author

DOI:

https://doi.org/10.61805/fahma.v24i2.203

Keywords:

Active Learning, SVM, Sentiment Analysis, YouTube, MBG

Abstract

The Free Nutritious Meal Program (MBG) has generated extensive public discussion on YouTube, making sentiment analysis a valuable tool for understanding public perceptions. This study investigates the effectiveness of Support Vector Machine (SVM)-based Active Learning (AL) using a Margin Sampling strategy and a Human-in-the-Loop (HITL) framework, in which researchers acted as oracles following annotation guidelines for three sentiment classes: Negative, Neutral, and Positive. The dataset comprised 999 labeled Indonesian YouTube comments, split into 599 initial training samples, 200 oracle pool samples, and 200 fixed test samples. Texts were represented using TF-IDF features with unigrams and bigrams (max_features = 5,000). Three approaches were compared: AL-HITL, Simulated Active Learning, and Random Sampling. After 15 iterations involving 50 additional labeled samples, AL-HITL achieved the highest macro F1-score of 0.5389, outperforming Simulated AL (0.4950) and Random Sampling (0.4977). A skip mechanism with a 33.3% skip rate reduced negative learning, whereas the other approaches experienced performance declines from the baseline score of 0.5154. Positive sentiment, the minority class (20%), yielded the lowest F1-score (0.4286), while the majority negative class (40.7%) achieved the highest (0.6098). These findings provide preliminary empirical evidence on the potential and challenges of Margin Sampling-based Active Learning for Indonesian YouTube sentiment analysis.

Downloads

Download data is not yet available.

References

A. Sitanggang, Y. Umaidah, R. I. Adam, U. S. Karawang, and T. Timur, "ANALISIS SENTIMEN MASYARAKAT TERHADAP PROGRAM MAKAN SIANG GRATIS PADA MEDIA," JITET (Jurnal Informatika dan Teknik Elektro Terapan), vol. 12, no. 3, 2024, doi: 10.23960/jitet.v12i3.4902. https://doi.org/10.23960/jitet.v12i3.4902

T. A. Aziz, I. Ismayadi, and B. Budiman, "Analisis Sentimen Terhadap Program Makan Siang Gratis pada Media Sosial X Menggunakan Logistic Regression dan SVM," In Search, vol. 24, no. 1, pp. 18-28, 2025, doi: 10.37278/insearch.v24i1.1238. https://doi.org/10.37278/insearch.v24i1.1238

Saurabh Chakraborty, Rutika Zambre, Hrishikesh Sharma, Tapash Gaikwad, and Dr. Nitin Janwe, "Sentiment Analysis of YouTube Comments," International Journal of Advanced Research in Science, Communication and Technology, pp. 194-198, 2024, doi: 10.48175/ijarsct-18131. https://doi.org/10.48175/IJARSCT-18131

M. Liebenlito, N. Inayah, E. Choerunnisa, T. E. Sutanto, and S. Inna, "Active Learning on Indonesian Twitter Sentiment Analysis Using Uncertainty Sampling," Journal of Applied Data Sciences, vol. 5, no. 1, pp. 114-121, 2024, doi: 10.47738/jads.v5i1.144. https://doi.org/10.47738/jads.v5i1.144

S. M. Daudpota, S. Hassan, Y. Alkhurayyif, A. S. Alqahtani, and M. H. Aziz, "Active Learning Strategies for Textual Dataset-Automatic Labelling," Computers, Materials & Continua, 2023, doi: 10.32604/cmc.2023.034157. https://doi.org/10.32604/cmc.2023.034157

N. W. S. Saraswati, I. K. G. D. Putra, M. Sudarma, and I. M. Sukarsa, "Enhance Sentiment Analysis in Big Data Tourism using Hybrid Lexicon and Active Learning Support Vector Machine," Bulletin of Electrical Engineering and Informatics, vol. 13, no. 5, pp. 3663-3674, 2024, doi: 10.11591/eei.v13i5.7807. https://doi.org/10.11591/eei.v13i5.7807

Settles, B. (2010). Computer sciences active learning literature survey. Laporan Teknis, University of Wisconsin-Madison. https://burrsettles.com/pub/settles.activelearning.pdf

Sekar Cinta Amaria and Nurtriana Hidayati, "ANALISIS PROGRAM MAKAN BERGIZI GRATIS DENGAN SUPPORT VECTOR MACHINE (SVM) PADA APLIKASI X," JSiI (Jurnal Sistem Informasi), vol. 12, no. 2, pp. 16-24, 2025, doi: 10.30656/jsii.v12i2.10708. https://doi.org/10.30656/jsii.v12i2.10708

M. Venugopalan and D. Gupta, "A Reinforced Active Learning Approach for Optimal Sampling in Aspect Term Extraction for Sentiment Analysis," Expert Syst. Appl., vol. 209, no. May, p. 118228, 2022, doi: 10.1016/j.eswa.2022.118228. https://doi.org/10.1016/j.eswa.2022.118228

J. Z. Bengar, "Reducing Label Effort : Self-Supervised meets Active Learning," IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 1631-1639, 2021, doi: 10.1109/iccvw54120.2021.00188. https://doi.org/10.1109/ICCVW54120.2021.00188

D. Kartchner, D. N. An, W. Ren, C. Zhang, and C. S. Mitchell, "Rule-Enhanced Active Learning for Semi-Automated Weak Supervision," AI, vol. 3, no. 1, pp. 211-228, 2022, doi: 10.3390/ai3010013. https://doi.org/10.3390/ai3010013

M. Learning, "Machine Learning as an Experimental Science," Mach. Learn., vol. 3, no. 1, pp. 5-8, 1988, doi: 10.1023/A:1022623814640. https://doi.org/10.1023/A:1022623814640

J. Kamiri and G. Mariga, "Research Methods in Machine Learning: A Content Analysis," International Journal of Computer and Information Technology, vol. 10, no. 2, 2021, doi: 10.24203/ijcit.v10i2.79. https://doi.org/10.24203/ijcit.v10i2.79

A. Andhini, F. N. Handayani, I. Diasih, and Nurmalitasari, "Analisis Sentimen Opini Publik pada Channel Youtube Mata Najwa Menggunakan Metode SVM Universitas Duta Bangsa Surakarta , Indonesia Perkembangan pesat media sosial dan platform video seperti YouTube telah menciptakan ruang baru bagi masyarakat untuk menge," Jurnal Teknik Informatika dan Teknologi Informasi, vol. 5, no. 2, 2025. https://doi.org/10.55606/jutiti.v5i2.5426

M. Afnan Ul Haque, A. Rahman, and M. M. A. Hashem, "Sentiment Analysis in Low-Resource Bangla Text Using Active Learning," 2021 5th International Conference on Electrical Information and Communication Technology, EICT 2021, no. December, pp. 17-19, 2021, doi: 10.1109/EICT54103.2021.9733711. https://doi.org/10.1109/EICT54103.2021.9733711

G. Yu et al., "Learning by Querying Subexamples CMAL : Cost-effective Multi-label Active Learning by Querying Subexamples," IEEE Trans. Knowl. Data Eng., vol. 34, no. 5, pp. 2091-2104, 2022, doi: 10.1109/tkde.2020.3003899. https://doi.org/10.1109/TKDE.2020.3003899

Published

03-06-2026

Issue

Section

Articles

How to Cite

Analisis Active learning SVM berbasis Margin Sampling pada Sentimen YouTube MBG. (2026). Jurnal Informatika Komputer, Bisnis Dan Manajemen, 24(2), 123-135. https://doi.org/10.61805/fahma.v24i2.203

Similar Articles

1-10 of 44

You may also start an advanced similarity search for this article.