Analisis Sentimen Pengguna Media Sosial X terhadap Perubahan Harga Bitcoin: Pendekatan Machine Learning

Authors

DOI:

https://doi.org/10.24002/konstelasi.v4i1.9043

Keywords:

analisis sentimen, Bitcoin, machine learning

Abstract

Media sosial X menjadi gudang data yang dapat dimanfaatkan untuk memperoleh wawasan mengenai sentimen publik dan potensi yang berdampak pada harga cryptocurrency. Dalam beberapa tahun terakhir, bitcoin menjadi pusat perhatian sebagai bentuk investasi yang menarik bagi para pelaku pasar.  Bitcoin (BTC) sering kali ditandai dengan tingkat volatilitas yang tinggi dan harganya menunjukkan kenaikan dan penurunan yang ekstrem dalam jangka waktu yang singkat. Dengan menganalisis tweet pengguna media sosial X, penelitian ini bertujuan untuk meneliti hubungan antara sentimen yang diungkapkan oleh pengguna media sosial X dan perubahan harga bitcoin. Data set yang digunakan dalam penelitian ini yaitu dataset pelatihan model yang terdapat di laman Kaggle dan dataset pengujian yang dikumpulkan dari tweet media sosial X berdasarkan tanggal terjadinya golden cross dan death cross. Data set akan melalui teknik preprocessing data, klasifikasi sentimen positif, negatif, dan netral menggunakan VADER. Pembangunan model menggunakan algoritma naïve bayes dan support vector machine. Hasil penelitian ini memperoleh model support vector machine memiliki kinerja terbaik terhadap keakuratan model dalam klasifikasi sentimen dengan accuracy sebesar 95.92%, ketepatan model dalam memprediksi nilai positif dengan tingkat precision sebesar 95.89%, tingkat usaha dalam menemukan informasi kembali dengan tingkat recall sebesar 95.92%, dan presentasi nilai bobot dari nilai precision dengan nilai recall pada f1-score sebesar 95.89%. Akan tetapi, dalam memprediksi sentimen lima dataset pengujian yang diberikan menggunakan model yang telah dilatih ditemukan algoritma naïve bayes memiliki persentase lebih tinggi yaitu 80% dalam memperoleh hasil yang sesuai antara sentimen positif untuk kondisi golden cross dan sentimen negatif untuk kondisi death cross.

Social media X is a data warehouse that can be utilized to gain insight into public sentiment and its potential impact on cryptocurrency prices. In recent years, Bitcoin has become the center of attention as an attractive form of investment for market players. Bitcoin (BTC) is often characterized by high levels of volatility and its price exhibits extreme rises and falls over short periods of time. By analyzing the tweets of social media user X, this study aims to examine the relationship between the sentiment expressed by social media user X and changes in Bitcoin prices. The dataset used in this research is the model training dataset found on the Kaggle page and the testing dataset collected from X's social media tweets based on the dates of the golden cross and death cross. The dataset will go through data preprocessing techniques, classifying positive, negative and neutral sentiment using VADER. Model construction uses the Naïve Bayes algorithm and Support Vector Machine. The results of this research show that the Support Vector Machine model has the best performance regarding model accuracy in sentiment classification with an accuracy of 95.92%, model accuracy in predicting positive values ​​with a precision level of 95.89%, level of effort in finding information again with a recall rate of 95.92%, and presentation of the weighted value of the precision value with the recall value on the f1-score of 95.89%. However, in predicting the sentiment of the five test datasets provided using the trained model, it was found that the Naïve Bayes algorithm had a higher percentage, namely 80%, in obtaining results that matched positive sentiment for the golden cross condition and negative sentiment for the death cross condition.

References

A. Yadav and D. K. Vishwakarma, “Sentiment analysis using deep learning architectures: a review,” Artif. Intell. Rev., vol. 53, no. 6, pp. 4335–4385, 2020, doi: 10.1007/s10462-019-09794-5.

S. Aggarwal, R. Chaudhary, G. S. Aujla, N. Kumar, K. K. R. Choo, and A. Y. Zomaya, “Blockchain for smart communities: Applications, challenges and opportunities,” J. Netw. Comput. Appl., vol. 144, no. February, pp. 13–48, 2019, doi: 10.1016/j.jnca.2019.06.018.

K. H. Al-Yahyaee, M. U. Rehman, W. Mensi, and I. M. W. Al-Jarrah, “Can uncertainty indices predict Bitcoin prices? A revisited analysis using partial and multivariate wavelet approaches,” North Am. J. Econ. Financ., vol. 49, no. March, pp. 47–56, 2019, doi: 10.1016/j.najef.2019.03.019.

S. Rouhani and E. Abedin, “Crypto-currencies narrated on tweets: a sentiment analysis approach,” Int. J. Ethics Syst., vol. 36, no. 1, pp. 58–72, 2020, doi: 10.1108/IJOES-12-2018-0185.

K. Joshi, B. H. N, and J. Rao, “Stock Trend Prediction Using News Sentiment Analysis,” Int. J. Comput. Sci. Inf. Technol., vol. 8, no. 3, pp. 67–76, 2016, doi: 10.5121/ijcsit.2016.8306.

I. Hamraoui and A. Boubaker, “Impact of Twitter sentiment on stock price returns,” Soc. Netw. Anal. Min., vol. 12, no. 1, pp. 60–71, 2022, doi: 10.1007/s13278-021-00856-7.

E. W. Kim, N. E. Syaheerah, A. Shukor, N. R. Ismail, and S. A. Halim, “Golden cross as Buying Indicator for Stock Investment in Bursa Malaysia,” Teren. Int. Financ. Econ. J., vol. 2, no. 2, pp. 94–102, 2012, [Online]. Available: http://thestar.com.my

S. C. Pranoto, “Web Scraping On Forecast Report Form Of Geomagnet,” [CEPAT] J. Comput. Eng. Progress, Appl. Technol., vol. 1, no. 02, p. 16, 2022, doi: 10.25124/cepat.v1i02.5226.

N. L. W. S. R. Ginantra, C. P. Yanti, G. D. Prasetya, I. B. G. Sarasvananda, and I. K. A. G. Wiguna, “Analisis Sentimen Ulasan Villa di Ubud Menggunakan Metode Naive Bayes, Decision Tree, dan K-NN,” J. Nas. Pendidik. Tek. Inform., vol. 11, no. 3, pp. 205–215, 2022, doi: 10.23887/janapati.v11i3.49450.

V. K. S. Que, A. Iriani, and H. D. Purnomo, “Analisis Sentimen Transportasi Online Menggunakan Support Vector Machine Berbasis Particle Swarm Optimization,” J. Nas. Tek. Elektro dan Teknol. Inf., vol. 9, no. 2, pp. 162–170, 2020, doi: 10.22146/jnteti.v9i2.102.

M. Ghiassi and S. Lee, “A domain transferable lexicon set for Twitter sentiment analysis using a supervised machine learning approach,” Expert Syst. Appl., vol. 106, pp. 197–216, 2018, doi: 10.1016/j.eswa.2018.04.006.

S. Rizkia, E. Budi Setiawan, and D. Puspandari, “Analisis Sentimen Kepuasan Pelanggan Terhadap Internet Provider Indihome di Twitter Menggunakan Metode Decision Tree dan Pembobotan TF-IDF,” e-Proceeding Eng., vol. 6, no. Agustus, pp. 9683–9693, 2019.

R. Maulana, P. A. Rahayuningsih, W. Irmayani, D. Saputra, and W. E. Jayanti, “Improved Accuracy of Sentiment Analysis Movie Review Using Support Vector Machine Based Information Gain,” in Journal of Physics: Conference Series, Nov. 2020, vol. 1641, no. 1. doi: 10.1088/1742-6596/1641/1/012060.

A. Kulkarni, D. Chong, and F. A. Batarseh, Foundations of data imbalance and solutions for a data democracy. Elsevier Inc., 2020. doi: 10.1016/B978-0-12-818366-3.00005-8.

Burhanudin, Y. Musa, and Y. Wihardi, “Klasifikasi Komentar Spam Pada Youtube Menggunakan Metode Naïve Bayes, Support Vector Machine, Dan K-Nearest Neighbors,” J. Inform. dan Komput., vol. 3, no. 2, pp. 54–59, 2108.

Downloads

Published

27-06-2024

Issue

Section

Articles