Preprocessing of Aspect-based English Telugu Code Mixed Sentiment Analysis (مقاله علمی وزارت علوم)

درجه علمی: نشریه علمی (وزارت علوم)

نویسندگان: Arun Kodirekka Ayyagari Srinagesh

منبع: Journal of Information Technology Management , Volume 15, Special Issue: Digital Twin Enabled Neural Networks Architecture Management for Sustainable Computing, 2023

کلید واژه ها: English-Telugu code-mixed data Natural Language Processing Telugu Senti Wordnet Machine Learning deep learning

حوزه های تخصصی:

doi: 10.22059/jitm.2023.91573

شماره صفحات: ۱۵۰ - ۱۶۳

دریافت مقاله تعداد دانلود : ۴۹

آرشیو

چکیده

Extracting sentiments from the English-Telugu code-mixed data can be challenging and is still a relatively new research area. Data obtained from the Twitter API has to be in English-Telugu code-mixed language. That data is free-form text, noisy, lexicon borrowings, code-mixed, phonetic typing and misspelling data. The initial step is language identification and sentiment class labels assigned to each tweet in the dataset. The second step is the data normalization task, and the final step is classification, which can be achieved using three different methods: lexicon, machine learning, and deep learning. In the lexicon-based approach, tokenize each tweet with its language tag. If the language tag is in Telugu, transliterate the roman script into native Telugu words. Words are verified with TeluguSentiWordNet, and the Telugu sentiments are extracted, and English SentiWordNets are used to extract sentiments from the English tokens. In this paper, the aspect-based sentiment analysis approach is suggested and used with normalized data. In addition, deep learning and machine learning techniques are applied to extract sentiment ratings, and the results are compared to prior work.

Preprocessing of Aspect-based English Telugu Code Mixed Sentiment Analysis (مقاله علمی وزارت علوم)

درجه علمی: نشریه علمی (وزارت علوم)

آرشیو

آرشیو شماره ها:
۶۹

سال ۲۰۲۳ (۷)

سال ۲۰۲۲ (۸)

سال ۲۰۲۱ (۷)

سال ۲۰۲۰ (۶)

سال ۲۰۱۹ (۴)

سال ۲۰۱۸ (۴)

سال ۲۰۱۷ (۴)

سال ۲۰۱۶ (۴)

سال ۲۰۱۵ (۴)

سال ۲۰۱۴ (۴)

سال ۲۰۱۳ (۴)

سال ۲۰۱۲ (۴)

سال ۱۳۹۰ (۴)

سال ۱۳۸۹ (۲)

سال ۱۳۸۸ (۲)

سال ۱۳۸۷ (۱)

چکیده

تبلیغات

Preprocessing of Aspect-based English Telugu Code Mixed Sentiment Analysis (مقاله علمی وزارت علوم)

درجه علمی: نشریه علمی (وزارت علوم)

آرشیو

آرشیو شماره ها: ۶۹

چکیده

تبلیغات

آرشیو شماره ها:
۶۹