• 제목/요약/키워드: Lexicon Analysis

검색결과 91건 처리시간 0.022초

Cyberbullying Detection by Sentiment Analysis of Tweets' Contents Written in Arabic in Saudi Arabia Society

  • Almutairi, Amjad Rasmi;Al-Hagery, Muhammad Abdullah
    • International Journal of Computer Science & Network Security
    • /
    • 제21권3호
    • /
    • pp.112-119
    • /
    • 2021
  • Social media has become a global means of communication in people's lives. Most people are using Twitter for communication purposes and its inappropriate use, which has negative effects on people's lives. One of the widely common misuses of Twitter is cyberbullying. As the resources of dialectal Arabic are rare, so for cyberbullying most people are using dialectal Arabic. For this reason, the ultimate goal of this study is to detect and classify cyberbullying on Twitter in the Arabic context in Saudi Arabia. To help in the detection and classification of tweets, Pointwise Mutual Information (PMI) to generate a lexicon, and Support Vector Machine (SVM) algorithms are used. The evaluation is performed on both methods in terms of the F1-score. However, the F1-score after applying the PMI is 50%, while after the SVM application on the resampling data it is 82%. The analysis of the results shows that the SVM algorithm outperforms better.

온라인 브랜드 커뮤니티 내 부정적 감정들이 기업 혁신을 위한 고객 기여에 미치는 영향 (The Influence of Negative Emotions on Customer Contribution to Organizational Innovation in an Online Brand Community)

  • 정수연;이한준;서용무
    • 인터넷정보학회논문지
    • /
    • 제14권4호
    • /
    • pp.91-100
    • /
    • 2013
  • 최근 많은 기업들이 도입하는 온라인 브랜드 커뮤니티는 기업 혁신에 도움이 될 고객의 의견을 수집하는 데 유용하게 활용되고 있다. 본 논문은 온라인 브랜드 커뮤니티에 게시되는 다양한 의견들 중 부정적 감정을 담고 있는 고객의견이 기업 혁신에 기여하는데 미치는 영향력을 분석하였다. 이를 위해 먼저 부정적 감정을 Fear, Anger, Shame, Sadness, Frustration의 총 다섯 가지 세분화된 감정으로 분류하고 WordNet과 SentiWordNet을 기반으로 부정적 감정에 대한 감정 어휘군을 구축하였다. 실험을 위해 본 연구에서는 스타벅스의 브랜드 커뮤니티인 MyStarbucksIdea.com에서 81,534건의 고객의견을 수집하였으며 부정적 감정 어휘군을 활용하여 각 고객의견 내 부정적 감정 정보를 추출하였다. 부정적 감정의 유무, 빈도, 강도의 세 가지 측면에 따른 기업 혁신에 대한 영향력을 분석한 결과, 부정적 감정이 담긴 고객의견이 기업 혁신에 유의미한 영향력을 미치는 것으로 나타났으며 부정적 감정 중, Frustration과 Sadness의 감정이 기업 혁신에 긍정적인 영향을 가지고 있음을 확인할 수 있었다.

뉴스기사를 이용한 소비자의 경기심리지수 생성 (Construction of Consumer Confidence index based on Sentiment analysis using News articles)

  • 송민채;신경식
    • 지능정보연구
    • /
    • 제23권3호
    • /
    • pp.1-27
    • /
    • 2017
  • 경제주체들의 경기상황에 대한 판단 및 전망은 경기변동에 영향을 미치므로 경기심리지수와 거시경제지표들 간에는 밀접한 관련성을 나타내는 것으로 알려져 있다. 경기선행지표로 국내에서 많이 사용되는 경기심리지수에는 소비자동향조사, 기업경기조사, 경제심리지수가 있다. 그러나 설문조사를 통해 생성된 지수는 자료의 성격상 속보성이 떨어지는 문제가 있다. 본 연구에서는 이러한 정형데이터의 한계를 보완할 수 있도록 비정형데이터에서 정보를 추출해 경기심리지수를 생성하고, 경제분석에서의 활용 가능성을 검토하였다. 민간소비와 관련된 실물지표에는 소매판매업지수와 서비스업생산지수를 사용하였고, 고용지표에는 고용률과 실업률을, 가격지표에는 소비자물가상승률과 가계의 대출금리를 사용하여 지표들 간의 추이 분석 및 시차구조 파악을 위한 교차상관분석을 수행하였다. 마지막으로 이들 지표들에 대한 예측 가능성을 점검하였다. 분석결과, 다른 지표들의 선행지수로 많이 사용되는 소비자심리지수와 비교해 선택 지표들과 높은 상관관계를 보이며, 1~2개월 선행한 것으로 나타났다. 예측력 또한 향상되어 텍스트데이터에서 생성한 소비자 경기심리지수의 유용성이 확인되었다. 온라인에서 생성되는 뉴스기사나 소셜 SNS 등의 텍스트 데이터는 속보성이 뛰어나고, 커버리지가 넓어 특정 경제적 이슈가 발생할 경우 이것이 경제에 미치는 영향을 빠르게 파악할 수 있다는 점에서 경기판단지표로써의 잠재적 가능성이 클 것으로 보인다. 경제분석에서 비정형데이터를 활용한 국내연구는 초기 단계지만 데이터의 유용성이 확인되면 그 활용도가 크게 높아질 것으로 기대한다.

Effect of Shiitake(Lentinus edodes. p) Mushroom Powder and Sodium Tripolyphosphate on Texture and Flavor of Pork Patties

  • Chun, S.S.;Edger Chambers IV;Delores H. Chambers
    • 한국식품영양학회:학술대회논문집
    • /
    • 한국식품영양학회 2004년도 하계 학술 심포지엄
    • /
    • pp.48-48
    • /
    • 2004
  • Shiitake mushroom contained numerous nutrients, special flavor (lenthionine) and taste. In Asia, they are thought to have numerous medical properties f3r several diseases such as diabetes, anemia, tumors. Phosphates are known to increase hydration and water binding, stabilize meat emulsions, improve juiciness and tenderness, provide mineral supplementation, and maintain flavor of processed meat products. A lexicon f3r describing the texture and flavor of cooked pork patties were developed. (omitted)

  • PDF

Descriptive Sensory Characteristics of Beef Jerky Prepared Different Methods

  • Lee, J.H.;Iv, Edgar-Chambers;Chambers, Delores-H.;Chin, K.B.;Kim, R.Y.;Chun, S.S.;Oh, J.S.
    • 한국식품영양학회:학술대회논문집
    • /
    • 한국식품영양학회 2004년도 하계 학술 심포지엄
    • /
    • pp.49-49
    • /
    • 2004
  • Beef jerky is a traditional salted or soysauce-added and dried Korea meat product. Jerky is made from loins which is thin sliced, curing salt(soysauce-added), smoked and dried. The purpose of this study was to investigate the effects curing salt soy sam and/or smoking. A lexicon for describing the texture and flavor of beef jerky were developed. The intensity of a vatiety of texture, flavor, and mouth feel properties was characterized for beef jerky. A highly trained descriptive sensory panel identified, defined and referenced 17 attributes for beef jerky. (omitted)

  • PDF

코로나19 팬데믹 상황에서 감성분석을 이용한 미국, 중국, 한국 여행자의 온라인 리뷰 비교 분석 (A Comparative Analysis of Travelers' Online Reviews among China, USA, and South Korea using Sentiment Analysis in the Era of the COVID-19 Pandemic)

  • 홍준우;홍태호
    • 한국IT서비스학회지
    • /
    • 제20권5호
    • /
    • pp.159-176
    • /
    • 2021
  • In this study, we performed a comparative analysis of the sentiment value for the tourists in USA, China, and Korea on the COVID19 pandemic era to explore and find out the features of the tourists by using online reviews. We collected a total of 243,826 online hotel reviews for metropolitan city and vacation spot in the three countries to compare the features between the business and the vacation trips. We collected the online reviews into the tow groups from Jan. 1, 2019 to Nov. 31, 2019 for before COVID19 pandemic and from Apr. 1, 2020 to Deb 28, 2021 for during COVID19. Online reviews were categorized into 6 dimensions using LDA model. Sentiment analysis were presented for 6 dimensions by utilizing a lexicon base. We proposed an approach to analyzing the importance of each attribute by applying 6-dimensional sentiment values to conjoint analysis. Our empirical analysis showed that the proposed approach could explore and find out the changed features of travelers during the COVID19 pandemic.

Competitive intelligence in Korean Ramen Market using Text Mining and Sentiment Analysis

  • Kim, Yoosin;Jeong, Seung Ryul
    • 인터넷정보학회논문지
    • /
    • 제19권1호
    • /
    • pp.155-166
    • /
    • 2018
  • These days, online media, such as blogospheres, online communities, and social networking sites, provides the uncountable user-generated content (UGC) to discover market intelligence and business insight with. The business has been interested in consumers, and constantly requires the approach to identify consumers' opinions and competitive advantage in the competing market. Analyzing consumers' opinion about oneself and rivals can help decision makers to gain in-depth and fine-grained understanding on the human and social behavioral dynamics underlying the competition. In order to accomplish the comparison study for rival products and companies, we attempted to do competitive analysis using text mining with online UGC for two popular and competing ramens, a market leader and a market follower, in the Korean instant noodle market. Furthermore, to overcome the lack of the Korean sentiment lexicon, we developed the domain specific sentiment dictionary of Korean texts. We gathered 19,386 pieces of blogs and forum messages, developed the Korean sentiment dictionary, and defined the taxonomy for categorization. In the context of our study, we employed sentiment analysis to present consumers' opinion and statistical analysis to demonstrate the differences between the competitors. Our results show that the sentiment portrayed by the text mining clearly differentiate the two rival noodles and convincingly confirm that one is a market leader and the other is a follower. In this regard, we expect this comparison can help business decision makers to understand rich in-depth competitive intelligence hidden in the social media.

LDA를 이용한 온라인 리뷰의 다중 토픽별 감성분석 - TripAdvisor 사례를 중심으로 - (Multi-Topic Sentiment Analysis using LDA for Online Review)

  • 홍태호;니우한잉;임강;박지영
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제27권1호
    • /
    • pp.89-110
    • /
    • 2018
  • Purpose There is much information in customer reviews, but finding key information in many texts is not easy. Business decision makers need a model to solve this problem. In this study we propose a multi-topic sentiment analysis approach using Latent Dirichlet Allocation (LDA) for user-generated contents (UGC). Design/methodology/approach In this paper, we collected a total of 104,039 hotel reviews in seven of the world's top tourist destinations from TripAdvisor (www.tripadvisor.com) and extracted 30 topics related to the hotel from all customer reviews using the LDA model. Six major dimensions (value, cleanliness, rooms, service, location, and sleep quality) were selected from the 30 extracted topics. To analyze data, we employed R language. Findings This study contributes to propose a lexicon-based sentiment analysis approach for the keywords-embedded sentences related to the six dimensions within a review. The performance of the proposed model was evaluated by comparing the sentiment analysis results of each topic with the real attribute ratings provided by the platform. The results show its outperformance, with a high ratio of accuracy and recall. Through our proposed model, it is expected to analyze the customers' sentiments over different topics for those reviews with an absence of the detailed attribute ratings.

Developing Sensory Lexicons for Tofu

  • Chung, Jin-A;Lee, Hye-Seong;Chung, Seo-Jin
    • Food Quality and Culture
    • /
    • 제2권1호
    • /
    • pp.27-31
    • /
    • 2008
  • The objective of this study was to develop sensory lexicons that can be utilized for various types of tofu such as pressed, unpressed, and tofu made from germinated soybeans, using generic descriptive analysis. In the first phase of the experiment, trained descriptive panelists developed and defined the appearance, aroma, flavor, and texture attributes that are commonly present in tofu. Then, the sensory characteristics of seven types of tofu were analyzed using the sensory lexicons established in the initial stage of the experiment. Four appearance, 6 odor/aroma, 6 flavor/taste, 7 texture, and 4 aftertaste attributes were identified, and reference standards were established for most of the terms in order to facilitate the understanding of the attribute definitions. The intensities of the sensory attributes were measured on a 15-point scale. Statistical analyses, including analysis of variance and principal component analysis, were used for the data. The seven tofu samples showed significant differences in the intensities of 22 attributes. The unpressed tofu samples were generally rated as being high in moistness, easy to cut, silky, and easy to swallow. The pressed tofu, on the other hand, was salty, astringent, beany, hard, and rough in texture. The tofu made with germinated soybeans was characterized as having a strong cooked bean flavor, salty and astringent aftertaste, and hard texture. Overall, the attributes of moistness, easy to swallow, and silkiness showed strong positive correlations; hardness and sticks to teeth were also positively correlated to each other.

  • PDF

Impact of the Liver Imaging Reporting and Data System on Research Studies of Diagnosing Hepatocellular Carcinoma Using MRI

  • Yura Ahn;Sang Hyun Choi;Jong Keon Jang;So Yeon Kim;Ju Hyun Shim;Seung Soo Lee;Jae Ho Byun
    • Korean Journal of Radiology
    • /
    • 제23권5호
    • /
    • pp.529-538
    • /
    • 2022
  • Objective: Since its introduction in 2011, the CT/MRI diagnostic Liver Imaging Reporting and Data System (LI-RADS) has been updated in 2014, 2017, and 2018. We evaluated the impact of CT/MRI diagnostic LI-RADS on liver MRI research methodology for the diagnosis of hepatocellular carcinoma (HCC). Materials and Methods: The MEDLINE, EMBASE, and Cochrane databases were searched for original articles reporting the diagnostic performance of liver MRI for HCC between 2011 and 2019. The MRI techniques, image analysis methods, and diagnostic criteria for HCC used in each study were investigated. The studies were classified into three groups according to the year of publication (2011-2013, 2014-2016, and 2017-2019). We compared the percentage of studies adopting MRI techniques recommended by LI-RADS, image analysis methods in accordance with the lexicon defined in LI-RADS, and diagnostic criteria endorsed by LI-RADS. We compared the pooled sensitivity and specificity between studies that used the LI-RADS and those that did not. Results: This systematic review included 179 studies. The percentages of studies using imaging techniques recommended by LI-RADS were 77.8% for 2011-2013, 85.7% for 2014-2016, and 84.2% for 2017-2019, with no significant difference (p = 0.951). After the introduction of LI-RADS, the percentages of studies following the LI-RADS lexicon were 0.0%, 18.4%, and 56.6% in the respective periods (p < 0.001), while the percentages of studies using the LI-RADS diagnostic imaging criteria were 0.0%, 22.9%, and 60.7%, respectively (p < 0.001). Studies that did not use the LI-RADS and those that used the LIRADS version 2018 showed no significant difference in sensitivity and specificity (86.3% vs. 77.7%, p = 0.102 and 91.4% vs. 89.9%, p = 0.770, respectively), with some difference in heterogeneity (I2 = 94.3% vs. 86.7% in sensitivity and I2 = 86.6% vs. 53.2% in specificity). Conclusion: LI-RADS imparted significant changes in the image analysis methods and diagnostic criteria used in liver MRI research for the diagnosis of HCC.