• Title/Summary/Keyword: web text analysis

Search Result 279, Processing Time 0.026 seconds

Perceptions of Residents in Relation to Smartphone Applications to Promote Understanding of Radiation Exposure after the Fukushima Accident: A Cross-Sectional Study within and outside Fukushima Prefecture

  • Kuroda, Yujiro;Goto, Jun;Yoshida, Hiroko;Takahashi, Takeshi
    • Journal of Radiation Protection and Research
    • /
    • v.47 no.2
    • /
    • pp.67-76
    • /
    • 2022
  • Background: We conducted a cross-sectional study of residents within and outside Fukushima Prefecture to clarify their perceptions of the need for smartphone applications (apps) for explaining exposure doses. The results will lead to more effective methods for identifying target groups for future app development by researchers and municipalities, which will promote residents' understanding of radiological situations. Materials and Methods: In November 2019, 400 people in Fukushima Prefecture and 400 people outside were surveyed via a web-based questionnaire. In addition to basic characteristics, survey items included concerns about radiation levels and intention to use a smartphone app to keep track of exposure. The analysis was conducted by stratifying responses in each region and then cross-tabulating responses to concerns about radiation levels and intention to use an app by demographic variables. The intention to use an app was analyzed by binomial logistic regression analysis. Text-mining analyses were conducted in KH Coder software. Results and Discussion: Outside Fukushima Prefecture, concerns about the medical exposure of women to radiation exceeded 30%. Within the prefecture, the medical exposure of women, purchasing food products, and consumption of own-grown food were the main concerns. Within the prefecture, having children under the age of 18, the experience of measurement, and having experience of evacuation were significantly related to the intention to use an app. Conclusion: Regional and individual differences were evident. Since respondents differ, it is necessary to develop and promote app use in accordance with their needs and with phases of reconstruction. We expect that a suitable app will not only collect data but also connect local service providers and residents, while protecting personal information.

Using Roots and Patterns to Detect Arabic Verbs without Affixes Removal

  • Abdulmonem Ahmed;Aybaba Hancrliogullari;Ali Riza Tosun
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.4
    • /
    • pp.1-6
    • /
    • 2023
  • Morphological analysis is a branch of natural language processing, is now a rapidly growing field. The fundamental tenet of morphological analysis is that it can establish the roots or stems of words and enable comparison to the original term. Arabic is a highly inflected and derivational language and it has a strong structure. Each root or stem can have a large number of affixes attached to it due to the non-concatenative nature of Arabic morphology, increasing the number of possible inflected words that can be created. Accurate verb recognition and extraction are necessary nearly all issues in well-known study topics include Web Search, Information Retrieval, Machine Translation, Question Answering and so forth. in this work we have designed and implemented an algorithm to detect and recognize Arbic Verbs from Arabic text.The suggested technique was created with "Python" and the "pyqt5" visual package, allowing for quick modification and easy addition of new patterns. We employed 17 alternative patterns to represent all verbs in terms of singular, plural, masculine, and feminine pronouns as well as past, present, and imperative verb tenses. All of the verbs that matched these patterns were used when a verb has a root, and the outcomes were reliable. The approach is able to recognize all verbs with the same structure without requiring any alterations to the code or design. The verbs that are not recognized by our method have no antecedents in the Arabic roots. According to our work, the strategy can rapidly and precisely identify verbs with roots, but it cannot be used to identify verbs that are not in the Arabic language. We advise employing a hybrid approach that combines many principles as a result.

Impact of dental imaging on pregnant women and recommendations for fetal radiation safety: A systematic review

  • Thiago Oliveira Gamba;Fernanda Visioli;Deise Renata Bringmann;Pantelis Varvaki Rados;Heraldo Luis Dias da Silveira;Isadora Luana Flores
    • Imaging Science in Dentistry
    • /
    • v.54 no.1
    • /
    • pp.1-11
    • /
    • 2024
  • Purpose: This study was conducted to investigate the safety of dental imaging in pregnant women with respect to fetal health. Materials and Methods: Searches were conducted of the PubMed, Scopus, and Web of Science databases in May 2023. The inclusion criteria encompassed cross-sectional and longitudinal studies that focused on the analysis of diagnostic dental imaging in pregnant women, as well as studies utilizing phantoms to simulate imaging examinations. The exclusion criteria consisted of reviews, letters to the editor, book chapters, and abstracts from scientific conferences and seminars. Results: A total of 3,913 articles were identified. Based on a review of the titles and abstracts, 3,892 articles were excluded, leaving 21 articles remaining for full-text review. Of these, 18 were excluded, and 4 additional articles were included as cross-references. Ultimately, 7 articles underwent quantitative-qualitative analysis. Three retrospective studies were focused on pregnant women who underwent dental imaging procedures. The remaining 4 studies utilized female phantoms to simulate imaging examinations and represent the radiation doses absorbed by the uterus or thyroid. Conclusion: Few dental radiology studies have been conducted to determine the safe radiation threshold for pregnant women. Additionally, the reviewed articles did not provide numbers of dental examinations, by type, corresponding to this dose. Dental imaging examinations of pregnant women should not be restricted if clinically indicated. Ultimately, practitioners must be able to justify the examination and should adhere to the "as low as diagnostically acceptable, being indication-oriented and patient-specific" (ALADAIP) principle of radioprotection.

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.

Educational Implications about Online Debates on a Socio-Scientific Issue from a Postmodernist Perspective: Focus on the Mad Cow Disease (포스트모더니즘의 관점에서 본 과학 관련 사회적 쟁점에 대한 온라인 토론의 과학교육적 함의: 광우병 사례를 중심으로)

  • Jho, Hun-Koog;Song, Jin-Woong
    • Journal of The Korean Association For Science Education
    • /
    • v.30 no.8
    • /
    • pp.933-952
    • /
    • 2010
  • This study aims to characterize debate on a socio-scientific issue in the Internet and to provide implications from a postmodernist perspective. This study concentrates on disentanglement of the complex relationship among society, economy, politics and science in an issue and characterization of the given text centering on its originality, the relationship between writer and reader, and the purpose of utterance. Sixty-six most read articles on a web message board were chosen and analyzed as a typical case of a socio-scientific issue in the internet. In them, five scientific disputes were identified: the cause of mad cow disease (MCD), specified risk material and the incubation period, the cause of new variant Creutzfeld-Jakob disease (vCJD), vulnerability of vCJD and the relation of Alzheimer and vCJD in American patients. Each argument is intertwined with social, economic and political problems such as its impact on the domestic beef market, feeding environment of imported cattle and the retaliation against denial of importation. With regard to originality, it is found that the originality of an author is weakened but communal through repetitive quotation of 'Peom', cutting and pasting, and engagement of readers with their comments. Furthermore, in order to close the gap between writer and reader, identity and personal narrative of the writers are often introduced into their writing. In terms of purpose of utterance, these are intended to deliver one's feelings or facilitate human behavior rather than inform through verification of a principle.

Sensitivity Identification Method for New Words of Social Media based on Naive Bayes Classification (나이브 베이즈 기반 소셜 미디어 상의 신조어 감성 판별 기법)

  • Kim, Jeong In;Park, Sang Jin;Kim, Hyoung Ju;Choi, Jun Ho;Kim, Han Il;Kim, Pan Koo
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.51-59
    • /
    • 2020
  • From PC communication to the development of the internet, a new term has been coined on the social media, and the social media culture has been formed due to the spread of smart phones, and the newly coined word is becoming a culture. With the advent of social networking sites and smart phones serving as a bridge, the number of data has increased in real time. The use of new words can have many advantages, including the use of short sentences to solve the problems of various letter-limited messengers and reduce data. However, new words do not have a dictionary meaning and there are limitations and degradation of algorithms such as data mining. Therefore, in this paper, the opinion of the document is confirmed by collecting data through web crawling and extracting new words contained within the text data and establishing an emotional classification. The progress of the experiment is divided into three categories. First, a word collected by collecting a new word on the social media is subjected to learned of affirmative and negative. Next, to derive and verify emotional values using standard documents, TF-IDF is used to score noun sensibilities to enter the emotional values of the data. As with the new words, the classified emotional values are applied to verify that the emotions are classified in standard language documents. Finally, a combination of the newly coined words and standard emotional values is used to perform a comparative analysis of the technology of the instrument.

Construction of Consumer Confidence index based on Sentiment analysis using News articles (뉴스기사를 이용한 소비자의 경기심리지수 생성)

  • Song, Minchae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.1-27
    • /
    • 2017
  • It is known that the economic sentiment index and macroeconomic indicators are closely related because economic agent's judgment and forecast of the business conditions affect economic fluctuations. For this reason, consumer sentiment or confidence provides steady fodder for business and is treated as an important piece of economic information. In Korea, private consumption accounts and consumer sentiment index highly relevant for both, which is a very important economic indicator for evaluating and forecasting the domestic economic situation. However, despite offering relevant insights into private consumption and GDP, the traditional approach to measuring the consumer confidence based on the survey has several limits. One possible weakness is that it takes considerable time to research, collect, and aggregate the data. If certain urgent issues arise, timely information will not be announced until the end of each month. In addition, the survey only contains information derived from questionnaire items, which means it can be difficult to catch up to the direct effects of newly arising issues. The survey also faces potential declines in response rates and erroneous responses. Therefore, it is necessary to find a way to complement it. For this purpose, we construct and assess an index designed to measure consumer economic sentiment index using sentiment analysis. Unlike the survey-based measures, our index relies on textual analysis to extract sentiment from economic and financial news articles. In particular, text data such as news articles and SNS are timely and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. There exist two main approaches to the automatic extraction of sentiment from a text, we apply the lexicon-based approach, using sentiment lexicon dictionaries of words annotated with the semantic orientations. In creating the sentiment lexicon dictionaries, we enter the semantic orientation of individual words manually, though we do not attempt a full linguistic analysis (one that involves analysis of word senses or argument structure); this is the limitation of our research and further work in that direction remains possible. In this study, we generate a time series index of economic sentiment in the news. The construction of the index consists of three broad steps: (1) Collecting a large corpus of economic news articles on the web, (2) Applying lexicon-based methods for sentiment analysis of each article to score the article in terms of sentiment orientation (positive, negative and neutral), and (3) Constructing an economic sentiment index of consumers by aggregating monthly time series for each sentiment word. In line with existing scholarly assessments of the relationship between the consumer confidence index and macroeconomic indicators, any new index should be assessed for its usefulness. We examine the new index's usefulness by comparing other economic indicators to the CSI. To check the usefulness of the newly index based on sentiment analysis, trend and cross - correlation analysis are carried out to analyze the relations and lagged structure. Finally, we analyze the forecasting power using the one step ahead of out of sample prediction. As a result, the news sentiment index correlates strongly with related contemporaneous key indicators in almost all experiments. We also find that news sentiment shocks predict future economic activity in most cases. In almost all experiments, the news sentiment index strongly correlates with related contemporaneous key indicators. Furthermore, in most cases, news sentiment shocks predict future economic activity; in head-to-head comparisons, the news sentiment measures outperform survey-based sentiment index as CSI. Policy makers want to understand consumer or public opinions about existing or proposed policies. Such opinions enable relevant government decision-makers to respond quickly to monitor various web media, SNS, or news articles. Textual data, such as news articles and social networks (Twitter, Facebook and blogs) are generated at high-speeds and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Although research using unstructured data in economic analysis is in its early stages, but the utilization of data is expected to greatly increase once its usefulness is confirmed.

Design and Implementation of Geographic Education Website Based on the Google Earth (구글어스 기반의 지리교육 사이트 설계 및 구현)

  • Lee, Sun-Ju;Kang, Young-Ok
    • Spatial Information Research
    • /
    • v.18 no.2
    • /
    • pp.13-24
    • /
    • 2010
  • The purpose of this research is to explore the possibility of geographic education by implementing the map-based geographic education site which mashed up with Google earth by referring the various materials of geographic education which exist in on-line and off-line. In recent years map-based geographic education is required by the radical change of geoweb environments, but there have been few researches in this field. This research is folded up as follows: First, we designed the contents through the textbook analysis and then collect various data related to the contents such as pictures, video clips, conceptual map, etc. which are required to explain the concept. Second, we mashed up the collected data on the Google earth by using the Google's open API. Third, we implemented the geographic education website based on the classification of contents in textbook and the various collected data. This research is important in both that it explores the possibility of the map-based education rather than the text-based education in the geographic field which handles mainly the space and finds the best method to express the various concepts of the textbook on the geoweb environments.

Readability Comparison of Pro- and Anti-Cancer Screening Online Messages in Japan

  • Okuhara, Tsuyoshi;Ishikawa, Hirono;Okada, Masahumi;Kato, Mio;Kiuchi, Takahiro
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.12
    • /
    • pp.5237-5242
    • /
    • 2016
  • Background: Cancer screening rates are lower in Japan than those in western countries. Health professionals publish procancer screening messages on the internet to encourage audiences to undergo cancer screening. However, the information provided is often difficult to read for lay persons. Further, anti-cancer screening activists warn against cancer screening with messages on the Internet. We aimed to assess and compare the readability of pro- and anti-cancer screening online messages in Japan using a measure of readability. Methods: We conducted web searches at the beginning of September 2016 using two major Japanese search engines (Google.jp and Yahoo!.jp). The included websites were classified as "anti", "pro", or "neutral" depending on the claims, and "health professional" or "non-health professional" depending on the writers. Readability was determined using a validated measure of Japanese readability. Statistical analysis was conducted using two-way ANOVA. Results: In the total 159 websites analyzed, anti-cancer screening online messages were generally easier to read than pro-cancer screening online messages, Messages written by health professionals were more difficult to read than those written by non-health professionals. Claim ${\times}$ writer interaction was not significant. Conclusion: When health professionals prepare pro-cancer screening materials for publication online, we recommend they check for readability using readability assessment tools and improve text for easy comprehension when necessary.

A Study on the Accuracy Improvement of Movie Recommender System Using Word2Vec and Ensemble Convolutional Neural Networks (Word2Vec과 앙상블 합성곱 신경망을 활용한 영화추천 시스템의 정확도 개선에 관한 연구)

  • Kang, Boo-Sik
    • Journal of Digital Convergence
    • /
    • v.17 no.1
    • /
    • pp.123-130
    • /
    • 2019
  • One of the most commonly used methods of web recommendation techniques is collaborative filtering. Many studies on collaborative filtering have suggested ways to improve accuracy. This study proposes a method of movie recommendation using Word2Vec and an ensemble convolutional neural networks. First, in the user, movie, and rating information, construct the user sentences and movie sentences. It inputs user sentences and movie sentences into Word2Vec to obtain user vectors and movie vectors. User vectors are entered into user convolution model and movie vectors are input to movie convolution model. The user and the movie convolution models are linked to a fully connected neural network model. Finally, the output layer of the fully connected neural network outputs forecasts of user movie ratings. Experimentation results showed that the accuracy of the technique proposed in this study accuracy of conventional collaborative filtering techniques was improved compared to those of conventional collaborative filtering technique and the technique using Word2Vec and deep neural networks proposed in a similar study.