• Title/Summary/Keyword: sentence analysis

Search Result 497, Processing Time 0.022 seconds

Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality (지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법)

  • Choi, Sukjae;Lee, Jungwon;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.119-138
    • /
    • 2017
  • Recently, SNS has become an important channel for marketing as well as personal communication. However, cybercrime has also evolved with the development of information and communication technology, and illegal advertising is distributed to SNS in large quantity. As a result, personal information is lost and even monetary damages occur more frequently. In this study, we propose a method to analyze which sentences and documents, which have been sent to the SNS, are related to financial fraud. First of all, as a conceptual framework, we developed a matrix of conceptual characteristics of cybercriminality on SNS and emergency management. We also suggested emergency management process which consists of Pre-Cybercriminality (e.g. risk identification) and Post-Cybercriminality steps. Among those we focused on risk identification in this paper. The main process consists of data collection, preprocessing and analysis. First, we selected two words 'daechul(loan)' and 'sachae(private loan)' as seed words and collected data with this word from SNS such as twitter. The collected data are given to the two researchers to decide whether they are related to the cybercriminality, particularly financial fraud, or not. Then we selected some of them as keywords if the vocabularies are related to the nominals and symbols. With the selected keywords, we searched and collected data from web materials such as twitter, news, blog, and more than 820,000 articles collected. The collected articles were refined through preprocessing and made into learning data. The preprocessing process is divided into performing morphological analysis step, removing stop words step, and selecting valid part-of-speech step. In the morphological analysis step, a complex sentence is transformed into some morpheme units to enable mechanical analysis. In the removing stop words step, non-lexical elements such as numbers, punctuation marks, and double spaces are removed from the text. In the step of selecting valid part-of-speech, only two kinds of nouns and symbols are considered. Since nouns could refer to things, the intent of message is expressed better than the other part-of-speech. Moreover, the more illegal the text is, the more frequently symbols are used. The selected data is given 'legal' or 'illegal'. To make the selected data as learning data through the preprocessing process, it is necessary to classify whether each data is legitimate or not. The processed data is then converted into Corpus type and Document-Term Matrix. Finally, the two types of 'legal' and 'illegal' files were mixed and randomly divided into learning data set and test data set. In this study, we set the learning data as 70% and the test data as 30%. SVM was used as the discrimination algorithm. Since SVM requires gamma and cost values as the main parameters, we set gamma as 0.5 and cost as 10, based on the optimal value function. The cost is set higher than general cases. To show the feasibility of the idea proposed in this paper, we compared the proposed method with MLE (Maximum Likelihood Estimation), Term Frequency, and Collective Intelligence method. Overall accuracy and was used as the metric. As a result, the overall accuracy of the proposed method was 92.41% of illegal loan advertisement and 77.75% of illegal visit sales, which is apparently superior to that of the Term Frequency, MLE, etc. Hence, the result suggests that the proposed method is valid and usable practically. In this paper, we propose a framework for crisis management caused by abnormalities of unstructured data sources such as SNS. We hope this study will contribute to the academia by identifying what to consider when applying the SVM-like discrimination algorithm to text analysis. Moreover, the study will also contribute to the practitioners in the field of brand management and opinion mining.

Development of Differential Diagnosis Scale Items for Adductor Spasmodic Dysphonia and Evaluation of Clinical Availability (내전형 연축성 발성장애 감별진단 문항 개발과 임상적 유용성 평가)

  • Cho, Jae Kyung;Choi, Seong Hee;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.30 no.2
    • /
    • pp.112-117
    • /
    • 2019
  • Background and Objectives The purpose of this study was to develop the differential diagnosis scale containing items from adductor spasmodic dysphonia (ADSD) to muscle tension dysphonia (MTD) and the determine clinical utility of newly developed items. Materials and Method The four parts of pitch, redirected phonation, automatic speech and voiced sound were selected for analyzing the characteristics of ADSD in the literature. One part of tense voiceless sound was developed according to the Korean manner of articulation. The content validity was evaluated based on 5 scales (1-5 point) analysis from 30 experts. One hundred patients (50 ADSD and 50 MTD) were recorded in reading a sentence and sustained phonation. The two speech language pathologist evaluated recorded voices through a blind test using 4 scales (0-3 point) for newly developed items. Results As a result of verifying the content validity of items with experts, it was identified that the differentiated items were valid with 4.2 out of 5. Through the differential diagnosis between two groups according to the items, the correlation between sub-domains and total scores was shown as higher than 0.710. The result of analyzing the reliability on each diagnosis domain was 0.840-0.893, which showed the internal consistency of items was great. Newly developed five parts of ADSD were significantly higher than those of MTD with strong correlation (p<0.01). The reliability among the evaluators was analyzed as high with 0.892. Conclusion In this study, the differential diagnosis scale of ADSD was revealed as having validity and reliability. It is considered that it will be useful for differentiating ADSD and MTD in the clinical field.

Language performance analysis based on multi-dimensional verbal short-term memories in patients with conduction aphasia (다차원 구어 단기기억에 따른 전도 실어증 환자의 언어수행력 분석)

  • Ha, Ji-Wan;Hwang, Yu Mi;Pyun, Sung-Bom
    • Korean Journal of Cognitive Science
    • /
    • v.23 no.4
    • /
    • pp.425-455
    • /
    • 2012
  • Multi-dimensional verbal short-term memory mechanisms are largely divided into the phonological channel and the lexical-semantic channel. The former is called phonological short-term memory and the latter is called semantic short-term memory. Phonological short-term memory is further segmented into the phonological input buffer and the phonological output buffer. In this study, the language performance of each of three patients with similar levels of conduction aphasia was analyzed in terms of multi-dimensional verbal short-term memory. To this end, three patients with conduction aphasia were instructed to perform four different aspects of language tasks that are spontaneous speaking, repetition, spontaneous writing, and dictation in both word and sentence level. Moreover, the patients' phonological memories and semantic short-term memories were evaluated using digit span tests and verbal learning tests. As a result, the three subjects exhibited various types of performances and error responses in the four aspects of language tests, and the short-term memory tests also did not produce identical results. The language performance of three patients with conduction aphasia can be explained according to whether the defects occurred in the semantic short-term memory, phonological input buffer and/or phonological output buffer. In this study, the relations between language and multi-dimensional verbal short-term memory were discussed based on the results of language tests and short-term memory tests in patients with conduction aphasia.

  • PDF

A Study on Parents' Mental Model of Media Environment and Children's Media Use (미디어 환경과 사용에 대한 부모의 심성모형 연구)

  • Lee, Ran;Hong, Jimin
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.12
    • /
    • pp.818-834
    • /
    • 2014
  • The purpose of this study is to examine parents' mental model of media environment and children's media use and to provide some educational suggestions. For this purpose, twelve parents of second-graders to fourth-graders sampled in elementary schools were interviewed with three activities such as a word-association experiment, a sentence completion task and a in-depth interview. The result was categorized into 8 elements such as interaction, source of supply and adverse effects. Furthermore, the analysis on the mental model of media use shows that firstly, the parents understand modern media reflects competence while they have a feeling of fear and newness on media themselves. Secondly, the parents show an ambivalent understanding on media use in terms of both negative and positive effects and have a tendency to control them. Another finding is the fact that the parents understand digital media as a representation of both connection and disconnection. Also, the parents realize media as a cause of conflict and as a place for reconciliation as well. Finally, it is showed that media is not only a personal territory but also a part of social system in the parents' understanding. Based on these findings, some interpretations and parents' educational applications are provided in terms of the Meyrowitz(1998; 1999)'s three perspectives on media.

An Analysis Method of User Preference by using Web Usage Data in User Device (사용자 기기에서 이용한 웹 데이터 분석을 통한 사용자 취향 분석 방법)

  • Lee, Seung-Hwa;Choi, Hyoung-Kee;Lee, Eun-Seok
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.3
    • /
    • pp.189-199
    • /
    • 2009
  • The amount of information on the Web is explosively growing as the Internet gains in popularity. However, only a small portion of the information on the Web is truly relevant or useful to the user. Thus, offering suitable information according to user demand is an important subject in information retrieval. In e-commerce, the recommender system is essential to revitalize commercial transactions, raise user satisfaction and loyalty towards the information provider. The existing recommender systems are mostly based on user data collected at servers, so user data are dispersed over several servers. Therefore, web servers that lack sufficient user behavior data cannot easily infer user preferences. Also, if the user visits the server infrequently, it may be hard to reflect the dynamically changing user's interest. This paper proposes a novel personalization system analyzing the user preference based on web documents that are accessed by the user on a user device. The system also identifies non-content blocks appearing repeatedly in the dynamically generated web documents, and adds weight to the keywords extracted from the hyperlink sentence selected by the user. Therefore, the system establishes at an early stage recommendation strategies for the web server that has little user data. Also, user profiles are generated rapidly and more accurately by identifying the information blocks. In order to evaluate the proposed system, this study collected web data and purchase history from users who have current purchase activity. Then, we computed the similarity between purchase data and the user profile. We confirm the accuracy of the generated user profile since the web page containing the purchased item has higher correlation than other item pages.

An Analysis of Pre-service Science Teachers' Reflective Thinking aboutvScientific Experiment in Experimental Journal Writings (실험 저널쓰기에서 나타난 예비과학교사들의 과학실험에 대한 반성적 사고 분석)

  • Lee, Yun-Jung;Im, Sung-Min
    • Journal of The Korean Association For Science Education
    • /
    • v.31 no.2
    • /
    • pp.198-209
    • /
    • 2011
  • In this study, pre-service science teachers' reflective thinking in their journal writing was investigated. To do this, the authors used pre-service science teachers' journal writing abilities, wherein they not only reported data and result formally, but also wrote their feelings and reflections about an inquiry-based physics experiment they performed. Pre-service science teachers' writings were decomposed into sentences and each sentence was analyzed into a framework with 4 dimensions: knowledge, procedure, orientation and attitude. Reflective thinking in knowledge dimension included reflection on what they know before the experiment, what they still do not know and what they learned from the experiment. Reflective thinking in procedure dimension included recalls of experiences about general experimental procedures and specific experimental skill. Reflective thinking in orientation dimension included their views about the nature of science and science teaching and learning, and reflective thinking in attitude dimension consisted of interests, motives and values about the experiment they performed. While there were some variations in frequency distribution of reflective thinking by the topic of experiments, pre-service science teachers' reflective thinking in journal writings revealed their metacognition on their knowledge and learning, epistemological belief about science and science learning, and affective domain related to experiment. This study can infer that such kind of writing with 'their own language' in an informal way followed by formal 'scientific' reports in a scientific experiment has a significance not only as a mediator representing reflective thinking but also as an instructional activity to facilitate reflective thinking in science learning and teaching.

Correlation between Depression and Memory According to Apolipoprotein E Genotype in Elderly with Alzheimer's Dementia (알츠하이머 치매노인의 Apolipoprotein E 유전형에 따른 우울과 기억력의 상관관계)

  • Kim, Kwang-Jae;Noh, Dong-hee;Han, Seung-Hyup;Cha, Yun-Jun;Kam, Kyung-Yoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.1
    • /
    • pp.477-486
    • /
    • 2020
  • This study was undertaken to analyze the correlation between depression and memory, by considering the occurrence of ApoE ɛ4 and clinical dementia rating in the elderly with Alzheimer's dementia. This study included 50 participants over 65 years of age, evaluated with CDR 0.5 to 2. We performed CDR, SVLT-E, RCFT, SGDS-K, and ApoE genotyping. Spearman's correlation analysis was used for determining the correlation between depression and memory. The results indicate a significant negative correlation between depression and immediate recall verbal memory in the CDR 1 and 2 without ApoE ɛ4 carrier group (p<0.05). Furthermore, a significant negative correlation was also determined between depression and delayed recall verbal memory in the CDR 1 of the same group. Ed. Notes: The previous sentence already shows this correlation. I suggest this should be deleted from this statement. However, no significant correlation was observed between depression and visual memory. This study found a significant correlation between depression and immediate recall verbal memory. Also, the presence of ApoE ɛ4 indicates a significant correlation between depression and delayed verbal recall memory. Taken together, our results indicate that verbal memory training rather than visual memory training can be more effective in early AD. Also, the treatment of depression will provide a complementary effect.

Analysis Study on Successful Hit Elements of Faction Film < Gwang-hae: The Man Who Became King > (팩션영화 <광해, 왕이 된 남자>의 흥행 요소 분석 연구)

  • Kim, kyung-Sik;Jung, Ji-Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.6
    • /
    • pp.179-190
    • /
    • 2015
  • The current movie world is called the era of faction hot-wind by the continuous hits of faction films. The interpretation of the missing 15 days in the Annals of the Joseon Dynasty during Gwang-hae's reign based on one sentence in the record makes 'faction' that there was another king, drives audiences to absorption and imagination in the faction film < Gwang-hae: The Man Who Became King >. Furthermore, this film redefined historical king Gwang-hae as an ambivalent image through Gwang-hae and Ha-sun who filled the role as the king in it. Also, this film was appraised by reviews of reinterpretation of the image of leader who people want and hit success when the film released before the season of the presidential election. This thesis considers 'Faction' which is marked as a new image content and analyzes the film < Gwang-hae: The Man Who Became King > which is listed as Korea's all-time sixth highest grossing film with 12,323,555 tickets sold nationwide by three sections; < Gwang-hae: The Man Who Became King > as faction film, two images of Gwang-hae, and the appropriateness of film release time. In conclusion, the film < Gwang-hae: The Man Who Became King >. succeeded since it communicated with people to satisfy their wishes and taste. It would be necessary to study and analyze the basic connection between the trend of movies and Strategic elements of the box-office results, and a significant stride for progression of the movies.

Development of Acquisition and Analysis System of Radar Information for Small Inshore and Coastal Fishing Vessels - Position Tracking and Real-Time Monitoring- (연근해 소형 어선의 레이더 정보 수록 및 해석 시스템 개발 -위치 추적 및 실시간 모니터링 -)

  • 이대재;김광식;신형일;변덕수
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.39 no.4
    • /
    • pp.337-346
    • /
    • 2003
  • This paper describes on the system and method for automatically tracking and real-time monitoring the position of target ships relative to the own ship using a PC based radar system that displays radar images and electronic charts together on a single PC screen. This system includes a simulator for generating the GGA and VTG information of target ships and a simulator for generating the TTM and OSD outputs from a ARPA radar and then host computer accepts NMEA0183 sentences on the maneuvering information of target ships from these simulators. The results obtained are summarized as follows;1. The system developed this study can be used as a range finder for measuring the distance between two ships and as a device for providing the maneuvering information such as distance and bearing to target ships from own ship on ECS screen. 2. From the result of position tracking for a selected target ship tracked with an update rate of 5 seconds using the $\alpha$-$\beta$ tracker, we concluded that the smoothing effect by the $\alpha$-$\beta$tracker was very effective and stable except in the time interval until about one minute after the target is detected. 3. From the fact that the real-time maneuvering information of tracked ship targets via a local area network (LAN) from a host computer installed a radar target extractor was successfully transferred to various monitoring computers of ship, we concluded that this system can be used as a sub-monitoring system of ARPA radar.

A Heuristic Method for Extracting True Opinion Targets (의도된 의견 대상의 추출을 위한 경험적 방법)

  • Soh, Yun-Kyu;Kim, Han-Woo;Jung, Sung-Hun;Kim, Dong-Ju
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.9
    • /
    • pp.39-47
    • /
    • 2012
  • The opinion of user on a certain product is expressed in positive/negative sentiments for specific features of it. In some cases, they are expressed for a holistic part of homogeneous specific features, or expressed for product itself. Therefore, in the area of opinion mining, name of opinion features to be extracted are specific feature names, holonyms for theses specific features, and product names. However, when the opinion target is described with product name or holonym, sometimes it may not match feature name of opinion sentence to true opinion target intended by the reviewer. In this paper, we present a method to extract opinion targets from opinion sentences. Most importantly, we propose a method to extract true target from the feature names mismatched to a intended target. First, we extract candidate opinion pairs using dependency relation between words, and then select feature names frequently mismatched to opinion target. Each selected opinion feature name is replaced to a specific feature intended by the reviewer. Finally, in order to extract relevant opinion features from the whole candidate opinion pairs including modified opinion feature names, candidate opinion pairs are rearranged by the order of user's interest.