• Title/Summary/Keyword: 인터넷 신조어

Search Result 19, Processing Time 0.04 seconds

Methodology and Implementation of Detecting Tool for New Words Occurring in Korean Document (신조어 자동 추출 방법론과 신어 조사 도구의 개발)

  • Lee, Samuel Sangkon
    • Annual Conference on Human and Language Technology
    • /
    • 2009.10a
    • /
    • pp.271-276
    • /
    • 2009
  • 신조어 조사용 프로그램은 웹에 실시간으로 등록되는 언론 기사를 수집하는 웹 에이전트를 개발하여 텍스트를 추출하고, 간단한 어휘 분석을 통하여 국어사전에 등록된 표제어와 이미 연구자가 발견한 기존의 신조어를 제외하고, 현대의 사회상을 잘 표현하는 새로 생성된 신조어를 추출하는 작업을 하는 도구이다. 인터넷의 언론 사이트에서 규칙적인 URL 패턴을 발견하고 뉴스 기사를 수집한다. HTML 소스 분석을 통하여 언론 기사만을 추출하여 국어 전공자가 신어를 찾아내는 작업을 도와주는 조사 도구를 설계하고 구현하였다.

  • PDF

Implementation of the Automatic Indexing and New Term Processing System for Game Information Retrieval (게임 정보검색을 위한 자동색인 및 신조어 처리 시스템 구현)

  • Lee, Sang-Joon;Ryu, Keun-Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2001.04a
    • /
    • pp.51-54
    • /
    • 2001
  • 오늘날 국내외에 인터넷 보급의 대중화가 점차 확대되고 네트워크을 이용하는 게임의 증가에 따라 게임에 관련된 웹 문서에 대한 사용자의 요구가 증가되고 있다. 기존의 수작업에 의한 색인 방식은 많은 전문인력, 시간, 경비등을 필요로 하기 때문에, 기하급수적으로 증가하는 웹 상의 정보를 처리하기에는 이미 그 한계에 이른 실정이다. 이러한 문제점의 해결을 위해 컴퓨터를 이용한 자동색인 시스템의 개발은 매우 중요하고 시급하다. 더구나 게임 분야에서 있어 신조어는 너무나 급속히 생성되고 있다. 따라서 이러한 신조어 처리는 효과적인 자동색인을 위한 중요한 요소이다. 이 논문에서는 사용자들에게 보다 적합하고 안정적인 게임 정보를 제공하기 위해 게임 용어 사전을 이용한 자동색인과 신조어 처리 시스템을 설계, 구현한다. 자동색인 및 신조어 처리를 위해 게임용어사전, TF-IDF, n-gram 추출법을 이용한다.

  • PDF

A Study on the Archiving of a Social Phenomenon through Neologism (신조어를 활용한 사회적 현상 아카이빙 방안 연구)

  • Kim, Hwan;Yim, Jin Hee
    • The Korean Journal of Archival Studies
    • /
    • no.52
    • /
    • pp.315-342
    • /
    • 2017
  • Language is an important medium for communication among the members of society and a mirror that reflects society as a whole. As society and culture change and develop over centuries, language follows suit. To keep up with the changes in the new era and express new concepts, countless new neologisms continue to appear. Recently, the use of neologisms is getting increasingly focused on social networking service and other Internet communication sites, which then spread rapidly through various media. If you look at the popular neologisms on the Internet, it implicitly reflects conflicts between the eras and the generations, people's psychology and ideology, and social phenomena such as culture. The function of neologisms is not solely for the entertainment element of communication but also for criticizing social problems and their vital use as a search keyword. This study focuses on the meaning and importance of gathering information and analyzing records about neologisms that reflect the social phenomenon in a certain period, and this will be labeled as "neologism archiving." This study proposes a direction for the construction of a neologism archive by comparing the currently existing neologism archiving system with the existing dictionary concept. In addition, this study serves as a reminder of the convenience and the contemporary social phenomena, such as smooth communication between generations, and the dissemination of inequality of information sharing. Lastly, this study aims to support experts with their research on neologisms for the social phenomenon.

Design and Implementation of Detecting Tool for New Word in Korean Journal Articles (언론 기사에 나타난 신(조)어 조사 도구의 설계 및 구현)

  • Song, In-sung;Jeong, Hee-seok;Lee, Samuel Sangkon;Lee, Raeho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.04a
    • /
    • pp.114-117
    • /
    • 2009
  • 신조어 조사용 프로그램은 웹에 실시간으로 등록되는 언론 기사를 수집하는 웹 에이전트를 개발하여 텍스트를 추출하고, 간단한 어휘 분석을 통하여 국어사전에 등록된 표제어와 이미 연구자가 발견한 기존의 신조어를 제외하고 새롭게 생성된 신조어를 추출하는 작업을 하는 도구이다. 인터넷의 언론 사이트에서 규칙적인 URL 패턴을 발견하고 뉴스 기사를 수집한다. HTML 소스 분석을 통하여 언론 기사만을 추출하고 이 기사에서 사전의 표제어와 기존에 조사된 신어를 제외하여 국어 전공자가 신어를 찾아내는 작업을 하는데 사용하는 시스템을 설계하고 구현하였다.

Knowledge Graph-based Korean New Words Detection Mechanism for Spam Filtering (스팸 필터링을 위한 지식 그래프 기반의 신조어 감지 매커니즘)

  • Kim, Ji-hye;Jeong, Ok-ran
    • Journal of Internet Computing and Services
    • /
    • v.21 no.1
    • /
    • pp.79-85
    • /
    • 2020
  • Today, to block spam texts on smartphone, a simple string comparison between text messages and spam keywords or a blocking spam phone numbers is used. As results, spam text is sent in a gradually hanged way to prevent if from being automatically blocked. In particular, for words included in spam keywords, spam texts are sent to abnormal words using special characters, Chinese characters, and whitespace to prevent them from being detected by simple string match. There is a limit that traditional spam filtering methods can't block these spam texts well. Therefore, new technologies are needed to respond to changing spam text messages. In this paper, we propose a knowledge graph-based new words detection mechanism that can detect new words frequently used in spam texts and respond to changing spam texts. Also, we show experimental results of the performance when detected Korean new words are applied to the Naive Bayes algorithm.

Study on Effective Extraction of New Coined Vocabulary from Political Domain Article and News Comment (정치 도메인에서 신조어휘의 효과적인 추출 및 의미 분석에 대한 연구)

  • Lee, Jihyun;Kim, Jaehong;Cho, Yesung;Lee, Mingu;Choi, Hyebong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.2
    • /
    • pp.149-156
    • /
    • 2021
  • Text mining is one of the useful tools to discover public opinion and perception regarding political issues from big data. It is very common that users of social media express their opinion with newly-coined words such as slang and emoji. However, those new words are not effectively captured by traditional text mining methods that process text data using a language dictionary. In this study, we propose effective methods to extract newly-coined words that connote the political stance and opinion of users. With various text mining techniques, I attempt to discover the context and the political meaning of the new words.

A Study of the New Chinese Words Under the Influence of Culture Content (문화 콘텐츠 영향의 신조 중국어 고찰)

  • Meng, Xiang-Shan;Lee, Kwang-Ho
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.8
    • /
    • pp.131-142
    • /
    • 2019
  • This paper is intended to examine and analyze the new Chinese words as the result of culture content. The development of the Korean entertainment industry has created a Korean wave around the world. Through this, many Korean words, Internet vocabulary, and cultural concepts have begun to enter China. Among them, there are many new words that have appeared on the Chinese Internet due to the culture content. As the number of Korean fans and Korean learners increases, new words on the Internet are widely used. The new Chinese words, which are influenced by Korean cultural content, are considered an important part of new Chinese vocabulary. To accurately recognize and understand this, first of all six categories of the new Chinese words were analyzed, which were figurative meaning, substitution, loan of foreign words, abbreviation, compound word, derivation. This formulation also works on the Chinese words with the influence of cultural content. There are three types of the Internet new words form Korean cultural. Which were new words in Chinese characters, new words in alphabets, extended meanings. And had analyzed new words through the acquisition of new meanings. Also took specific news titles and songs according to each category. Through new Chinese words, The influence of cultural content had been confirmed. It is expected that these new Chinese words enrich Chinese vocabulary, also help to facilitate communication. And these new Chinese words are often used in public media or in everyday life. We should recognize the existence of these new Chinese words, and have an accurate perception of them.

인터넷 산업의 공간적 분포 특성에 관한 연구

  • 이희연
    • Proceedings of the KGS Conference
    • /
    • 2003.11a
    • /
    • pp.100-105
    • /
    • 2003
  • 최근 정보통신기술의 발달과 그에 따른 변화에 있어서 가장 주목할만한 점은 인터넷의 확산이라고 볼 수 있다. 인류역사상 가장 빠른 속도로 확산된 미디어로 '제3의 혁명' 이라고까지 일컬어지고 있는 인터넷은 세계적으로5.4억 명 이상의 사용자가 있으며, 국내 이용자도 약 2,500만 명에 이르고 있다. 이러한 인터넷은 단순한 정보교환의 수단만이 아니라 새로운 시장을 창출하고 기업의 비용을 혁신적으로 절감시키면서 인터넷 관련 산업들이 새로운 경제의 주축을 차지하게 되면서 디지털 경제, 인터넷 경제, 또는 신경제라는 신조어들이 등장하고 있다. (중략)

  • PDF

Sensitivity Identification Method for New Words of Social Media based on Naive Bayes Classification (나이브 베이즈 기반 소셜 미디어 상의 신조어 감성 판별 기법)

  • Kim, Jeong In;Park, Sang Jin;Kim, Hyoung Ju;Choi, Jun Ho;Kim, Han Il;Kim, Pan Koo
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.51-59
    • /
    • 2020
  • From PC communication to the development of the internet, a new term has been coined on the social media, and the social media culture has been formed due to the spread of smart phones, and the newly coined word is becoming a culture. With the advent of social networking sites and smart phones serving as a bridge, the number of data has increased in real time. The use of new words can have many advantages, including the use of short sentences to solve the problems of various letter-limited messengers and reduce data. However, new words do not have a dictionary meaning and there are limitations and degradation of algorithms such as data mining. Therefore, in this paper, the opinion of the document is confirmed by collecting data through web crawling and extracting new words contained within the text data and establishing an emotional classification. The progress of the experiment is divided into three categories. First, a word collected by collecting a new word on the social media is subjected to learned of affirmative and negative. Next, to derive and verify emotional values using standard documents, TF-IDF is used to score noun sensibilities to enter the emotional values of the data. As with the new words, the classified emotional values are applied to verify that the emotions are classified in standard language documents. Finally, a combination of the newly coined words and standard emotional values is used to perform a comparative analysis of the technology of the instrument.

English Word Game System Recognizing Newly Coined Words (신조어를 인식할 수 있는 영어단어 게임시스템)

  • Shim, Dong-uk;Park, So-young;Kim, Ki-sub;Kang, Han-gu;Jang, Jun-ho;Kim, Dae-woong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.521-524
    • /
    • 2009
  • Everyone can easily acquire learning materials on web environment that rapidly develops. Because the importance of English education has been emphasized day by day, many English education systems are introduced. However, previous most English education systems support only single user mode, and cannot deal with a newly coined word such as 'WIKIPEDIA'. In order to lead a user's learning ability with interest and enjoyment, this paper propose an online English word game system implementing a 'scrabble' board game. The proposed English word game system has the following characteristics. First, the proposed system supports both single user mode and multi user mode with a virtual user based on artificial intelligence. Second, the proposed system can recognize newly coined words such as 'WIKIPEDIA' by using NEVER Open API dictionary. Third, the proposed system offers familiar user interface so that a user can play the game without any manual. Therefore, it is expected that the proposed system can help users to learn English words with interest and enjoyment.

  • PDF