• Title/Summary/Keyword: multi-language

Detection of Incivility based on Attention-embedding and multi-channel CNN (어텐션임베딩과 다채널 CNN 기반 반시민성 검출 알고리즘)

  • Park, Youn-Jung;Lee, Se-Young;Keum, Hee-Jo
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.12 / pp.1880-1889 / 2022
  • Online portal platforms publish news together with reader comments, but the anonymity of commenting fosters incivility, and online comments have come to be regarded as a social problem. While many incivility detection studies exist for other languages, in-depth research has not been conducted for Korean because no Korean dataset labeled with detailed incivility criteria has been available. In this study, comments were annotated for incivility across a total of 13 items, and the uncivil words were summarized. An attention algorithm was then applied to each comment and its summary to extract embedding vectors, followed by a 2-D CNN that detects incivility in the given data. The results show that the proposed algorithm is useful for detecting incivility such as name-calling and offensive tone. This study is expected to contribute to a healthy online comment culture by detecting uncivil comments that hinder democratic discourse.

A Named Entity Recognition Platform Based on Semi-Automatically Built NE-annotated Corpora and KoBERT (반자동구축된 개체명 주석코퍼스 DecoNAC과 KoBERT를 이용한 개체명인식 플랫폼 DecoNERO)

  • Kim, Shin-Woo;Hwang, Chang-Hoe;Yoon, Jeong-Woo;Lee, Seong-Hyeon;Choi, Soo-Won;Nam, Jee-Sun
    • Annual Conference on Human and Language Technology / 2020.10a / pp.304-309 / 2020
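  • This study introduces DecoNERO, a named entity recognition platform. The platform semi-automatically builds the NE-annotated corpus DecoNAC from the Korean electronic dictionary DECO (Dictionnaire Electronique du COreen) and a Local-Grammar Graph (LGG) framework that describes multi-word expression (MWE) named entities as partial patterns, then uses the corpus both for named entity analysis and as domain-specific training data for machine learning. Machine learning methods that have recently been reported to perform well require large-scale training data drawn from diverse domains. This study proposes a methodology that generates training data semi-automatically from carefully designed named entity dictionaries and language resources for multi-word named entity sequences. To evaluate the DecoNAC-based approach, experiments were run on online news article text. They confirmed that applying DecoNAC can be expected to improve performance by about 7.49% over recognizing named entities with the KoBERT model alone.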
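
To make the idea of dictionary-driven, semi-automatic annotation concrete, here is a minimal sketch. It is not the DecoNAC pipeline: a plain longest-match lookup over a hypothetical lexicon stands in for the DECO dictionary and LGG patterns, and the `annotate` function, the lexicon entries, and the BIO tag scheme are all assumptions made for illustration.

```python
def annotate(tokens, lexicon):
    """Tag tokens with BIO labels using longest-match lexicon lookup.

    `lexicon` maps a tuple of tokens to an entity label. This is a
    hypothetical stand-in for a real entity dictionary.
    """
    max_len = max((len(key) for key in lexicon), default=0)
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            label = lexicon.get(tuple(tokens[i:i + n]))
            if label:
                tags[i] = "B-" + label
                for k in range(i + 1, i + n):
                    tags[k] = "I-" + label
                i += n
                break
        else:
            # No lexicon entry starts at this token.
            i += 1
    return tags


# Illustrative lexicon; the entries are invented for this example.
LEXICON = {
    ("Seoul", "National", "University"): "ORG",
    ("Seoul",): "LOC",
}
```

Sentences tagged this way could then be reviewed by hand and used as training data for a model, which is the "semi-automatic" part of the workflow described in the abstract.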

The Function of the Author and the Poetic Experiments in Lyrical Ballads of 1798 (1798년 『서정민요집』의 저자의 기능과 시적 실험)

  • Joo, Hyeuk Kyu
    • Journal of English Language & Literature / v.56 no.5 / pp.973-998 / 2010
  • This paper assesses the significance of Lyrical Ballads of 1798, the agreed inaugurator of English Romanticism, in terms of such key concepts as poetic "experiments," "conversation," and the authorial function. The 1798 volume marks an interesting instance in which an author with no tangible substantiality can wield his authorial function over his works. The volume is signed without a proper name; its author is neither William Wordsworth nor Samuel Taylor Coleridge. The figure of the author in this case is realized by the poems he writes; he produces, and is produced by, his works, a fact that constitutes part of the poetic experiments announced in the Advertisement. Working under this reciprocal production, the Author of the 1798 volume and his poems collectively aim at establishing a new class of poetry and an interpretive community. The notion of "conversation" is a key element in the thematic and stylistic ties among individual poems. Poems of the 1798 volume effect multi-layered, "blended" voices. Readers are expected to draw out the topological interweaving among poems through the practice of dialogic reading. In this light, the sequential necessity of "The Rime" and "Tintern Abbey" should be emphasized. They are stitched together by a logic of textual placement, and the transition from one to the other is never arbitrary. Most of all, they work under the same authorial function, complementing each other and addressing the same poetic project in different textual locations. As an inaugural work of English Romanticism, Lyrical Ballads of 1798 makes many things happen and yet elusively anticipates something still to come. The value of these poetic experiments should be judged not only by what is claimed in the volume, but also by what it sets out to do and "how far" it will be performed, as implied in the Advertisement. The efficacy of the volume, more than anything else, depends on the performative power of words.

A Discord among Individual, Race, and History: Focused on Philip Roth's The Plot Against America (개인, 인종, 그리고 역사의 불협화음 -필립 로스의 『미국에 대한 음모』를 중심으로)

  • Jang, Jung-hoon
    • Journal of English Language & Literature / v.58 no.5 / pp.809-837 / 2012
  • Philip Roth rejects the narrative unity and singularity of the traditional novel and creates instead a multi-levelled, fragmentary, and repetitive narrative. It is not easy to distinguish fact from fiction in The Plot Against America. As an entertaining and creative work of postmodern historiographic metafiction, the novel interrogates the existence of historically verifiable facts and the validity of authentic, official versions of history, and reexamines the narrative conventions of history writing. The aim of this paper is to examine Roth's narrative experiment, or 'thought experiment', and to explore his intention in creating an alternative history. In this cautionary "what if" political fable, Roth hypothesizes that in 1940 the aviation hero Charles Lindbergh, an ardent isolationist who was sympathetic to Hitler, won the presidency. Jewish communities are stunned and terrified as America flirts with fascism and anti-Semitism. Reimagining his childhood, with considerable fact mixed in with the fiction, Roth narrates an alternative history that has an unsettling plausibility. Roth has constructed a brilliantly telling and disturbing historical prism through which to refract the American psyche as it pertains to the discord among individual, race, and history. In this work Roth analyzes the life of the individual in a historic space, the situation of anti-Semitism in a world of invisible order, racial conflict between black and white in a world of visible order, and the darkest side of national power. Roth's stories argue for the equality of various cultures grounded in the common notion of humanity, for an ethic of mutual respect, and for the peaceful resolution of conflicts.

Intrusion Detection System based on Packet Payload Analysis using Transformer

  • Woo-Seung Park;Gun-Nam Kim;Soo-Jin Lee
    • Journal of the Korea Society of Computer and Information / v.28 no.11 / pp.81-87 / 2023
  • Intrusion detection systems that learn the metadata of network packets have been proposed recently. However, these approaches require time to analyze packets and generate metadata for model learning, and further time to pre-process the metadata before learning. In addition, models trained on specific metadata cannot detect intrusions directly from the original packets as they flow into the network. To address this problem, this paper proposes a natural language processing-based intrusion detection system that detects intrusions by learning the packet payload as a single sentence, without an additional conversion process. To verify the performance of the approach, the authors used the UNSW-NB15 dataset and Transformer models. First, the PCAP files of the dataset were labeled, and then two Transformer models (BERT and DistilBERT) were trained directly on the payloads in sentence form to analyze detection performance. The experimental results showed binary classification accuracies of 99.03% and 99.05%, respectively, which is similar or superior to the detection performance of techniques proposed in previous studies. Multi-class classification showed better performance, at 86.63% and 86.36%, respectively.
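
As a rough illustration of treating a payload as a sentence, the sketch below renders each payload byte as a two-digit hex "word". The abstract does not say how the authors turned payloads into text, so the hex-token format, the `payload_to_sentence` name, and the `max_tokens` cap are assumptions for illustration only.

```python
def payload_to_sentence(payload: bytes, max_tokens: int = 128) -> str:
    """Render raw payload bytes as a space-separated hex 'sentence'.

    Assumption: one two-digit hex token per byte, truncated to
    `max_tokens` so the text stays within a model's input limit.
    """
    return " ".join(f"{byte:02x}" for byte in payload[:max_tokens])


# Example: the first bytes of an HTTP request line.
sentence = payload_to_sentence(b"GET /")
print(sentence)  # 47 45 54 20 2f
```

A string in this form could be passed to a standard subword tokenizer and a sequence classifier; that step is omitted here because the specific model setup is not described in the abstract.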

Does Cultural Intelligence enhance Export SME's Capability for Utilizing Foreign Market Informations? (문화 인텔리전스는 수출중소기업의 해외시장정보 활용능력을 키우는가?)

  • Hong, Songhon
    • International Commerce and Information Review / v.19 no.1 / pp.127-152 / 2017
  • One of the biggest challenges in the export activities of SMEs is increasing cultural diversity, which requires export managers in particular to adapt the way they do business effectively across many kinds of cross-cultural situations. The relatively new concept of 'Cultural Intelligence' has been evaluated as a key element of success in international business activities. This study empirically investigates the role of Cultural Intelligence (CQ), one component of cultural competence, in export marketing adaptation. The hypotheses were tested with Structural Equation Modeling using PLS. The results are as follows. Cultural Intelligence moderates the relationship between information-seeking and information-using abilities, and this moderating role is stronger in marketing adaptation than in relationship adaptation. Critical factors affecting Cultural Intelligence are also discussed: foreign language fluency, business travel abroad, the characteristics of that business travel, multi-lingual ability, prior education in culture-related subjects, and experience visiting foreign countries. In particular, export managers' foreign language ability has a much stronger influence on Cultural Intelligence. The results provide important implications for export SMEs and export-supporting organizations. Export firms and supporting organizations should broadly expand multicultural training and education programs to help managers gain a better understanding of varied export environments.

Multi-stage News Classification System for Predicting Stock Price Changes (주식 가격 변동 예측을 위한 다단계 뉴스 분류시스템)

  • Paik, Woo-Jin;Kyung, Myoung-Hyoun;Min, Kyung-Soo;Oh, Hye-Ran;Lim, Cha-Mi;Shin, Moon-Sun
    • Journal of the Korean Society for information Management / v.24 no.2 / pp.123-141 / 2007
  • Predicting stock prices is known to be very difficult because a large number of known and unknown factors, and their interactions, can influence the price. However, we started with the simple assumption that good news about a particular company is likely to push its stock price up, and bad news to push it down. This assumption was verified by manually analyzing how stock prices changed after the relevant news stories were released. It follows that stock price changes can be predicted to a certain degree if there is a reliable method for classifying news stories as either favorable or unfavorable toward the company mentioned in the news. To classify a large number of news stories consistently and rapidly, we developed and evaluated a natural language processing-based multi-stage news classification system, which categorizes news stories as either good or bad. The evaluation result was promising: the automatic classification led to better-than-chance prediction of stock price changes.

Development of a Remotely Sensed Image Processing/Analysis System : GeoPixel Ver. 1.0 (JAVA를 이용한 위성영상처리/분석 시스템 개발 : GeoPixel Ver. 1.0)

  • 안충현;신대혁
    • Korean Journal of Remote Sensing / v.13 no.1 / pp.13-30 / 1997
  • Recent improvements in satellite remote sensing sensors, represented by hyperspectral imaging sensors and high spatial resolution sensors, provide a large amount of data, typically several hundred megabytes per scene. Moreover, increasing information exchange via the internet and the information superhighway requires the development of more active service systems for processing and analysing remote sensing data in order to provide value-added products. To this end, an advanced satellite data processing system is being developed to achieve high computing speed and efficiency in processing a huge volume of data, and to enable network computing and easy improvement, upgrading, and management of the system. The Java programming language provides several advantages for developing software, such as object-oriented programming, multi-threading, and robust memory management. Using these features, a satellite data processing system named GeoPixel has been developed in Java. GeoPixel adopts newly developed techniques, including an object-pipe connection method between processes and a multi-threading structure. As a result, the system is platform-independent and processes huge volumes of remote sensing data efficiently and robustly. In an evaluation of its data processing capability, the system showed satisfactory results in its use of computer resources (CPU and memory) and in processing speed.
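
The abstract does not describe how GeoPixel's object-pipe connections are implemented. As a loose analogue only, the sketch below chains processing stages with one thread per stage and a queue between neighbouring stages, written in Python rather than Java. The names `run_pipeline` and `_stage` and the example stage functions are hypothetical.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream


def _stage(func, inbox, outbox):
    """Apply `func` to every item from `inbox` and forward it to `outbox`."""
    while True:
        item = inbox.get()
        if item is _DONE:
            outbox.put(_DONE)  # propagate shutdown downstream
            return
        outbox.put(func(item))


def run_pipeline(items, funcs):
    """Run `items` through `funcs` in order, one worker thread per stage."""
    pipes = [queue.Queue() for _ in range(len(funcs) + 1)]
    workers = [
        threading.Thread(target=_stage, args=(f, pipes[i], pipes[i + 1]))
        for i, f in enumerate(funcs)
    ]
    for worker in workers:
        worker.start()
    for item in items:
        pipes[0].put(item)
    pipes[0].put(_DONE)

    results = []
    while True:
        out = pipes[-1].get()
        if out is _DONE:
            break
        results.append(out)
    for worker in workers:
        worker.join()
    return results


def scale(row):
    """Example stage: double every pixel value in a row."""
    return [2 * value for value in row]


def threshold(row):
    """Example stage: binarize a row at a fixed level of 5."""
    return [1 if value > 5 else 0 for value in row]
```

Because each stage has a single worker reading from a FIFO queue, output order matches input order, while different rows can be in different stages at the same time.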

Variable Cut-off Frequency and Variable Sample Rate Small-Area Multi-Channel Digital Filter for Telemetry System (텔레메트리 시스템을 위한 가변 컷 오프 주파수 및 가변 샘플 레이트 저면적 다채널 디지털 필터 설계)

  • Kim, Ho-keun;Kim, Jong-guk;Kim, Bok-ki;Lee, Nam-sik
    • Journal of Advanced Navigation Technology / v.25 no.5 / pp.363-369 / 2021
  • In this paper, we propose a small-area multi-channel digital filter with variable cut-off frequency and variable sample rate for telemetry systems. The proposed filter reduces hardware area by implementing filter banks whose cut-off frequency and sample rate can be varied, so that no additional filter banks are needed for an arbitrary cut ratio. In addition, we propose an architecture in which the sample rate can be selected according to the number of filters a signal passes through, under multiplexer control. By using the time division multiplexing (TDM) supported by the finite impulse response (FIR) intellectual property (IP) of Quartus, the proposed filter reduces the number of digital signal processing (DSP) blocks from 80 to 1 compared with a design without TDM. The filter order and coefficients were calculated with the Kaiser window function in Matlab, and the filter was implemented in very high speed integrated circuits hardware description language (VHDL). After applying the filter to a telemetry system, we confirmed through experimental results in a test environment that it operates correctly.
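
The abstract states that order and coefficients came from the Kaiser window function in Matlab but gives no specifications. The sketch below shows the standard Kaiser-window design steps in plain Python, using Kaiser's empirical formulas for the shape parameter and the order estimate. The 60 dB attenuation, 0.1π transition width, and 0.25π cut-off used in the usage lines are illustrative assumptions, not values from the paper.

```python
import math


def kaiser_beta(atten_db):
    """Kaiser's empirical shape parameter for a stopband attenuation in dB."""
    if atten_db > 50:
        return 0.1102 * (atten_db - 8.7)
    if atten_db >= 21:
        return 0.5842 * (atten_db - 21) ** 0.4 + 0.07886 * (atten_db - 21)
    return 0.0


def kaiser_order(atten_db, width_rad):
    """Estimated filter order for a transition width in rad/sample."""
    return math.ceil((atten_db - 8) / (2.285 * width_rad))


def bessel_i0(x, terms=40):
    """Zeroth-order modified Bessel function via its power series."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / 2) / k
        total += term * term
    return total


def lowpass_taps(order, cutoff_rad, beta):
    """Windowed-sinc low-pass coefficients (order + 1 taps)."""
    alpha = order / 2
    taps = []
    for n in range(order + 1):
        m = n - alpha
        # Ideal low-pass impulse response, with the m == 0 limit handled.
        ideal = cutoff_rad / math.pi if m == 0 else math.sin(cutoff_rad * m) / (math.pi * m)
        arg = max(0.0, 1.0 - (m / alpha) ** 2)
        window = bessel_i0(beta * math.sqrt(arg)) / bessel_i0(beta)
        taps.append(ideal * window)
    return taps


# Illustrative specification (assumed, not taken from the paper).
beta = kaiser_beta(60.0)
order = kaiser_order(60.0, 0.1 * math.pi)
taps = lowpass_taps(order, 0.25 * math.pi, beta)
```

Changing `cutoff_rad` yields a new coefficient set for the same filter structure, which is the kind of coefficient reload a variable cut-off design relies on.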

High-resolution Urban Flood Modeling using Cellular Automata-based WCA2D in the Oncheon-cheon Catchment in Busan, South Korea (셀룰러 오토마타 기반 WCA2D 모형을 이용한 부산 온천천 유역 고해상도 도시 침수 해석)

  • Choi, Hyeonjin;Lee, Songhee;Woo, Hyuna;Noh, Seong Jin
    • KSCE Journal of Civil and Environmental Engineering Research / v.43 no.5 / pp.587-599 / 2023
  • As climate change increases the frequency and risk of flooding in major cities around the world, simulation technology that can quickly and accurately analyze high-resolution 2D flooding information over large areas is becoming more important. Physically based approaches built on the Shallow Water Equations (SWE) often require huge computing resources, which hinders high-resolution flood prediction. This study investigated the theoretical background of Weighted Cellular Automata 2D (WCA2D), which simulates spatio-temporal changes in flooding using transition rules and a weight-based system, and assessed its feasibility for simulating pluvial flooding in an urban catchment, the Oncheon-cheon catchment in Busan, South Korea. In addition, computational performance was compared across versions using the Open Computing Language (OpenCL) and Open Multi-Processing (OpenMP) parallel computing techniques. Simulation results showed that the maximum inundation depth map produced by the WCA2D model reproduces historical inundation maps closely. The model can also precisely simulate spatio-temporal changes in flooding extent in an urban catchment with complex topographic characteristics. In terms of computational efficiency, the parallel computing schemes OpenCL and OpenMP sped up the computation by about 8 to 14 times and 5 to 6 times, respectively, compared with sequential computation.
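
To give a feel for weight-based cellular-automaton flood routing, here is a deliberately simplified 1D toy. It is not the WCA2D transition rule set: the `step` function, the `relax` factor, and the rule that outflow is split in proportion to water-level drops are assumptions chosen to keep the example short while conserving water volume.

```python
def step(ground, depth, relax=0.5):
    """Advance a toy 1D water-redistribution automaton by one step.

    Each wet cell sends water to lower neighbours, weighted by the
    drop in water level (ground elevation + depth) towards each one.
    """
    n = len(depth)
    level = [g + d for g, d in zip(ground, depth)]
    delta = [0.0] * n
    for i in range(n):
        if depth[i] <= 0:
            continue
        # Water-level drops towards neighbours that are lower.
        drops = {
            j: level[i] - level[j]
            for j in (i - 1, i + 1)
            if 0 <= j < n and level[i] > level[j]
        }
        if not drops:
            continue
        total = sum(drops.values())
        # Never move more water than the cell holds.
        outflow = min(depth[i], relax * max(drops.values()))
        for j, drop in drops.items():
            share = outflow * drop / total
            delta[i] -= share
            delta[j] += share
    return [d + x for d, x in zip(depth, delta)]


# Water placed at the top of a slope moves one cell downhill.
print(step([2.0, 1.0, 0.0], [1.0, 0.0, 0.0]))  # [0.0, 1.0, 0.0]
```

All updates are computed from the state at the start of the step and applied together, so each cell's update is independent of the others. That independence is what makes automata of this kind well suited to OpenMP or OpenCL parallelization.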