• Title/Summary/Keyword: Language value

An Equal Pair: The Dialogic Narrative Scheme in Bleak House

  • Kim, Myungjin
    • Journal of English Language & Literature
    • /
    • v.55 no.6
    • /
    • pp.993-1011
    • /
    • 2009
  • Generally, the parts narrated by Esther in Bleak House have been considered less convincing and reliable than those by the anonymous narrator because of some problematic qualities in her character and narration. However, Esther's narrative shows Dickens' masterly depiction of emotional deprivation, the psychic consequence of Victorian sexual repression for its victim. Therefore, restoring the reliability of Esther's narrative is the prerequisite for claiming its value as an appropriate locus of the meanings of the text. On the other hand, the anonymous narrator is not as omniscient as he has been regarded. As the chapters proceed, his omniscient power and authority are conspicuously weakened, and even transferred to other characters such as Esther and Mr. Bucket. This shows that the identity of the omniscient voice is unstable and that Dickens does not intend this voice to be the sole center of the meanings of the text. In short, these two narratives are necessary partners in imagining and understanding the society in its wholeness. Alternating and sometimes intersecting throughout the novel, these opposing viewpoints make us see the contradictory, multi-leveled nature of Victorian society. Their equality implies Dickens' notion that more than a single unified voice is needed to portray the ideological conflicts of his age.

Multi Domain Dialog State Tracking using Domain State (도메인 상태를 이용한 다중 도메인 대화 상태 추적)

  • Jeon, Hyunmin;Lee, Geunbae
    • Annual Conference on Human and Language Technology
    • /
    • 2020.10a
    • /
    • pp.421-426
    • /
    • 2020
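  • In multi-domain task-oriented dialogue, existing deep-learning approaches to dialog state tracking have studied models that take as input the dialogue between user and system accumulated over multiple turns and extract slot values. However, the computational cost of these models grows as the dialogue becomes longer. This paper therefore proposes a method for extracting slot values in multi-domain dialogue without the accumulated dialogue history. Simply removing the history and feeding in only the current turn's utterance, however, leads to a loss of contextual information. We therefore introduce a domain state and propose a model that tracks it together with the dialog state at every turn. Tracking the domain state alongside the dialog state identifies which domain the conversation is currently about. In addition, the previous turn's dialog state and domain state, which carry condensed contextual information, are given as input together with the current turn's utterance, reducing the loss of information. Experiments on the representative datasets MultiWOZ 2.0 and MultiWOZ 2.1 confirm that the model achieves good dialog state tracking performance without using the dialogue history. Moreover, because the dependence on system responses and past utterances is removed, extension to an end-to-end dialogue system is expected to be easier.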
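
The history-free input described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the `[DOMAIN]`, `[STATE]` and `[USER]` markers, the function names and the slot names are hypothetical and are not taken from the paper, and the slot extractor itself is left out.

```python
def build_turn_input(prev_dialog_state, prev_domain_state, utterance):
    """Serialize the previous turn's states plus the current utterance.

    The bracketed markers are hypothetical separators, not the paper's format.
    """
    active_domains = " ".join(
        sorted(d for d, active in prev_domain_state.items() if active)
    )
    slots = " ; ".join(
        f"{slot} = {value}" for slot, value in sorted(prev_dialog_state.items())
    )
    return f"[DOMAIN] {active_domains} [STATE] {slots} [USER] {utterance}"


def apply_turn_update(prev_dialog_state, slot_updates):
    """Carry the state forward: new slot values overwrite old ones."""
    new_state = dict(prev_dialog_state)
    new_state.update(slot_updates)
    return new_state


# Hypothetical example turn; slot names are invented for illustration.
prev_state = {"hotel-area": "north"}
prev_domains = {"hotel": True, "taxi": False}
model_input = build_turn_input(prev_state, prev_domains, "I also need a taxi at 5 pm.")

# Pretend a model extracted this update from model_input.
next_state = apply_turn_update(prev_state, {"taxi-leaveat": "17:00"})
```

In this sketch the input length depends on the size of the carried state rather than on the number of past turns, which is the property the abstract attributes to dropping the dialogue history.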

Chinese Multi-domain Task-oriented Dialogue System based on Paddle (Paddle 기반의 중국어 Multi-domain Task-oriented 대화 시스템)

  • Deng, Yuchen;Joe, Inwhee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.308-310
    • /
    • 2022
  • With the rise of the AI wave, task-oriented dialogue systems have become one of the popular research directions in academia and industry. Currently, task-oriented dialogue systems mainly adopt a pipelined form, which consists of natural language understanding, dialogue state decision making, dialogue state tracking and natural language generation. However, pipelining is prone to error propagation, so many task-oriented dialogue systems on the market handle only single-round dialogues. Single-domain dialogue systems usually achieve relatively accurate semantic understanding, but they tend to perform poorly on multi-domain, multi-round dialogue datasets. To solve these issues, we developed a Paddle-based multi-domain task-oriented Chinese dialogue system. It is based on the NEZHA-base pre-trained model and the CrossWOZ dataset, and uses an intention recognition module, a dichotomous slot recognition module and an NER module to perform dialogue state tracking (DST) and generate replies based on rules. Experiments show that the dialogue system not only makes good use of the context, but also effectively addresses long-term dependencies. In our approach, DST is improved, and our DST can identify multiple slot key-value pairs involved in the discourse, which eliminates the need for manual tagging and thus greatly saves manpower.

Toward Shared Grounds Between Environmental Pragmatism and Foundationalist Ecology (실용주의 환경론과 근본주의 생태론의 접점 모색)

  • Kang, Yong-Ki
    • Journal of English Language & Literature
    • /
    • v.56 no.1
    • /
    • pp.47-64
    • /
    • 2010
  • It is unfair that environmental pragmatism has been regarded as a mouthpiece for industrial expediency and business boosterism. John Dewey's radical pragmatism, known as 'Instrumentalism,' has provoked ecological fundamentalists' criticism more vehemently than any other pragmatist philosophy. However, most of the presumptive misunderstandings of such critics as Holmes Rolston, J. Baird Callicott, Eric Katz, C. A. Bowers and many others come from their limited or reductive reading of Deweyan pragmatism. The following three aspects of Deweyan pragmatism can help open up a dialogical space with the eco-centrist thinkers mentioned above. First, Dewey's concept of 'primary experience' can articulate the foundationalist view of nature, which is often found in aboriginal cultures. Second, as Andrew Light points out, ecological essentialism can share its metaphilosophical position with the pragmatist epistemology. While Anthony Weston pursues pluralism, admitting that foundationalism might be one of the efficient approaches to nature, Eric Katz is also clearly attracted to the metaphilosophical element in Weston's argument that anyone who attempts to claim the 'inherent value' of non-human nature can never avoid the pitfall of anthropomorphism. Lastly, in a more comprehensive perspective, Dewey's pragmatism shows a philosophical complexity, what Larry A. Hickman calls 'post-postmodernism,' a dynamic interaction between modernism and postmodernism. Significantly enough, the environmental version of this complexity can procure a meeting ground between foundationalist ecology and the pragmatic view of nature.

CORRECT? CORECT!: Classification of ESG Ratings with Earnings Call Transcript

  • Haein Lee;Hae Sun Jung;Heungju Park;Jang Hyun Kim
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.4
    • /
    • pp.1090-1100
    • /
    • 2024
  • While incorporating ESG indicators is recognized as crucial for sustainability and increased firm value, inconsistent disclosure of ESG data and vague assessment standards have been key challenges. To address these issues, this study proposes an ambiguous text-based automated ESG rating strategy. Earnings Call Transcript data were classified as E, S, or G using the Refinitiv-Sustainable Leadership Monitor's over 450 metrics. The study employed advanced natural language processing techniques such as BERT, RoBERTa, ALBERT, FinBERT, and ELECTRA models to precisely classify ESG documents. In addition, the authors computed the average predicted probabilities for each label, providing a means to identify the relative significance of different ESG factors. The experimental results demonstrated the capability of the proposed methodology to enhance ESG assessment criteria established by various rating agencies and highlighted that companies primarily focus on governance factors. In other words, companies were making efforts to strengthen their governance framework. In conclusion, this framework enables sustainable and responsible business by providing insight into the ESG information contained in Earnings Call Transcript data.
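
The step of averaging predicted probabilities per label can be sketched as follows. This is only an illustration under stated assumptions: the function name is hypothetical, the probabilities are made-up numbers rather than results from the study, and the classifier that would produce them is not shown.

```python
from collections import defaultdict

LABELS = ("E", "S", "G")


def average_label_probabilities(predictions):
    """Average the predicted probability of each label over all passages.

    predictions: list of dicts mapping each label to its predicted probability.
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    totals = defaultdict(float)
    for probs in predictions:
        for label in LABELS:
            totals[label] += probs[label]
    n = len(predictions)
    return {label: totals[label] / n for label in LABELS}


# Hypothetical classifier outputs for three transcript passages (invented values).
example = [
    {"E": 0.10, "S": 0.20, "G": 0.70},
    {"E": 0.30, "S": 0.30, "G": 0.40},
    {"E": 0.20, "S": 0.10, "G": 0.70},
]
averages = average_label_probabilities(example)

# The label with the highest average is read as the dominant ESG factor.
dominant = max(averages, key=averages.get)
```

Comparing the three averages gives a rough ranking of how much weight the transcripts place on each factor, which is the kind of relative significance the abstract refers to.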

The fundamental frequency (f0) distribution of American speakers in a spontaneous speech corpus

  • Byunggon Yang
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.11-16
    • /
    • 2024
  • The fundamental frequency (f0), an acoustic measure of vocal fold vibration, serves as an indicator of the speaker's emotional state and of language-specific patterns in daily conversations. This study aimed to examine the f0 distribution in an English corpus of spontaneous speech, establishing normative data for American speakers. The corpus involved 40 participants engaging in free discussions on daily activities and personal viewpoints. Using Praat, f0 values were collected, with outliers filtered out after nonspeech sounds and interviewer voices were removed. Statistical analyses were performed with R. Results indicated a median f0 value of 145 Hz for all the speakers. The f0 values for all speakers exhibited a right-skewed, pointy distribution within a frequency range of 216 Hz from 75 Hz to 339 Hz. The female f0 range was wider than that of males, with a median of 113 Hz for males and 181 Hz for females. This spontaneous speech corpus provides valuable insights for linguists into f0 variation among individuals or groups in a language. Further research is encouraged to develop analytical and statistical measures for establishing reliable f0 standards for the general population.

The Study on the Analysis of the Rate of Information Acquisition and the Observation Time shown at the Observation of Interior Space (실내공간의 주시에 나타난 정보획득률과 주시시간 분석에 관한 연구)

  • Choi, Joo-Young;Kim, Joo-Hyun;Choi, Gae-Young;Lee, Jeong-Ho;Kim, Jong-Ha
    • Korean Institute of Interior Design Journal
    • /
    • v.20 no.6
    • /
    • pp.183-191
    • /
    • 2011
  • This study aims to establish an appropriate range of observation time by examining the characteristics of the observation time spent acquiring information about a space. The conclusions are as follows. First, even though the evaluation elements were the same across the three types of image evaluation, the amount of information acquired differed by type. By contrast, the average run-time for information acquisition changed little by type; in other words, the run-times were similar, yet the information acquired varied with the type. Second, in the evaluation by language media, the average value by element followed the order [shape>position>number>existence], with run-times ranging over 94.6~102.9 seconds. In the evaluation by visual media, the average rate of information acquisition followed the order [composition>shape>material&color], with run-times ranging over 93.1~99.7 seconds. Third, in the evaluation by language media, male subjects showed an information acquisition rate of 39.1~91.4% and a run-time of 85.1~106.0 seconds, while female subjects showed 46.0~94.6% and 96.3~112.3 seconds, respectively. In the evaluation by visual media, male subjects showed an information acquisition rate of 40.3~66.7% and a run-time of 82.4~97.9 seconds, while female subjects showed 42.2~71.0% and 94.0~115.1 seconds, respectively. Thus, for both language media and visual media, the female subjects' ranges of information acquisition rate and observation time were higher than the male subjects'.

Foreign Language Education of Korean Peninsula: Insights from Nogeldae (『노걸대』 분석을 통해서 바라본 우리 반도의 외국어 교육)

  • Kim, Jeong-ryeol
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.6
    • /
    • pp.408-414
    • /
    • 2017
  • This paper aims to investigate the value and resilience of Nogeoldae, which was written at the end of the Koryo dynasty and was used as the most important foreign language teaching material throughout the 500 years of the Chosun dynasty. To this end, 106 volumes of dialogues, 12 of meeting, 17 of lodging, 21 of Daedo bound, 34 of Daedo lives and 11 of return in Nogeoldae are analyzed in terms of average sentence length, average word length, type-token ratio, number of words before main verbs and number of words before nouns, in order to identify how the complexity progresses. The result of the analysis shows that Nogeoldae presents the kind of progressive complexity desired in modern foreign language textbooks.
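
Three of the measures named in this abstract (average sentence length, average word length and type-token ratio) can be sketched as below. The sketch rests on assumptions: it splits on whitespace-style English tokens with simple regular expressions, whereas the source text would need a tokenizer suited to its language, and the sample dialogue is invented for illustration rather than quoted from Nogeoldae.

```python
import re


def split_sentences(text):
    """Naive sentence split on ., ! and ? (an assumption for this sketch)."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def tokenize(text):
    """Lowercased alphabetic tokens; real data would need a proper tokenizer."""
    return re.findall(r"[a-z']+", text.lower())


def complexity_measures(text):
    sentences = split_sentences(text)
    tokens = tokenize(text)
    if not sentences or not tokens:
        raise ValueError("text must contain at least one sentence and one word")
    return {
        # words per sentence
        "avg_sentence_length": len(tokens) / len(sentences),
        # characters per word
        "avg_word_length": sum(len(t) for t in tokens) / len(tokens),
        # distinct word forms divided by total words
        "type_token_ratio": len(set(tokens)) / len(tokens),
    }


# Invented sample dialogue, used only to exercise the functions.
sample = "Where do you come from? I come from Koryo. Where do you go now?"
measures = complexity_measures(sample)
```

Computing these values chapter by chapter and comparing them in order would show whether the measures rise through the book, which is how progressive complexity could be checked.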

Applying Traditional Korean Medical Terms to SUI in the Unified Medical Language System (UMLS) Metathesaurus

  • Hong, Seong-Cheon;Jeong, Heon-Young;Jeon, Byong-Uk
    • Journal of the Korean Institute of Oriental Medical Informatics
    • /
    • v.16 no.1
    • /
    • pp.1-8
    • /
    • 2010
  • Objective: Controlled vocabularies such as thesauri and classifications allow information to be reused and shared effectively by defining distinct concepts and linking terms to one another. The UMLS (Unified Medical Language System) is one of the most widely used medical terminology systems. Various methods are needed to share and reuse information in traditional Korean medicine. We investigate how to adopt the SUI of the UMLS, the de facto standard among medical terminology systems, for traditional Korean medical terminology. Method: We describe the major problems and the application process encountered when trying to add traditional Korean medical terms relating to the meridians to the UMLS Metathesaurus. A comparison of western medical terms and traditional Korean medical terms for application to the UMLS Metathesaurus shows many consistencies but also differences. Result: We identified the differences and consistencies between western medical terms and traditional Korean medical terms, and then reviewed methods for applying the CUI, LUI and SUI to traditional Korean medical terms. Traditional Korean medical terms are not distinguished by singular or plural strings. In addition, their strings vary according to the initial law, the rule governing the initial sound of a syllable. Characters are written in Korean, traditional Chinese, modern Chinese, etc. The SUI takes a distinct value for each meaning, language and initial-law variant. Conclusion: Applying the UMLS differs in many ways between western medical terms and traditional Korean medical terms. To better incorporate traditional Korean medicine into the UMLS, further research is needed on the standardization and classification of traditional Korean medical terms, medical information systems, etc. We hope this study helps the future implementation of the UMLS, EHR and knowledge-based systems in Oriental medicine.

Optical Character Recognition for Hindi Language Using a Neural-network Approach

  • Yadav, Divakar;Sanchez-Cuadrado, Sonia;Morato, Jorge
    • Journal of Information Processing Systems
    • /
    • v.9 no.1
    • /
    • pp.117-140
    • /
    • 2013
  • Hindi is the most widely spoken language in India, with more than 300 million speakers. Because the characters of texts written in Hindi are not separated as they are in English, the Optical Character Recognition (OCR) systems developed for the Hindi language have a very poor recognition rate. In this paper we propose an OCR for printed Hindi text in Devanagari script, using an Artificial Neural Network (ANN), which improves its efficiency. One of the major reasons for the poor recognition rate is error in character segmentation. The presence of touching characters in the scanned documents further complicates the segmentation process, creating a major problem when designing an effective character segmentation technique. Preprocessing, character segmentation, feature extraction, and finally, classification and recognition are the major steps followed by a general OCR system. The preprocessing tasks considered in the paper are conversion of grayscale images to binary images, image rectification, and segmentation of the document's textual contents into paragraphs, lines, words, and then basic symbols. The basic symbols, obtained as the fundamental unit from the segmentation process, are recognized by the neural classifier. In this work, three feature extraction techniques, namely histogram of projection based on mean distance, histogram of projection based on pixel value, and vertical zero crossing, have been used to improve the rate of recognition. These feature extraction techniques are powerful enough to extract features of even distorted characters/symbols. For development of the neural classifier, a back-propagation neural network with two hidden layers is used. The classifier is trained and tested on printed Hindi texts. A correct recognition rate of approximately 90% is achieved.
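
Two of the feature types named in this abstract can be sketched on a tiny binary glyph. The definitions used here are common textbook readings and are assumptions, not necessarily the paper's exact formulations: the pixel-value projection is taken as the count of foreground pixels per column, and the vertical zero crossing as the number of 0/1 transitions met when scanning each column from top to bottom. The glyph is an invented 4x4 pattern.

```python
def column_projection(glyph):
    """Number of foreground (1) pixels in each column of a binary image."""
    return [sum(col) for col in zip(*glyph)]


def vertical_zero_crossings(glyph):
    """Per column, count 0<->1 transitions when scanning top to bottom."""
    return [
        sum(1 for a, b in zip(col, col[1:]) if a != b)
        for col in zip(*glyph)
    ]


# Invented 4x4 binary symbol: a top bar (like a headline stroke) over a stem.
glyph = [
    [1, 1, 1, 1],
    [0, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
]

projection = column_projection(glyph)
crossings = vertical_zero_crossings(glyph)

# Concatenated feature vector that a classifier could take as input.
features = projection + crossings
```

A vector like `features` is the sort of fixed-length input a back-propagation network could be trained on; in practice each symbol would first be scaled to a common size.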