• Title/Abstract/Keywords: words

Color Recommendation for Text Based on Colors Associated with Words

  • Liba, Saki;Nakamura, Tetsuaki;Sakamoto, Maki
    • 한국산업정보학회논문지 / Vol. 17, No. 1 / pp.21-29 / 2012
  • In this paper, we propose a new method to select colors representing the meaning of text contents based on the cognitive relation between words and colors. Our method builds on a previous study revealing the existence of words that are crucial for estimating the colors associated with the meaning of text contents. Using the associative probability of each color with a given word and the strength of color association of the word, we estimate the probability of colors associated with a given text. The goal of this study is to propose a system that recommends cognitively plausible colors for the meaning of the input text. To build a versatile and efficient database for our system, two psychological experiments were conducted using news site articles. In experiment 1, we collected 498 words that participants chose as having a strong association with color. Subsequently, in experiment 2, we investigated which color was associated with each word. In addition to those data, we employed estimated values of the strength of color association and of the colors associated with the words in a very large corpus of newspapers (approximately 130,000 words), based on the similarity between words obtained by Latent Semantic Analysis (LSA). Our method therefore allows us to select colors for a large variety of words or sentences. Finally, we verified that our system succeeded in proposing the colors cognitively associated with the meaning of the input text by comparing the colors that participants gave as correct answers with the colors estimated by our method. Our system is expected to be of use in various situations such as data visualization, information retrieval, and art or web page design.
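The estimation step described in the abstract above can be pictured with a minimal sketch. It assumes, purely for illustration, that the text-level color distribution is a strength-weighted sum of per-word color probabilities; the abstract does not give the actual formula, and the function name `estimate_text_colors`, the lexicon layout, and all numeric values are hypothetical.

```python
from collections import defaultdict


def estimate_text_colors(words, color_probs, strength):
    """Return a normalized color distribution for a list of words.

    color_probs: word -> {color: P(color | word)}  (hypothetical lexicon)
    strength:    word -> strength of color association, >= 0 (hypothetical)
    The weighted-sum combination is an assumption, not the paper's formula.
    """
    totals = defaultdict(float)
    for word in words:
        if word not in color_probs:
            continue  # words without color data contribute nothing
        weight = strength.get(word, 0.0)
        for color, p in color_probs[word].items():
            totals[color] += weight * p
    norm = sum(totals.values())
    if norm == 0:
        return {}  # no word in the text carried any color information
    return {color: value / norm for color, value in totals.items()}


# Toy values, invented purely for illustration.
color_probs = {
    "fire": {"red": 0.8, "orange": 0.2},
    "sea": {"blue": 0.9, "green": 0.1},
}
strength = {"fire": 0.9, "sea": 0.6}
result = estimate_text_colors(["fire", "near", "the", "sea"], color_probs, strength)
```

Under this assumption, words with a stronger color association pull the text-level distribution toward their own colors, which matches the role the abstract gives to "crucial words".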

A PHONEMIC ANALYSIS OF THE UNWRITTEN LANGUAGE OF THE PULANG TRIBE

  • Kang, Su-Hee
    • 대한음성학회:학술대회논문집 / 대한음성학회 2000년도 7월 학술대회지 / pp.166-177 / 2000
  • The purpose of this study was to create a writing system for the Pulang, a non-literate tribe in Thailand who immigrated from China. The Pulang hand down their traditions through primary oral culture, so those traditions are difficult to preserve and may disappear over time. The study is therefore expected to contribute to combating illiteracy. The subjects were a minority group living in northern Machan, Chiangrai, Thailand. The study covers not only linguistic analysis but also the eradication of illiteracy, and draws on linguistics, ethnolinguistics, and primary oral culture. Five Pulang people living in that area were chosen for creating the letters. Each word was transcribed in the I.P.A. after listening to its pronunciation, and this process was repeated several times; physical objects and body parts were pointed at in front of the informants, while other words were conveyed by gesture. For the final transcription, several people were lined up to listen to the sounds of words, phrases, and sentences. The first stage was a segmental analysis of Pulang: vocoids, contoids, and diphthongs were described with sample syllables and words. In the second stage, the suprasegmentals were studied through the intonation and juncture of words. Pairs of words were compared, and the different meanings arising from their intonation and juncture were shown. At the end of this part, each phonemic or morphophonemic representation was described in terms of juncture within the words. In the third stage, minimal pairs were analyzed for vowels and consonants and described in terms of free variation based on words. In the last stage, syllable structure in open and closed syllables was studied, and each syllable structure was analyzed with samples. There were thirty-two phonemes in apong Pulang, as follows: seven vocoids (a, i, e, o, u, æ, and ʌ); one diphthong (wu); and 24 contoids (b, c, d, f, g, h, j, k, k, l, m, n, ŋ, pʰ, p, p, r, s, s, sh, t, t, w, and y). The sounds p, s, d, pʰ, j, and t are frequently used in speech and are unique in triphthong. Moreover, most of the words used initial and final consonant clusters.

Language-Independent Word Acquisition Method Using a State-Transition Model

  • Xu, Bin;Yamagishi, Naohide;Suzuki, Makoto;Goto, Masayuki
    • Industrial Engineering and Management Systems / Vol. 15, No. 3 / pp.224-230 / 2016
  • The use of new words, numerous spoken languages, and abbreviations on the Internet is extensive. As such, automatically acquiring words for the purpose of analyzing Internet content is very difficult. In a previous study, we proposed a method for Japanese word segmentation using character N-grams. The previously proposed method is based on a simple state-transition model that is established under the assumption that the input document is described based on four states (denoted as A, B, C, and D) specified beforehand: state A represents words (nouns, verbs, etc.); state B represents statement separators (punctuation marks, conjunctions, etc.); state C represents postpositions (namely, words that follow nouns); and state D represents prepositions (namely, words that precede nouns). According to this state-transition model, based on the states applied to each pseudo-word, we search the document from beginning to end for an accessible pattern. In other words, the process of this transition detects some words during the search. In the present paper, we perform experiments based on the proposed word acquisition algorithm using Japanese and Chinese newspaper articles. These articles were obtained from Japan's Kyoto University and the Chinese People's Daily. The proposed method does not depend on the language structure. If text documents are expressed in Unicode, the proposed method can, using the same algorithm, obtain words in Japanese and Chinese, which do not contain spaces between words. Hence, we demonstrate that the proposed method is language independent.

Analysis of Dental Hygienist Job Recognition Using Text Mining

  • Kim, Bo-Ra;Ahn, Eunsuk;Hwang, Soo-Jeong;Jeong, Soon-Jeong;Kim, Sun-Mi;Han, Ji-Hyoung
    • 치위생과학회지 / Vol. 21, No. 1 / pp.70-78 / 2021
  • Background: The aim of this study was to analyze the public demand for information about the job of dental hygienists by mining text data collected from the online Q & A section on an Internet portal site. Methods: Text data were collected from inquiries that were posted on the Naver Q & A section from January 2003 to July 2020 using "dental hygienist job recognition," "role recognition," "medical assistance," and "scaling" as search keywords. Text mining techniques were used to identify significant Korean words and their frequency of occurrence. In addition, the association between words was analyzed. Results: A total of 10,753 Korean words related to the job of dental hygienists were extracted from the text data. "Chi-lyo (treatment)," "chigwa (dental clinic)," "ske-illing (scaling)," "itmom (gum)," and "chia (tooth)" were the five most frequently used words. The words were classified into the following areas of the dental hygienist's job: periodontal disease treatment and prevention, medical assistance, patient care and consultation, and others. Among these areas, the number of words related to medical assistance was the largest, with sixty-six association rules found between the words, and "chi-lyo," "chigwa," and "ske-illing" as core words. Conclusion: The public demand for information about the job of dental hygienists was mainly related to "chi-lyo," "chigwa," and "ske-illing" as core words, demonstrating that scaling is recognized by the public as the job of a dental hygienist. However, the high demand for information related to treatment and medical assistance in the context of dental hygienists indicates that the public sees the job of dental hygienists as focused more on medical assistance than on preventive dental care that is provided with job autonomy.
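The two analyses named in the Methods above, word frequency and association between words, can be sketched as follows. This is a minimal illustration rather than the study's pipeline: it assumes the posts are already tokenized, it uses the standard support and confidence measures for pairwise association rules (the abstract does not say which measures or thresholds were used), and the toy posts, `word_frequencies`, and `pair_rules` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def word_frequencies(docs):
    """Count every token occurrence across all tokenized documents."""
    return Counter(word for doc in docs for word in doc)


def pair_rules(docs, min_support=0.5):
    """Return (antecedent, consequent, support, confidence) for word pairs.

    support    = share of documents containing both words
    confidence = share of documents with the antecedent that also
                 contain the consequent
    """
    n = len(docs)
    sets = [set(doc) for doc in docs]
    word_count = Counter(w for s in sets for w in s)
    pair_count = Counter(
        pair for s in sets for pair in combinations(sorted(s), 2)
    )
    rules = []
    for (a, b), count in pair_count.items():
        support = count / n
        if support < min_support:
            continue  # drop pairs that co-occur too rarely
        rules.append((a, b, support, count / word_count[a]))
        rules.append((b, a, support, count / word_count[b]))
    return rules


# Toy tokenized posts, invented for illustration only.
docs = [
    ["chigwa", "ske-illing", "chi-lyo"],
    ["chigwa", "chi-lyo"],
    ["itmom", "chi-lyo"],
    ["chigwa", "ske-illing"],
]
freq = word_frequencies(docs)
rules = pair_rules(docs, min_support=0.5)
```

A rule such as "ske-illing" → "chigwa" with high confidence would mean that posts mentioning scaling almost always mention the dental clinic as well, which is the kind of relation the abstract summarizes as core words.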

Development of Sensibility Vocabulary Classification System for Sensibility Evaluation of Visitors According to Forest Environment

  • Lee, Jeong-Do;Joung, Dawou;Hong, Sung-Jun;Kim, Da-Young;Park, Bum-Jin
    • 인간식물환경학회지 / Vol. 22, No. 2 / pp.209-217 / 2019
  • Generally, human sensibility is expressed in language. To discover the sensibility of visitors in relation to the forest environment, it is first necessary to determine the exact meanings of the terms they use. Furthermore, it is necessary to sort these terms according to their meanings based on an appropriate classification system. This study attempted to develop a classification system for forest sensibility vocabulary by extracting Korean words used by forest visitors to express their sensibilities in relation to the forest environment, and established the structure of the system to classify the accumulated vocabulary. For this purpose, we extracted forest sensibility words based on a literature review of experiences reported in the past as well as interviews of forest visitors, and categorized the words by meaning using the Standard Korean Language Dictionary maintained by the National Institute of the Korean Language. Next, the classification system for these words was established with reference to the classification system for vocabulary in the Korean language examined in previous studies of Korean language and literature. As a result, 137 forest sensibility words were collected using a documentary survey, and we categorized these words into four types: emotion, sense, evaluation, and existence. Categorizing the collected forest sensibility words based on this Korean language classification system resulted in the extraction of 40 representative sensibility words. This work enabled us to determine where the sensibilities expressed in the forest derive from, that is, from sight, hearing, smell, taste, or touch, along with various other aspects of how human sensibilities are expressed, such as whether the subject of a word is person-centered or object-centered. We believe that the results of this study can serve as foundational data about forest sensibility.

한글 감정단어의 의미적 관계와 범주 분석에 관한 연구 (A Study on the Analysis of Semantic Relation and Category of the Korean Emotion Words)

  • 이수상
    • 한국도서관정보학회지 / Vol. 47, No. 2 / pp.51-70 / 2016
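  • The purpose of this study is to analyze the network of semantic relations and the polarity and arousal categories of a list of major Korean emotion words. The results are as follows. First, in the emotion word network, the emotion words were semantically connected to one another. This characteristic makes it difficult to distinguish types of emotion words by semantic similarity. Instead, we were able to identify the emotion words that play a central role in the network of semantic relations. Second, in categories that combine the polarity and arousal dimensions, many emotion words were classified into either a group of words with negative polarity and high arousal or a group of words with negative polarity and medium arousal. These characteristics of Korean emotion words should be useful for sentiment analysis of the various kinds of text data found in libraries and bibliographic information.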

음절의 시작과 단어 시작의 불일치가 영어 단어 인지에 미치는 영향 (The Effects of Misalignment between Syllable and Word Onsets on Word Recognition in English)

  • 김선미;남기춘
    • 말소리와 음성과학 / Vol. 1, No. 4 / pp.61-71 / 2009
  • This study aims to investigate whether the misalignment between syllable and word onsets due to the process of resyllabification affects Korean-English late bilinguals perceiving English continuous speech. Two word-spotting experiments were conducted. In Experiment 1, misalignment conditions (resyllabified conditions) were created by adding CVC contexts at the beginning of vowel-initial words and alignment conditions (non-resyllabified conditions) were made by putting the same CVC contexts at the beginning of consonant-initial words. The results of Experiment 1 showed that detections of targets in alignment conditions were faster and more correct than in misalignment conditions. Experiment 2 was conducted in order to avoid any possibilities that the results of Experiment 1 were due to consonant-initial words being easier to recognize than vowel-initial words. For this reason, all the experimental stimuli of Experiment 2 were vowel-initial words preceded by CVC contexts or CV contexts. Experiment 2 also showed misalignment cost when recognizing words in resyllabified conditions. These results indicate that Korean listeners are influenced by misalignment between syllable and word onsets triggered by a resyllabification process when recognizing words in English connected speech.

한국유아의 수단어 획득에 관한 연구 (The Acquisition of Korean Number-Word Systems of Young Children)

  • 홍혜경
    • 아동학회지 / Vol. 11, No. 2 / pp.5-23 / 1990
  • The purpose of this study was to investigate the acquisition of number-word systems by young children. Specifically, the acquisition of Korean Number-Words (KNW) was compared with the acquisition of Chinese-derived Number-Words (CNW). The subjects included 120 children aged 2:5 to 5:11. The subjects' oral counting using the two number-word systems was audiotaped. Two coders transcribed the tapes. The data were analyzed by content analysis with descriptive statistics. The findings of this study showed that the acquisition of KNW began from around age two and the acquisition of CNW from around age three. From then, the acquisition of the two number-word systems was parallel. The acquisition of number-words began from the age of 2 years, increased slowly to the age of 4, and then increased rapidly after the age of 5. Although KNW were acquired earlier than CNW, at around the age of 5 years the acquisition of CNW surpassed the acquisition of KNW. The acquisition of number words consists of four developmental levels: Level I, beginning of acquisition of traditional KNW only; Level II, beginning of acquisition of CNW with extension of KNW; Level III, parallel extension of the two number-word systems; Level IV, superior acquisition of CNW. The major error through all stages in the sequence of number words was the omission of one number-word. Younger children produced errors of omission of one, two, or three number-words, whereas older children produced errors of nonstandard number-words and repetition.

Word class information in perception of prosodic prominence by Korean learners of English

  • Im, Suyeon
    • 말소리와 음성과학 / Vol. 11, No. 4 / pp.1-8 / 2019
  • This study aims to investigate how prosodic prominence is perceived in relation to word class information (or parts-of-speech) by Korean learners of English compared with native English speakers in public speech. Two groups, Korean learners of English and native English speakers, were asked to judge words perceived as prominent simultaneously while listening to a speech. Parts-of-speech and three acoustic cues (i.e., max F0, mean phone duration, and mean intensity) were analyzed for each word in the speech. The results showed that content words tended to be higher in pitch and longer in duration than function words. Both groups of listeners rated prominence on content words more frequently than on function words. This tendency, however, was significantly greater for Korean learners of English than for native English speakers. Among the parts-of-speech of the content words, Korean learners of English were more likely than native English speakers to judge nouns and verbs as prominent. This study presents evidence that Korean learners of English consider most, if not all, content words as landing locations of prosodic prominence, in alignment with the previous study on the production of prominence.