• Title/Summary/Keyword: New Words

Search Result 1,475, Processing Time 0.03 seconds

Text Categorization Using TextRank Algorithm (TextRank 알고리즘을 이용한 문서 범주화)

  • Bae, Won-Sik;Cha, Jeong-Won
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.1
    • /
    • pp.110-114
    • /
    • 2010
  • We describe a new method for text categorization using TextRank algorithm. Text categorization is a problem that over one pre-defined categories are assigned to a text document. TextRank algorithm is a graph-based ranking algorithm. If we consider that each word is a vertex, and co-occurrence of two adjacent words is a edge, we can get a graph from a document. After that, we find important words using TextRank algorithm from the graph and make feature which are pairs of words which are each important word and a word adjacent to the important word. We use classifiers: SVM, Na$\ddot{i}$ve Bayesian classifier, Maximum Entropy Model, and k-NN classifier. We use non-cross-posted version of 20 Newsgroups data set. In consequence, we had an improved performance in whole classifiers, and the result tells that is a possibility of TextRank algorithm in text categorization.

Comparative analysis on design key-word of the four major international fashion collections - focus on 2018 fashion collection - (4대 해외 패션 컬렉션의 디자인 key-word 비교분석 - 2018년 패션 컬렉션을 중심으로 -)

  • Kim, Sae-Bom;Lee, Eun-Suk
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.21 no.3
    • /
    • pp.109-119
    • /
    • 2019
  • The purpose of this study is to examine fashion trends and the direction of the four fashion collections by analyzing the design key-words of the four major international fashion collections in 2018. The data of this study was collected by extracting the key-words from Marie Claire Korea in 2018, with the total of the collected data numbering 2,144. The data was analyzed by text mining using the R program and word-cloud, and a co-occurrence network analysis was conducted. The results of this study are as follows: First, the key-words of fashion collection designs in 2018 were fringe and ruffle detail, silk and denim fabric, vivid color, stripe and check pattern, pants suit item, and oversized silhouette, focusing on romanticism and sport. Second, seasonal characteristics of the fashion collections were pastel colors in S/S, primary and vivid colors in F/W. Details were embroidery and cutouts in S/S, patchwork and fringe in F/W. Third, the design trends of the four major fashion collections were presented in the Paris collection: stripes, check patterns, embroidery, lace, tailoring, draping, romanticism, and glamor. In the Milan collection, checks, prints, denim, and minidresses reflected sport and romanticism. The London collection included fringe, ruffles, floral patterns, flower patterns, and romanticism. The New York collections included vivid colors, neon colors, pastel colors, oversize silhouettes, bodysuits, and long dresses.

Optimal supervised LSA method using selective feature dimension reduction (선택적 자질 차원 축소를 이용한 최적의 지도적 LSA 방법)

  • Kim, Jung-Ho;Kim, Myung-Kyu;Cha, Myung-Hoon;In, Joo-Ho;Chae, Soo-Hoan
    • Science of Emotion and Sensibility
    • /
    • v.13 no.1
    • /
    • pp.47-60
    • /
    • 2010
  • Most of the researches about classification usually have used kNN(k-Nearest Neighbor), SVM(Support Vector Machine), which are known as learn-based model, and Bayesian classifier, NNA(Neural Network Algorithm), which are known as statistics-based methods. However, there are some limitations of space and time when classifying so many web pages in recent internet. Moreover, most studies of classification are using uni-gram feature representation which is not good to represent real meaning of words. In case of Korean web page classification, there are some problems because of korean words property that the words have multiple meanings(polysemy). For these reasons, LSA(Latent Semantic Analysis) is proposed to classify well in these environment(large data set and words' polysemy). LSA uses SVD(Singular Value Decomposition) which decomposes the original term-document matrix to three different matrices and reduces their dimension. From this SVD's work, it is possible to create new low-level semantic space for representing vectors, which can make classification efficient and analyze latent meaning of words or document(or web pages). Although LSA is good at classification, it has some drawbacks in classification. As SVD reduces dimensions of matrix and creates new semantic space, it doesn't consider which dimensions discriminate vectors well but it does consider which dimensions represent vectors well. It is a reason why LSA doesn't improve performance of classification as expectation. In this paper, we propose new LSA which selects optimal dimensions to discriminate and represent vectors well as minimizing drawbacks and improving performance. This method that we propose shows better and more stable performance than other LSAs' in low-dimension space. In addition, we derive more improvement in classification as creating and selecting features by reducing stopwords and weighting specific values to them statistically.

  • PDF

A study on the development of long time exposure $SO_2$ sampler (장기 노출 $SO_2$ 간이 샘플러 개발에 관한 연구)

  • 이동인
    • Journal of Environmental Science International
    • /
    • v.2 no.3
    • /
    • pp.207-216
    • /
    • 1993
  • The concentrations of $SO_2$ and $SO_3$ were measured to estimate a new developed long time exposure $SO_2$ sampler at Onsan industrial area considering the meteorological factors from June to October, 1992. The mean concentration of $SO_3$ by $PbO_2$ method was 0.924 mg $SO_3 / 10cm^2$ $PbO_2$/day and their high values were shown in the center of the industrial area, which show potential pollution due to the increase of industrial activities and micrometeorological factors in and around the sites. As a result of statistical correlation between $SO_2$ concentration by new sampling method and $SO_3$ concentration by $PbO_2$ method in July and August, 1992, correlation coefficients were high (r=0.87, 0.91) and shown more than 0.83 value in the high concentration data set, which was arbitrarily divided into 7~10${\mu}l$$SO_2$ concentration in an attempt to further investigate these relationships. Therefore, use of new developed long time exposure TEX>$SO_2$ sampler is good for TEX>$SO_2$ measurement and valuable for estimation of air quality in the urban and industrial area. Key Words : a new developed long time exposure TEX>$SO_2$ sampler, correlation coefficients, high, $SO_2$ measurement, estimation of air Quality.

  • PDF

A Study on the Change & Flow of Shop Interior Planning & Design -Focus on Retail Stores in Great Cities in U.S.A- (상업공간에 대한 실내디자인 및 계획의 변화와 흐름에 관한 연구 -미국 대도시의 RETAIL STORE를 중심으로-)

  • 박태욱;이현경
    • Korean Institute of Interior Design Journal
    • /
    • no.10
    • /
    • pp.77-81
    • /
    • 1997
  • The study is for interior design and planning of new c conceptual modern shop(called "Value Conscious Store") t through the history of retail store, and its process is based on m most great cites in USA. The Value Conscious Store has c come into existence for consumer and retailer who have had v various lifestyles and characters. From analysis of new l lifestyle consumer to retailer's strategy. we could find i interesting design solutions and, forecast next concerns for d designing store. Store has been designed up-scaled and opened to give pleasure and comfort and made by a theme to m make unique and strong impact for customers. Also it uses M Multi-Media for excitment, and is designed as exhibition of m museum to lead constomers to new culture and trend. From t these interior trends will go on to next generation with new c concepts : environment and nature, senses and sensibility. T These words will be the new solution for creative and s successful store design by the designer who has environm mentally conscious and social responsibility in his mind. his mind.

  • PDF

A Study on Sob Nomad's Culture and Fashion Style (잡노마드(Job Nomad)의 문화와 패션스타일에 관한 연구)

  • 최지영;간호섭
    • Journal of the Korean Society of Costume
    • /
    • v.53 no.1
    • /
    • pp.129-141
    • /
    • 2003
  • Much has been said in the 21st century about advanced information society following industrial society, so information appeared obviously. Based on the development of digital network due to such highly developed information, foresee a new phenomenon in anthropology. The new phenomenon is urban nomad such jobnomad who may change the culture of settlement with a long history into the culture of nomad. This study was to analyze the culture and fashion style of job nomad who may be a trend of fashion in the future. The results of this study are as follows Firstly, the features of job nomad are new communication technology and information technology called new media. And key words for job nomad are non-possession and professionalism and their feature in labor is one(1) person project. Secondly, job nomad to be a trend of future fashion is seeking wearable electric machine - wearable computer fashion. Thirdly, Zen style fashion reflecting Zen idea has such features as naturalism. indeterminism, equalitarianism, and moderation. Those features coincide with the tendency of job nomad who may lead the culture of fashion in the 21st century and do with human being's life style in the 21st century. Expect that job nomad appears newly in social and cultural phenomenon through this study can be developed toward a new and sensible fashion.

Variation in vowel duration depending on voicing in American, British, and New Zealand English

  • Cho, Hyesun
    • Phonetics and Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.11-20
    • /
    • 2016
  • It is well known that vowels are shorter before voiceless consonants than voiced ones in English, as in many other languages. Research has shown that the ratio of vowel durations in voiced and voiceless contexts in English is in the range of 0.6~0.8. However, little work has been done as to whether the ratio of vowel durations varies depending on English variety. In the production experiment in this paper, seven speakers from three varieties of English, New Zealand, British, and American English, read 30 pairs of (C)VC monosyllabic words which differ in coda voicing (e.g. beat-bead). Vowel height, phonemic vowel length, and consonant manner were varied as well. As expected, vowel-shortening effects were found in all varieties: vowels were shorter before voiceless than before voiced codas. Overall vowel duration was the longest in American English and the shortest in New Zealand (NZ) English. In particular, vowel duration before voiceless codas is the shortest in New Zealand English, indicating the most radical degree of shortening in this variety. As a result, the ratio of vowel durations in varying voicing contexts is the lowest in NZ English, while American and British English do not show a significant difference each other. In addition, consonant closure duration was examined. Whereas NZ speakers show the shortest vowel duration before a voiceless coda, their voiceless consonants have the longest closure duration, which suggest an inverse relationship between vowel duration and closure duration.

TEACHING APPLIED MATHEMATICS FOR ENGINEERS - A NEW TEACHING PARADIGM BASED ON INDUSTRIAL MATHEMATICS

  • Taavitsainen, Veli-Matti
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.11 no.2
    • /
    • pp.31-40
    • /
    • 2007
  • What is the "new paradigm"? It is impossible express it in one or two words, but if one had to; the closest might be the "holistic approach". The expression can be justified by the fact that the conclusions above lead to a greater intermixing of mathematics with engineering and natural sciences subjects, typically expressed in the form of examples of simplified real problems. They also lead to a greater intermixing of subjects within mathematics so that the courses should have less separation e.g. between symbolic and numerical mathematics. The conclusions also lead to the spreading the mathematics courses throughout all study years, not just the first two years. Of course, this should be done with great care in order to guarantee studies that are logically linked together. The new paradigm also means that the needs arising from industrial mathematics must be taken into account in the contents of engineering mathematics courses. Such topics are e.g. multivariate methods, statistics and use of mathematical software. What are we expected to gain from the paradigm shift? The primary benefit should be in obtaining more productive engineers equipped with a better degree of mathematical preparedness for engineering problems. But in addition, it should also promote more intensive use of applied mathematics and easier communication with professional mathematicians, often needed in complicated industrial problems.?Finally, it can be noted that the new paradigm is in harmony with the basic ideas of the CDIO (Conceive - Design - Implement - Operate) initiative for producing the next generation of engineers [1]. New ideas for engineering education can be found also in the homepage of SEFI (European Society for Engineering Education) [2].

  • PDF

A Semantic Representation Based-on Term Co-occurrence Network and Graph Kernel

  • Noh, Tae-Gil;Park, Seong-Bae;Lee, Sang-Jo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.11 no.4
    • /
    • pp.238-246
    • /
    • 2011
  • This paper proposes a new semantic representation and its associated similarity measure. The representation expresses textual context observed in a context of a certain term as a network where nodes are terms and edges are the number of cooccurrences between connected terms. To compare terms represented in networks, a graph kernel is adopted as a similarity measure. The proposed representation has two notable merits compared with previous semantic representations. First, it can process polysemous words in a better way than a vector representation. A network of a polysemous term is regarded as a combination of sub-networks that represent senses and the appropriate sub-network is identified by context before compared by the kernel. Second, the representation permits not only words but also senses or contexts to be represented directly from corresponding set of terms. The validity of the representation and its similarity measure is evaluated with two tasks: synonym test and unsupervised word sense disambiguation. The method performed well and could compete with the state-of-the-art unsupervised methods.

The Production and Perception of Focus in English Yes- No Questions (영어 가부 의문문 초점 발화와 지각)

  • Jeon, Yoon-Shil;Oh, Sei-Poong;Kim, Kee-Ho
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.111-128
    • /
    • 2004
  • In English, a focused word with new information receives a pitch accent. This paper examines how English native speakers and Korean speakers produce and perceive focus in English yes-no questions. The production experiments show that native speakers realize an appropriate intonation of yes-no questions, in which a focused word has a low pitch accent followed by a high phrasal accent and a high boundary tone. However, Korean speakers usually give a high tone to a focused word. In a like manner, the perception experiments show that English native speakers judge a word with a low tone to be focused, while Korean speakers have difficulty in comprehending a focused word realized as a low tone. And it is found that Korean speakers tend to perceive low tones on sentence initial and final focused words better than those on sentence medial focused words, and they often perceive a word with a relatively high fundamental frequency or a sharp rise of fundamental frequency as a focused word. This paper shows that Korean speakers have trouble to produce and perceive an appropriate tonal pattern of a focused yes-no question, and that can cause confusion in a conversation with native speakers.

  • PDF