• Title/Abstract/Keywords: content words

Focus Realization of English Noun Phrases in the Classroom Situation

  • 전지현;송재영;이동화;김기호
    • 음성과학 / Vol. 9, No. 2 / pp.109-132 / 2002
  • The purpose of this study is to examine the focus realization of [Adjective+Noun] phrases used in English classroom situations. To this end, two production experiments and one perception experiment were designed. The noun phrases in the two production experiments are divided into three patterns according to the location of focus. The difference between the two production experiments is that in the first the focused words are given contextually in the classroom situation, whereas in the second they are presented in written form. We compare native English teachers' focus realization of noun phrases with that of Korean teachers from the point of view of intonational phonology. In the perception test, we examine how the uttered sentences are perceived by native speakers of English and native speakers of Korean. The results of the three experiments show that native English teachers' focus realization is quite consistent with information structure. In addition, there is a significant difference in the pitch range of adjectives and nouns when the native speakers place pitch accents on both content words, and the uttered sentences are mostly perceived as the speakers intended. Korean speakers, however, usually focus only on the adjective or on both the adjective and the noun, regardless of the relative informativeness of these words. From these findings, we conclude that the focus realization of Korean teachers is rather inconsistent with information structure compared with that of native English teachers.

A Multi-Bible Application on an Android Platform Using a Word Tokenization and Recognition Algorithm

  • 강성모;강명수;김종면
    • 대한임베디드공학회논문지 / Vol. 6, No. 4 / pp.215-221 / 2011
  • Mobile phones, once used simply for calling and sending text messages, have recently evolved into application-oriented digital devices such as smartphones and tablet phones. The rapid increase in smartphones and tablet phones, which offer advanced capabilities and run a variety of Java-based applications, calls for a wide range of digital multimedia content. There are currently more than 2.2 billion Christians around the world. More than 300 million of them live in Asia, and all of them have and read the Bible. An application that translates the Bible from English into their own languages could therefore be very helpful. For this reason, this paper proposes a multi-Bible application that supports various languages. To do this, we implemented an algorithm that recognizes sentences in the Bible word by word. The algorithm consists essentially of three functions: tokenizing sentences in the Bible into individual words (word tokenization), recognizing words through touch events (word recognition), and translating the selected words into the desired language. Consequently, the proposed multi-Bible application supports language translation efficiently when the user touches words in Bible sentences.
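
The three functions named in this abstract (word tokenization, word recognition from a touch event, and translation of the selected word) can be illustrated with a minimal Python sketch. This is not the authors' Android implementation: the function names, the character-offset model of a touch event, and the tiny `GLOSSARY` are all assumptions made for illustration.

```python
import re

# Hypothetical toy glossary; a real application would need a full bilingual dictionary.
GLOSSARY = {"beginning": "태초", "god": "하나님", "light": "빛"}


def tokenize(sentence):
    """Word tokenization: split a sentence into words, keeping each word's character span."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"[A-Za-z']+", sentence)]


def word_at(tokens, offset):
    """Word recognition: return the word whose span contains the touched offset, or None.

    The touch event is modeled here as a character offset into the sentence (an assumption).
    """
    for word, start, end in tokens:
        if start <= offset < end:
            return word
    return None


def translate(word, glossary=GLOSSARY):
    """Translation: look the selected word up in the glossary, ignoring case."""
    return glossary.get(word.lower())


verse = "In the beginning God created the heaven and the earth."
tokens = tokenize(verse)
touched = word_at(tokens, 9)  # offset 9 falls inside "beginning"
print(touched, "->", translate(touched))
```

Keeping the character span with each token is what lets a touch position be mapped back to a single word; touching whitespace or punctuation simply selects nothing.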

Style Selection for Korean Generation under the Pivot MT System

  • 이종혁
    • 인지과학 / Vol. 1, No. 2 / pp.279-291 / 1989
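  • Under a pivot machine translation system, selecting a style that yields natural output sentences is very difficult, both because the language-independent intermediate representation excludes surface syntactic information and because cultural differences between languages lead to differences in thinking and conception. As an attempt to address these problems, this paper discusses, first, the pragmatic and stylistic determination of voice, which strongly affects how natural the output sentence structure is, and the generation of voice under the severe constraints on the passive in Korean; second, changes to sentence structure to suit modes of expression peculiar to Korean; and finally, semantic supplementation using content words to remove the semantic ambiguity of function words in the output.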

A Technique of Statistical Message Filtering for Blocking Spam Message

  • 김성윤;차태수;박제원;최재현;이남용
    • 한국IT서비스학회지 / Vol. 13, No. 3 / pp.299-308 / 2014
  • In the information society, indiscriminately received spam messages cause damage not only to individuals but also to the community. Many spam filtering techniques, such as blocking characters, are being actively studied. Most of these studies are content-based spam filtering technologies that use machine learning. As spam transmission techniques develop, spammers send spam messages using term spamming techniques. Spam messages tend to include many nouns, repeat words, and insert special characters between words in a sentence. In this paper, we parameterize these three features using the SPSS statistical program and derive an equation. Based on this equation, we then measure the performance of spam message classification. Compared with previous studies, the method was confirmed to perform well in terms of the FP rate, further minimizing the associated cost.
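
As a rough illustration of the three features mentioned in this abstract (share of nouns, repeated words, and special characters), the following Python sketch computes them and feeds them to a logistic-style score. The abstract does not give the derived equation, so the logistic form, the weights, and the bias are assumptions, and `is_noun` is a hypothetical stand-in for a real part-of-speech tagger. For simplicity the sketch counts all special characters rather than only those inserted between words.

```python
import math
import re
from collections import Counter


def extract_features(message, is_noun):
    """Return (noun_ratio, repeat_ratio, special_ratio) for one message.

    is_noun is a caller-supplied predicate (hypothetical; stands in for a POS tagger).
    """
    words = re.findall(r"\w+", message.lower())
    total = max(len(words), 1)
    counts = Counter(words)
    noun_ratio = sum(1 for w in words if is_noun(w)) / total
    # Every occurrence of a word beyond its first counts as a repetition.
    repeat_ratio = sum(c - 1 for c in counts.values()) / total
    # Characters that are neither word characters nor whitespace.
    special_ratio = len(re.findall(r"[^\w\s]", message)) / max(len(message), 1)
    return noun_ratio, repeat_ratio, special_ratio


def spam_score(features, weights=(1.0, 2.0, 3.0), bias=-1.5):
    """Logistic score in (0, 1); weights and bias are illustrative, not fitted values."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


nouns = {"prize"}  # hypothetical noun list
features = extract_features("free free free prize", lambda w: w in nouns)
print(features, spam_score(features))
```

In practice the coefficients would be estimated from labeled messages, and the decision threshold chosen with the false-positive rate in mind.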

Pronunciation-based Listening Teaching

  • Lee, Kyung-Mi
    • 대한음성학회:학술대회논문집 / 대한음성학회 July 2000 Conference Proceedings / pp.283-300 / 2000
  • This paper suggests how to improve Korean high school students' awareness of pronunciation in order to foster communicative effectiveness. It focuses initially on tasks for listening to suprasegmental aspects. The strategies used in the listening process are (1) discerning intonation units, (2) recognizing rhythm patterns, and (3) identifying contraction and linking in connected speech. The tasks included in each process are listening discrimination, guided practice activities, and listening and speaking activities. The teacher should avoid methods that yield discouraging outcomes and try to help students experience success in doing exercises and activities. I therefore suggest that students put a slash at each perceptible pause to chunk the stream of speech into intonation units, and mark the content words to internalize English rhythm. I also suggest that students listen to English pop songs in order to improve their awareness of function words and connected speech within the intonation unit.

Text Mining and Sentiment Analysis for Predicting Box Office Success

  • Kim, Yoosin;Kang, Mingon;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 12, No. 8 / pp.4090-4102 / 2018
  • Since the emergence of online communication, text mining and sentiment analysis have frequently been applied to analyzing electronic word-of-mouth. This study aims to develop a domain-specific sentiment lexicon to predict box office success in the Korean film market and to validate the feasibility of the lexicon. Natural language processing, a machine learning algorithm, and a lexicon-based sentiment classification method are employed. To create a movie-domain sentiment lexicon, 233,631 reviews of 147 movies with popularity ratings were collected with an XML crawling package in R. We achieved 81.69% accuracy in sentiment classification with a Korean sentiment dictionary containing 706 negative words and 617 positive words. The results showed a stronger positive relationship between box office success and consumers' sentiment, as well as a significant positive effect in the linear regression of the prediction model. In addition, they reveal that emotion in user-generated content can be a more accurate clue for predicting business success.
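
The lexicon-based classification step mentioned in this abstract can be sketched in a few lines of Python. This is only an illustration of the general technique: the two word sets below are hypothetical English placeholders, not the paper's 706 negative and 617 positive Korean entries, and the simple count-difference rule is an assumption.

```python
import re

# Hypothetical miniature lexicon for illustration only.
POSITIVE = {"great", "moving", "fun"}
NEGATIVE = {"boring", "awful", "waste"}


def classify(review):
    """Label a review by comparing counts of positive and negative lexicon words."""
    words = re.findall(r"[a-z']+", review.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def positive_share(reviews):
    """Fraction of non-neutral reviews that are positive (0.0 if none are decided)."""
    labels = [classify(r) for r in reviews]
    decided = [label for label in labels if label != "neutral"]
    if not decided:
        return 0.0
    return sum(label == "positive" for label in decided) / len(decided)


print(classify("A great and moving film"))
print(positive_share(["great fun", "awful", "it was okay"]))
```

A per-movie value such as `positive_share` is the kind of aggregate that could then serve as a predictor in a regression against box office figures.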

An Analysis on Education and Textbooks of Physics in North Korea

  • 민영기
    • 한국과학교육학회지 / Vol. 16, No. 4 / pp.329-339 / 1996
  • We examined the science education system in North Korea from elementary school to high school. We also analyzed the physics textbooks used in North Korea and compared the results with the textbooks used in South Korea. We compared the goals and system of physics education, and the content, order of study, and volume of the textbooks. In North Korea, physics education starts in the 4th year of elementary school and continues throughout all school years. Science process skills are regarded as important, and figures, tables, problem sets, experiments, and sample solutions are exclusively used in the textbooks. Electromagnetism occupies the largest portion of the physics textbooks, but subjects related to the application of physics are more stressed. There are a few subjects which are included in the North Korean textbooks but not in the South Korean textbooks. We have compiled about 60 North Korean physics words which differ from the South Korean words used in the textbooks. Overall, there will not be much difficulty in integrating the physics education systems and physics textbooks after the two Koreas are unified.

A Study on Dialect Expression in Korean-Based Speech Recognition

  • 이신협
    • 한국정보통신학회:학술대회논문집 / 한국정보통신학회 2022 Spring Conference / pp.333-335 / 2022
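  • Advances in speech recognition processing, together with STT and TTS technologies, are being applied in various video and streaming services. However, speech recognition of actual conversation faces high barriers to producing clear written-style text because of dialect use, stop words, interjections, and redundant near-synonyms. For dialect that is ambiguous to speech recognition, this study proposes a speech recognition technique that uses a category-specific dictionary of important dialect words and applies dialect prosody as an attribute of the speech recognition network model.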

Analysis of the Time-dependent Relation between TV Ratings and the Content of Microblogs

  • 최준연;백혜득;최진호
    • 지능정보연구 / Vol. 20, No. 1 / pp.163-176 / 2014
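  • With the spread of social media, many users express their thoughts and opinions through SNS and interact with other users. In particular, microblogs such as Twitter serve as platforms where many people immediately express and exchange opinions, in short sentences, on common topics such as movies, TV, and social phenomena. Opinions and feelings about TV programs are also expressed through microblogs. To examine the relationship between microblog content and TV ratings, this study collected tweets about past terrestrial broadcast programs, removed inappropriate tweets, and performed morphological analysis. Not only the extracted morphemes but all words entered by users, including emoticons and neologisms, were taken as candidate features, and their correlation with ratings was analyzed. For the experiment, tweet data on entertainment programs were collected for 10 months from January 2013 and compared with nationwide ratings data. Tweet volume was highest on the day of the week on which a program aired, and rose sharply around the broadcast time. This is thought to occur because terrestrial programs, which are broadcast nationwide at the same time, provide a shared topic of interest. As count-based features, the total numbers of tweets and retweets on the broadcast day and the numbers of tweets and retweets during the broadcast were tested for correlation with ratings, but all showed low correlation coefficients. This means that simple tweet frequency does not properly reflect satisfaction with a program or its ratings. Among the words extracted as content-based features, some showed high correlation, and highly correlated features also appeared among non-standard emoticons and neologisms. In addition, the words with high correlation coefficients differed before and after the start of the broadcast. Because TV programs air at the same time every week, tweets anticipating the broadcast differed in content from tweets expressing impressions after it. These results mean that the time window in which a word is strongly associated with ratings varies by word, and that the time window of each word should be taken into account when estimating ratings. The proposed method is expected to be useful for complementing conventional sample-based TV ratings measurement.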
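
The idea that a word's association with ratings depends on the time window can be illustrated with a small Python sketch that computes a Pearson correlation per window. The abstract does not state which correlation measure or window definitions were used, so Pearson's coefficient, the "before"/"after" windows, and all numbers below are assumptions made up for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def best_window(window_counts, ratings):
    """Return the time window whose per-episode word counts correlate best with ratings."""
    return max(window_counts, key=lambda w: pearson(window_counts[w], ratings))


# Hypothetical data: counts of one word per episode, split by time window.
ratings = [6.0, 7.5, 9.0, 10.5]
excited_counts = {"before": [5, 8, 12, 15], "after": [3, 9, 2, 7]}

for window, counts in excited_counts.items():
    print(window, round(pearson(counts, ratings), 3))
print(best_window(excited_counts, ratings))
```

Running the same comparison for every candidate word would give, for each word, the window in which it is most informative about ratings.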

Interpretation and Prediction of Situations on the Korean Peninsula by Peace Index Analysis from Unstructured Data

  • 권오병;박다솔;최지혜;이재윤
    • 한국IT서비스학회지 / Vol. 12, No. 4 / pp.423-434 / 2013
  • Since acquiring intelligence about political situations around the Korean Peninsula directly is nearly impossible, individuals and companies inevitably rely on open, indirect data such as newspapers. However, since newspaper content is largely unstructured and very voluminous, conventional content analysis is time-consuming and therefore very costly. This paper therefore proposes a sentiment analysis method that computes a daily 'peace index' from unstructured newspaper data. Through content analysis, words and phrases that represent the sentiment of a nation are carefully identified. To show the feasibility of the proposed idea, a prototype system with a vocabulary repository about political situations was developed to estimate the peace index automatically.