Search | Korea Science

Text Categorization Using Both Lexical Information and Syntactic Information (어휘정보와 통사정보를 모두 이용한 문서분류)

박성배;장병탁
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10b
- /
- pp.37-39
- /
- 2001
현재 이용가능한 대부분의 자동문서분류 시스템의 가장 큰 문제는 문서에 포함된 단어 사이의 통사 정보는 무시한 채, 각 단어의 분포만 고려한다는 점이다. 하지만, 통사 정보도 문서 분류를 위해 매우 중요한 정보 중의 하나이다. 본 논문에서는 문서에 나타난 어휘 정보와 함께 통사 정보를 함께 고려하는 자동문서분류 방법을 제시한다. Reuters-21578 말뭉치에 대한 문서분류 실험결과 제시된 방법은 어휘정보만 사용하는 방법과 통사정보만 사용하는 방법 모두보다 높은 성능을 보인다 이 말뭉치에 대해서, 어휘정보만으로 학습된 Support Vector Machine으로 약 77%의 매우 높은 정확도를 얻을 수 있음에도 약 0.63%의 추가적인 성능 향상이 있었다.
PDF

An Extended Lexical Relational Structure Treatment of Denominal Verbs

Ahn, Sung-Ho
- Korean Journal of English Language and Linguistics
- /
- v.2 no.1
- /
- pp.77-95
- /
- 2002
This paper claims Hale and Keyser's (1992, 1993a, 2001) Lexical Relational Structure (LRS) theory should be slightly extended by allowing the syntactic principles for the “referential” component to apply to the “manner” component. Then, it shows this extension allows us to deal with most of Clark and Clark's (1979) denominal verbs, except that cases like butcher may further demand Hale and Keyser's (2001) p-signature copying treatment. It also argues that this extension is further supported by a more satisfactory treatment of the distribution of non-bridge verbs, and of an asymmetry in ditransitive passives.
PDF

Focus and Prosodic Structure

Oh, Mi-Ra
- Speech Sciences
- /
- v.8 no.1
- /
- pp.21-31
- /
- 2001
The effects of focus on prosodic phrasing, F0, and duration are investigated paying attention not only to the target of focus but also to the constituents that are outside the domain of focus in Korean. We find that the constituents preceding and following the focused word tend to be dephrased. Dephrasing does not always cover up to the Intonation Phrase boundary contrary to Jun's (1993) claim. Dephrasing caused by focus determines F0 and durational difference between focused and neutral sentences. Syntactic constituency is also shown to playa role in prosodic phrasing.
PDF

Korean Syntactic Processes in Working Memory (작업 기억내에서의 한글 통사처리과정)

Kim, Young-Jin
- Annual Conference on Human and Language Technology
- /
- 1991.10a
- /
- pp.209-218
- /
- 1991
작업 기억내에서의 통사처리과정을 살펴보기 위해 생략어를 포함하는 네가지 유형의 대등 연결문을 마지작 단어 읽기 과제를 통해 비교하였다. 특히 통사과정에 관한 설명으로 제시되는, 근접 가설, 작업 기억 가설, 최근 필러 이용 가설의 상대적 설명의 효율성을 검증하고자 하였다. 실험 결과는, 주어가 공통논항인, 표준 어순의 연결문이 다른 세 유형의 연결문보다 이해 시간이 빨랐다. 이 결과는 어느 한 가설로는 설명될 수 없으며, 대안적인 설명으로 작업 기억내에서 이용 가능한 여러 정보의 상호 제약에 의해 이루어짐을 논의 했다.
PDF

The Basic Concepts Classification as a Bottom-Up Strategy for the Semantic Web

Szostak, Rick
- International Journal of Knowledge Content Development & Technology
- /
- v.4 no.1
- /
- pp.39-51
- /
- 2014
The paper proposes that the Basic Concepts Classification (BCC) could serve as the controlled vocabulary for the Semantic Web. The BCC uses a synthetic approach among classes of things, relators, and properties. These are precisely the sort of concepts required by RDF triples. The BCC also addresses some of the syntactic needs of the Semantic Web. Others could be added to the BCC in a bottom-up process that carefully evaluates the costs, benefits, and best format for each rule considered.
https://doi.org/10.5865/IJKCT.2014.4.1.039 인용 PDF KSCI KPUBS

Homonym disambiguation using syntactic pattern and recursive definition network (구문패턴과 순환 뜻풀이망을 이용한 동형이의어 분별)

이왕우;최호섭;옥철영
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.04b
- /
- pp.457-459
- /
- 2002
뜻풀이에서 추출한 의미 정보를 이용만 통계시인 방법의 기존 동형이의어 분별 시스템에는 불필요한 의미 정보들을 많이 가지고 있었다. 그리고 동형이의어간의 의미정보가 서로 교차하는 부분이 많아 확률적인 결정에 오류를 발생시켰다. 본 논문에서는 뜻풀이에서 구문패턴을 분석하여 보다 정제된 의미 정보를 추출하였고, 구문패턴에 속하는 어휘들의 하위어를 사전에서 자동 추출하여 부족한 의미 정보를 보완하였다. 또한, 구문패턴으로 분별할 수 없는 일부 동형이의어들은 순환 뜻풀이 망(RDN)을 이용하여 동형이의어를 분별하였다. 이러한 방법으로 동형이의어 분별을 통해 기존 연구보다 8%의 정확률 향상을 가져왔다.
PDF

A Content Site Management Model by Analyzing User Behavior Patterns (사용자 행동 패턴 분석을 이용한 규칙 기반의 컨텐츠 사이트 관리 모델)

김정민;김영자;옥수호;문현정;우용태
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.04a
- /
- pp.539-541
- /
- 2003
본 논문에서는 컨텐츠 사이트에서 디지털 컨텐츠를 보호하기 위하여 사용자 행동 패턴을 분석을 이용해 특이한 성향을 보이는 사용자를 탐지하기 위한 모델을 제시하였다. 사용자의 행동 패턴을 분석하기 위한 탐지 규칙(detection rule)으로 Syntactic Rule과 Semantic Rule을 정의하였다. 사용자 로그 분석 결과 탐지 규칙에 대한 위반 정도가 일정 범위를 벗어나는 사용자를 비정상적인 사용자로 추정하였다. 또한 제안 모델은 eCRM 시스템에서 이탈 가능성이 있는 고객 집단을 사전에 탐지하여 고객으로 유지하기 위한 promotion 전략 수립에 응용될 수 있다.
PDF

Conditional Beliefs in Discourse Representation Theory (담화표상이론에서의 조건적 믿음)

정소우
- Language and Information
- /
- v.6 no.1
- /
- pp.21-40
- /
- 2002
This paper explores Discourse Rep-resentation Structures which can successfully describe the mental representations that discourse participants form when they hear so-called double access sentences. The syntactic, semantic and pragmatic characteristics of double access sentences are discussed. The analysis proposed in this paper, employing a modified version of the 'conditional beliefs' of Chung(1997), successfully explains the semantic and pragmatic characteristics of present or future tense in double access sentences as well as when and why the speaker should take or can be exempted from the responsibility for using present or future tense in double access sentences.
PDF

On the Structure of Korean Comparative Constructions: A Constraint-based Approach

Kim, Jong-Bok;Sells, Peter
- Language and Information
- /
- v.13 no.2
- /
- pp.29-45
- /
- 2009
Every language employs its own morphological and syntactic ways of expressing gradable concepts and making comparison between properties of two objects. Korean uses the adverb te 'more' and the post-position pota 'than' to express such relations objects, but displays quite different grammatical properties from a language like English. This paper shows how a constraint-based grammar, HPSG, can provide a robust basis for the grammatical analysis of Korean comparative constructions.
PDF

Question Analysis based Syntactic Information in Korean Question Answering System (한국어 질의응답시스템에서 구문정보에 기반한 질의분석)

신승은;서영훈
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.04b
- /
- pp.931-933
- /
- 2004
본 논문에서는 한국어 질의응답시스템에서 정확한 정답추출을 위한 구문 정보에 기반한 질의분석을 제안한다. 질의분석은 세부 정답 유형 결정, 세분화된 키워드 추출을 통해 정확한 정답추출을 목적으로 한다. 술어 유형 정보를 이용하여 대분류 수준의 정답 유형으로 질의분석을 수행하고. 구문 구조 정보를 이용하여 중요 키워드와 일반 키워드를 추출한다 마지막으로 정답 유형 자질 명사를 이용하여 세부 정답 유형을 결정한다. 실험을 통해 세부 정답 유형 결정에서 정확률 59%, 세분화된 키워드 추출에서 정확을 66%를 보였다.
PDF

Search Result 717, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)