Search | Korea Science

근접 문맥정보와 대규모 웹 데이터를 이용한 단어 의미 중의성 해소

Kang, Sin-Jae;Kang, In-Su
- Proceedings of the Korea Society for Industrial Systems Conference
- /
- 2009.05a
- /
- pp.208-211
- /
- 2009
본 논문은 구글(Google), 워드넷(WordNet)과 같이 공개된 웹 자원과 리소스를 이용한 비교사학습(Unsupervised learning) 방법을 제안하여 단어 의미의 중의성 문제를 해결하고자 한다. 구글 검색 API를 이용하여 단어의 확장된 근접 문맥정보를 추출하고, 워드넷의 계층체계와 synset을 이용하여 단어 의미 구분정보를 자동 추출한 후, 추출된 정보 간 유사도 계산을 통해 중의성을 갖는 단어의 의미를 결정한다.
PDF

Automatic Named Entities Extraction Using the Graph-based Measurement Technique of the Mutual Importance (그래프 기반의 상호 중요도 측정 기법을 이용한 영역별 개체명 자동 추출)

Bae, Sangjoon;Ko, Youngjoong
- Annual Conference on Human and Language Technology
- /
- 2008.10a
- /
- pp.17-22
- /
- 2008
본 논문에서는 영역별로 자동으로 개체명을 추출하기 위하여 씨앗단어를 이용하고, 웹페이지와 개체명 후보들 간의 상호 중요도를 측정하여 개체명 후보들의 순위를 정하는 방식을 제안한다. 제안된 방식은 크게 세 단계에 의해서 수행되어 지는데 먼저 씨앗단어 정보를 이용하여 웹페이지를 검색하고, 검색되어진 웹 페이지와 씨앗단어 정보를 이용하여 패턴 규칙을 추출한다. 추출된 패턴 규칙을 웹페이지에 적용하여 개체명 후보들을 추출하고 추출된 후보들과 웹페이지 사이의 상호 중요도를 재귀적으로 계산하여 최종적으로 개체명 후보들의 순위가 정해 진다. 한국어와 영어 개체명 영역에 제안된 기법을 적용하여 실험한 결과 한국어에서는 78.72%의 MAP를 얻을 수 있었고, 영어에서는 96.48%의 MAP를 얻었다. 특히 영어 개체명 인식에서의 성능은 구글에서 제공하고 있는 구글셋의 결과보다도 높은 성능을 보였다.
PDF

Development of Search Web/App that can be used in Various Portal Sites (다양한 포털 사이트에서 활용 가능한 검색 Web/App 개발)

Yoon, Kyung Seob;Choi, JinHyeok
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2020.07a
- /
- pp.303-306
- /
- 2020
사용자들은 검색할 때 각자 자신이 애용하는 포털 사이트를 이용한다. 우리나라의 경우 네이버, 다음, 구글 등의 포털 사이트를 주로 이용하는 경향이 있다. 각 포털 사이트는 사이트마다 검색이 용이 하도록 검색 연산자들을 제공하고 있다. 하지만 각 포털 사이트마다 제공하는 검색 연산자의 종류, 사용 방법 등이 다양하게 제공 되고 있다. 일반적으로 대부분의 사용자들은 검색 연산자 활용이 저조할 뿐만 아니라 각 포털 사이트에서 제공되는 연산자를 인식하지 못하는 경향이 많이 발생 된다. 이 논문에서는 다양한 포털 사이트에서 활용 가능한 통합 검색 프로그램을 제공함으로써 포털 사이트 검색 및 각 포털 사이트의 검색 연산자 활용을 통한 검색 효율 극대화를 기하고자 하였다.
PDF

Detecting Intentionally Biased Web Pages In terms of Hypertext Information (하이퍼텍스트 정보 관점에서 의도적으로 왜곡된 웹 페이지의 검출에 관한 연구)

Lee Woo Key
- Journal of the Korea Society of Computer and Information
- /
- v.10 no.1 s.33
- /
- pp.59-66
- /
- 2005
The organization of the web is progressively more being used to improve search and analysis of information on the web as a large collection of heterogeneous documents. Most people begin at a Web search engine to find information. but the user's pertinent search results are often greatly diluted by irrelevant data or sometimes appear on target but still mislead the user in an unwanted direction. One of the intentional, sometimes vicious manipulations of Web databases is a intentionally biased web page like Google bombing that is based on the PageRank algorithm. one of many Web structuring techniques. In this thesis, we regard the World Wide Web as a directed labeled graph that Web pages represent nodes and link edges. In the Present work, we define the label of an edge as having a link context and a similarity measure between link context and target page. With this similarity, we can modify the transition matrix of the PageRank algorithm. By suggesting a motivating example, it is explained how our proposed algorithm can filter the Web intentionally biased web Pages effective about $60\%% rather than the conventional PageRank.
PDF

A Study on the Motivation and User Satisfaction of Medical Students Using the Academic Search Engines (의과대학 대학원생의 학술정보 검색엔진 이용 동기 및 이용자 만족도에 관한 연구)

Shim, Saebom;Yi, Yongjeong
- Journal of the Korean BIBLIA Society for library and Information Science
- /
- v.30 no.4
- /
- pp.197-216
- /
- 2019
The study aimed to investigate the motivation and user satisfaction of medical school graduate students using specific academic information sites, and examines the effect of usage motivation factors on user satisfaction. This study was conducted from September 10, 2018 to September 21, 2018 by a large medical school graduate student in Seoul, and analyzed 135 valid responses. According to the analysis, the degree of reliability and interactivity of PubMed is higher than that of Google Scholar in medical school graduate students' use of academic information sites. It was found to have a significant effect on user satisfaction of academic information sites. In particular, convenience has the greatest influence on users' choice of academic information sites. The results of the study are of theoretical implications in that the discussion of academic information services through web sites is explained in terms of use and satisfaction in the context of medical information. On the other hand, specific research results provide practical implications for improving the services of medical-related academic search engines.
https://doi.org/10.14699/kbiblia.2019.30.4.197 인용 PDF KSCI

Short-term Predictive Models for Influenza-like Illness in Korea: Using Weekly ILI Surveillance Data and Web Search Queries (한국 인플루엔자 의사환자 단기 예측 모형 개발: 주간 ILI 감시 자료와 웹 검색 정보의 활용)

Jung, Jae Un
- Journal of Digital Convergence
- /
- v.16 no.9
- /
- pp.147-157
- /
- 2018
Since Google launched a prediction service for influenza-like illness(ILI), studies on ILI prediction based on web search data have proliferated worldwide. In this regard, this study aims to build short-term predictive models for ILI in Korea using ILI and web search data and measure the performance of the said models. In these proposed ILI predictive models specific to Korea, ILI surveillance data of Korea CDC and Korean web search data of Google and Naver were used along with the ARIMA model. Model 1 used only ILI data. Models 2 and 3 added Google and Naver search data to the data of Model 1, respectively. Model 4 included a common query used in Models 2 and 3 in addition to the data used in Model 1. In the training period, the goodness of fit of all predictive models was higher than 95% ($R^2$). In predictive periods 1 and 2, Model 1 yielded the best predictions (99.98% and 96.94%, respectively). Models 3(a), 4(b), and 4(c) achieved stable predictability higher than 90% in all predictive periods, but their performances were not better than that of Model 1. The proposed models that yielded accurate and stable predictions can be applied to early warning systems for the influenza pandemic in Korea, with supplementary studies on improving their performance.
https://doi.org/10.14400/JDC.2018.16.9.147 인용 PDF KSCI

Image comparison and GPS technology for Search attractions (관광지 검색을 위한 이미지비교와 GPS기술)

Lee, Gun-Hee;Ha, Jin-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.403-406
- /
- 2011
대한민국의 스마트폰 보유자 수는 1300만 명 이상으로 세계최고의 수준을 자랑하고 있다. 그에 따라 스마트폰의 다양한 기능을 활용해 손쉽게 필요한 정보를 얻을 수 있어졌다. 본 논문서는 스마트폰 의 GPS기능과 카메라 기능을 활용해 쉽고 새로운 방법으로 서울시관광지를 검색 할 수 있는 기능을 제안한다. 기존에는 관광지의 이름을 이용해서 검색하는 방법이 일반적이었다. 검색의 정확성을 요할 수 있지만 사용자가 관광자의 이름을 모르는 상활에서는 검색에 어려움이 있었다, 하지만 이미지를 이용해 관광지를 검색하므로 사진 한 장만 있으면 바로 이미지의 검색이 가능해서 글자 검색의 한계를 극복 할 수 있었다. 논문에서는 이미지 비교알고리즘 중 색의 분포도를 이용한 이미지 비교알고리즘을 기술하고자 한다. 그리고 안드로이드의 GPS기능을 이용해 사용자의 위치와 관광지의 위치를 구글맵에 표시해서 사용자가 관광지를 보다 쉽게 찾아갈 수 있는 방법에 대해서도 기술했다.
https://doi.org/10.3745/PKIPS.y2011m11a.403 인용 PDF

Design and Implementation of Spatial Guide System Using Google Maps API (구글맵 API를 이용한 공간적 방향안내 시스템의 설계 및 구현)

Kim, Taehyeok;Moon, Hyoenjun;Moon, Mikyeong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.11a
- /
- pp.1264-1266
- /
- 2017
현재 사람들이 원하는 목적지를 가기 위해 스마트 폰의 여러 지도 애플리케이션들이나 이정표를 이용한다. 그러나 2차원으로 나타나는 스마트폰의 지도 애플리케이션들로 3차원인 현실의 방향을 파악하기란 쉽지 않고, 이정표 또한 안내 가능한 방향, 게시되는 언어의 수가 제한되는 등의 문제가 있다. 본 논문에서는 이러한 문제를 보완할 수 있는 구글 맵 API를 이용하여 공간적으로 방향을 안내 해주는 시스템의 설계 및 개발 내용에 대해 기술한다. 이 시스템은 구글맵 API의 검색 기능을 통해 얻은 목적지를 안내판이 회전하여 이용자에게 방향과 거리를 안내함으로써, 3차원인 현실에서의 공간적 방향을 파악하는데 도움을 준다. 또한 기존의 이정표와 달리 새로운 장소에 대해 소프트웨어적인 업데이트만 실시하면 되므로 유지비용을 줄이는 효과를 기대 할 수 있다.
https://doi.org/10.3745/PKIPS.y2017m11a.1264 인용 PDF

Disease related Gene Identification Using Literature and Google data (텍스트마이닝 기법과 구글데이터를 이용한 질병관련 유전자 식별)

Kim, Jeong-U;Kim, Hyeon-Jin;Park, Sang-Hyeon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.11a
- /
- pp.1084-1087
- /
- 2013
텍스트마이닝은(Text mining) 바이오분야에서 사용되는 도구 중 하나이다. 본 논문에서는 전립선암(Prostate cancer)과 관련된 질병 유전자(Disease gene)를 찾기 위해 텍스트마이닝을 이용하여 유전자 네트워크(Gene-network)를 구축하였다. 추가적으로 구글(Google) 검색을 통해 네트워크 내의 유전자 노드(Node)들 사이의 간선(Edge)에 새로운 가중치(Weight)를 추가하고 네트워크를 재구성하였다. 구축된 네트워크에서 노드와 노드 사이의 가중치를 기반으로 전립선암과 관련된 질병 유전자를 추출하였다. 본 논문의 방법은 성공적으로 네트워크를 구축하고 질병 유전자를 찾았으며, 구글 데이터를 사용하지 않고 네트워크를 구축하는 경우보다 더 높은 정확성을 입증했다.
https://doi.org/10.3745/PKIPS.y2013m11a.1084 인용 PDF

Use Case of Google Trigram: Application to Text Editor (구글 Trigram의 활용 사례: 문서 편집기로의 활용)

Hwang, Myung-Gwon;Lee, Hyo-Gap;Choi, Dong-Jin;Kim, Pan-Koo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2010.11a
- /
- pp.3-5
- /
- 2010
본 논문은 구글(Google)에서 제공하는 n-gram 데이터가 사람들에게 실제로 어느정도의 효율성을 제공할 수 있는지에 대한 내용을 다루고 있다. 이를 위해, 구글의 trigram만을 이용하며, 규모를 줄이기 위해 필터링 과정을 거쳐 trigram 데이터베이스를 형성하였다. 그리고 이를 반영할 수 있는 텍스트 에디터를 구현하여, 사람의 타이핑 속도에 따라 얻을 수 있는 효율성을 측정하였다. 실험 결과에서 n-gram 데이터가 업무 향상에 효율성을 제공할 수 있지만, 여전히 큰 규모의 데이터집합에서 검색하는데 소모되는 시간의 한계점을 발견할 수 있었다.
https://doi.org/10.3745/PKIPS.y2010m11a.3 인용 PDF

Search Result 178, Processing Time 0.036 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)