Search | Korea Science

A Document Collection Method for More Accurate Search Engine (정확도 높은 검색 엔진을 위한 문서 수집 방법)

하은용;최선완
- Proceedings of the Korean Information Science Society Conference / 1999.10c / pp.471-473 / 1999
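Internet search engines run web robots that visit the many web servers connected to the Internet, retrieve web documents, and use indexing techniques to extract and classify data, building the database on which the search engine is based. When a web robot gathers information without prior knowledge of the web servers, it sends many unnecessary requests, which increases Internet traffic. If, instead, a web server notifies web robots in advance with summary information about the documents it makes public, and each robot uses that information to collect the corresponding documents from the server, unnecessary Internet traffic can be reduced and the accuracy of the search engine's information improved. This paper proposes a document monitoring and notification system that automatically checks for changes to web document files on a web server, compiles the changes, and sends them to each registered web robot, together with an efficient web robot that, based on the notified summary information, retrieves the corresponding documents from the web server and extracts the necessary index information.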

Development of Yóukè Mining System with Yóukè's Travel Demand and Insight Based on Web Search Traffic Information (웹검색 트래픽 정보를 활용한 유커 인바운드 여행 수요 예측 모형 및 유커마이닝 시스템 개발)

Choi, Youji;Park, Do-Hyung
- Journal of Intelligence and Information Systems / v.23 no.3 / pp.155-175 / 2017
As social data have come into the spotlight, mainstream web search engines have begun to provide data showing how many people searched for a specific keyword: web search traffic data. Web search traffic information aggregates the crowd of users who search for a given keyword. In various fields, web search traffic can serve as a useful variable representing the attention of ordinary users to specific interests. Many studies use web search traffic data to nowcast or forecast social phenomena, for example in epidemic prediction, consumer pattern analysis, product life cycles, and financial investment modeling. Web search traffic data have also begun to be applied to predicting inbound tourism. Proper demand prediction is needed because tourism is a high value-added industry that increases employment and foreign exchange earnings. Among inbound tourists, the number of Chinese tourists (Youke) in particular keeps growing; Youke have been the largest group of inbound tourists to Korea for many years, and tourism revenue per Youke has also been the largest. Research into proper demand prediction approaches for Youke is therefore important for both the public and private sectors, since accurate tourism demand prediction supports efficient decision making under limited resources. This study suggests an improved model that reflects the latest social issues by capturing the attention of groups of individuals. Travel abroad is generally a high-involvement activity, so potential tourists are likely to search deeply for information about their trip. Web search traffic data capture tourists' attention while they prepare their journey in an immediate and dynamic way. This study therefore attempted to select keywords that potential Chinese tourists are likely to search for on the Internet. Baidu, the largest web search engine in China with a market share of over 80%, gives users access to web search traffic data. Qualitative interviews with potential tourists helped us understand information search behavior before a trip and identify the keywords for this study. The selected keywords are categorized into three levels according to how directly they relate to "Korean Tourism". Classifying them in this way helps identify which keywords explain Youke inbound demand, from the closest category to the most distant. Web search traffic data for each keyword were gathered by a web crawler developed to crawl search data from Baidu Index. Using the automatically gathered variables, a linear model was built by multiple regression analysis, which suits operational use in decision and policy making because the relationships among variables are easy to explain. After the regression models were built, a model composed of traditional variables was compared with a model that adds web search traffic variables to the traditional model, in terms of significance and R squared. After comparing the performance of the models, the final model was composed. The final regression model offers improved explanatory power, as well as real-time immediacy and convenience, compared with the traditional model. Furthermore, this study demonstrates a system that is intuitively visualized for general use: the Youke Mining solution provides several functions for tourism decision making, including the embedded final regression model. The Youke Mining solution has an algorithm based on data science and a well-designed, simple interface. Finally, this research has implications in three respects: theoretical, practical, and policy. Theoretically, the Youke Mining system and the model in this research are a first step toward Youke inbound prediction using an interactive and instant variable: web search traffic information, which represents tourists' attention while they prepare their trip. Practically, because Baidu holds more than 80% of the web search engine market, Baidu data can represent, in real time, the attention of potential tourists preparing their own trip. In policy terms, the Chinese tourist demand prediction model based on web search traffic can be used in tourism decision making for efficient resource management and for optimizing opportunities for successful policy.
https://doi.org/10.13088/jiis.2017.23.3.155
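The model comparison described in this abstract (a regression on traditional variables versus the same regression with web search traffic variables added, compared by R squared) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic, the variable names `traditional`, `search`, and `arrivals` are hypothetical, and the hand-written least-squares solver stands in for whatever statistical software the study actually used.

```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def r_squared(rows, y):
    """Fit ordinary least squares with an intercept and return in-sample R^2."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * t for d, t in zip(design, y)) for i in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, d)) for d in design]
    mean_y = sum(y) / len(y)
    ss_res = sum((t - f) ** 2 for t, f in zip(y, fitted))
    ss_tot = sum((t - mean_y) ** 2 for t in y)
    return 1.0 - ss_res / ss_tot


# Synthetic data for illustration only; none of these numbers come from the study.
random.seed(0)
n = 60
traditional = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(n)]  # hypothetical predictors
search = [random.gauss(0, 1) for _ in range(n)]  # hypothetical keyword search index
arrivals = [
    1.0 * t[0] - 0.5 * t[1] + 0.8 * s + random.gauss(0, 0.3)
    for t, s in zip(traditional, search)
]

r2_base = r_squared(traditional, arrivals)
r2_aug = r_squared([t + [s] for t, s in zip(traditional, search)], arrivals)
print(f"baseline R^2 = {r2_base:.3f}, with search traffic R^2 = {r2_aug:.3f}")
```

In-sample R squared never decreases when a variable is added to a nested model, which is presumably why the abstract pairs R squared with significance when comparing the two models.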

Multiple Web-Information Viewer removing repetitive web searching (반복적 웹 검색을 제거한 다중 웹정보 뷰어)

Lee, Jung-Soo;Lee, Sang-Ho
- Proceedings of the Korea Information Processing Society Conference / 2014.04a / pp.964-966 / 2014
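With the rapid growth in Internet users, information is being produced without limit and scattered everywhere, so the time spent searching for information keeps increasing. In particular, to obtain information that is updated repeatedly, such as notices or weather, people periodically search for the same information, which causes unnecessary traffic and wastes search time. This paper describes the problems caused by periodically searching for the same information and, to solve them, designs and implements a web component that extracts only the relevant information from multiple websites and arranges it within a single web page. With this system, a user can obtain information stored on multiple websites simply by clicking one web page, without web surfing, which greatly reduces information search time. To implement the system, the paper describes a way to bypass the same-origin policy, a web standard policy that prohibits extracting and manipulating information from cross-domain web documents, and describes the problems that arise from circumventing this policy along with their solutions. Finally, the system is compared with existing related systems to show its advantages.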
https://doi.org/10.3745/PKIPS.y2014m04a.964

Design of CORBA-based Multi-Agent Model for Distributed Information Service (분산 정보 서비스를 위한 CORBA 기반의 멀티 에이전트 모델 설계)

Kim, Kwang-Jong;Ko, Hyun;Lee, Yon-Sik
- Proceedings of the Korea Information Processing Society Conference / 2002.04a / pp.327-330 / 2002
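Many studies have attempted to satisfy the diverse requirements of dynamic services for efficient Internet service in the web environment. However, limited network bandwidth leads to increased network traffic and load on server systems, so stable information services are not being delivered; moreover, because existing information services are provided only through searches performed directly by the user, a new form of information service support is required. This paper therefore designs a CORBA-based multi-agent model that supports efficient information retrieval, stable information service, and reduced network traffic in a distributed environment. In this model, the individual agents maintain complementary relationships and, through inter-agent interaction, support stable and proactive information service through network traffic detection, reduced search time and network traffic, accurate information retrieval through the maintenance of search keywords, and automatic management of system resources, thereby improving the quality of information service for users.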

Intelligent Brand Positioning Visualization System Based on Web Search Traffic Information : Focusing on Tablet PC (웹검색 트래픽 정보를 활용한 지능형 브랜드 포지셔닝 시스템 : 태블릿 PC 사례를 중심으로)

Jun, Seung-Pyo;Park, Do-Hyung
- Journal of Intelligence and Information Systems / v.19 no.3 / pp.93-111 / 2013
As the Internet and information technology (IT) continue to develop and evolve, the issue of big data has emerged at the foreground of scholarly and industrial attention. Big data is generally defined as data that exceed the range that can be collected, stored, managed and analyzed by existing conventional information systems, and the term also refers to the new technologies designed to effectively extract value from such data. With the widespread dissemination of IT systems, continual efforts have been made in various fields of industry such as R&D, manufacturing, and finance to collect and analyze immense quantities of data in order to extract meaningful information and to use this information to solve various problems. Since IT has converged with various industries in many aspects, digital data are now being generated at a remarkably accelerating rate while developments in state-of-the-art technology have led to continual enhancements in system performance. The types of big data that are currently receiving the most attention include information available within companies, such as information on consumer characteristics, information on purchase records, logistics information and log information indicating the usage of products and services by consumers, as well as information accumulated outside companies, such as information on the web search traffic of online users, social network information, and patent information. Among these various types of big data, web searches performed by online users constitute one of the most effective and important sources of information for marketing purposes because consumers search for information on the internet in order to make efficient and rational choices. Recently, Google has provided public access to its information on the web search traffic of online users through a service named Google Trends.
Research that uses this web search traffic information to analyze the information search behavior of online users is now receiving much attention in academia and industry. Studies using web search traffic information can be broadly classified into two fields. The first field consists of empirical demonstrations that show how web search information can be used to forecast social phenomena, the purchasing power of consumers, the outcomes of political elections, etc. The other field focuses on using web search traffic information to observe consumer behavior, for example by identifying the attributes of a product that consumers regard as important or tracking changes in consumers' expectations, but relatively less research has been completed in this field. In particular, to the best of our knowledge, hardly any studies related to brands have yet attempted to use web search traffic information to analyze the factors that influence consumers' purchasing activities. This study aims to demonstrate that consumers' web search traffic information can be used to derive the relations among brands and the relations between an individual brand and product attributes. When consumers input their search words on the web, they may use a single keyword for the search, but they also often input multiple keywords to seek related information (this is referred to as simultaneous searching). A consumer performs a simultaneous search either to compare two product brands and obtain information on their similarities and differences, or to acquire more in-depth information about a specific attribute of a specific brand. Web search traffic information shows that the quantity of simultaneous searches using certain keywords increases when the relation is closer in the consumer's mind, so it is possible to derive the relations between the keywords by collecting this relational data and subjecting it to network analysis.
Accordingly, this study proposes a method of analyzing how brands are positioned by consumers and what relationships exist between product attributes and an individual brand, using simultaneous search traffic information. It also presents case studies demonstrating the actual application of this method, with a focus on tablets, belonging to innovative product groups.
https://doi.org/10.13088/jiis.2013.19.3.093
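The simultaneous-search idea in this abstract (keyword pairs that are searched together more often are treated as more closely related, and the pairs form a network) can be sketched as below. This is only an illustration under stated assumptions: the brand and attribute names and all counts are hypothetical, and the helper functions `build_network` and `closest` are not from the paper, which does not specify its network analysis procedure here.

```python
from collections import defaultdict

# Hypothetical simultaneous-search counts: (keyword A, keyword B) -> search volume.
co_search = {
    ("BrandA", "BrandB"): 120,
    ("BrandA", "BrandC"): 30,
    ("BrandB", "BrandC"): 45,
    ("BrandA", "battery"): 80,
    ("BrandC", "price"): 95,
}


def build_network(pairs):
    """Turn pair counts into a symmetric weighted adjacency map."""
    graph = defaultdict(dict)
    for (a, b), weight in pairs.items():
        graph[a][b] = graph[a].get(b, 0) + weight
        graph[b][a] = graph[b].get(a, 0) + weight
    return graph


def closest(graph, node, candidates):
    """Return the candidate most strongly co-searched with node, or None if none is linked."""
    linked = {c: graph[node][c] for c in candidates if c in graph[node]}
    return max(linked, key=linked.get) if linked else None


network = build_network(co_search)
brands = {"BrandA", "BrandB", "BrandC"}
attributes = {"battery", "price"}

# Which competing brand, and which attribute, is each brand most associated with?
for brand in sorted(brands):
    rival = closest(network, brand, brands - {brand})
    attribute = closest(network, brand, attributes)
    print(brand, "-> nearest brand:", rival, "| nearest attribute:", attribute)
```

Edge weights here are raw counts; a real analysis would likely normalize them by each keyword's overall search volume before comparing relations.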

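Analysis of the Status of and Interest in Government R&D Projects on Fourth Industrial Revolution Technologies: Focusing on Graphene, Big Data, and Biomarkers (4차 산업혁명 관련 기술의 정부R&D과제 현황과 관심도 분석 -그래핀, 빅데이터, 바이오마커를 중심으로)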

Jeong, Jae-Ung;Park, Hyeon-U;Seong, Tae-Eung
- Proceedings of the Korea Technology Innovation Society Conference / 2017.05a / pp.549-558 / 2017
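With the arrival of the era of the Fourth Industrial Revolution, interest in related technologies is growing day by day. Against this background, this study uses web search traffic to compare domestic and international trends in Fourth Industrial Revolution technologies and analyzes the status of Korean government R&D projects on those technologies. The analysis first performs a network analysis of the status of government R&D projects on the related technologies and classifies the analyzed technologies into three categories of the Fourth Industrial Revolution. Then, for each technology selected by category, Google web search traffic is used to compare and analyze domestic and international trends. The study is expected to reveal past and present trends in interest in Fourth Industrial Revolution technologies. In addition, by network-analyzing and visualizing the status of related government R&D projects, it aims to provide intuitive information on the state of R&D related to the Fourth Industrial Revolution. By presenting domestic and international trend information on the technologies of the Fourth Industrial Revolution, which is described as humanity's greatest revolution, together with an intuitive view of the government R&D carried out to date, this study is expected to contribute to setting the right policy direction for the future that comes with the Fourth Industrial Revolution.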

A Document Collection Method for More Accurate Search Engine (정확도 높은 검색 엔진을 위한 문서 수집 방법)

Ha, Eun-Yong;Gwon, Hui-Yong;Hwang, Ho-Yeong
- The KIPS Transactions:PartA / v.10A no.5 / pp.469-478 / 2003
Internet search engines use web robots to visit servers connected to the Internet, periodically or non-periodically. They extract and classify the collected data according to their own methods and construct the databases that form the basis of web information search engines. These procedures are repeated very frequently on the Web. Many search engine sites run this processing strategically in order to become popular Internet portal sites that provide users with ways to find information on the web. A web search engine contacts many thousands of web servers, maintains its existing databases, and navigates to gather data about newly connected web servers. But these jobs are decided and conducted by the search engines alone. They run web robots to collect data from web servers without knowledge of the state of those servers. Each search engine issues many requests to, and receives responses from, web servers, which is one cause of increased Internet traffic. If each web server notifies web robots of a summary of its public documents, and each web robot then uses this summary to collect the corresponding documents from the web server, unnecessary Internet traffic is eliminated and the accuracy of the data in search engines improves. The processing overhead of web-related jobs on web servers and search engines also decreases. In this paper, a monitoring system on the web server is designed and implemented; it monitors the state of documents on the web server, summarizes the changes to modified documents, and sends the summary information to web robots that want to get documents from the web server. An efficient web robot for the web search engine is also designed and implemented; it uses the notified summary to get the corresponding documents from the web servers, extracts index information, and updates its databases.
https://doi.org/10.3745/KIPSTA.2003.10A.5.469
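The server-side monitoring step described in this abstract (detect which documents changed and send robots a summary, so they fetch only those documents) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `snapshot` and `summarize_changes`, the use of SHA-256 content hashes to detect changes, and the summary format are all assumptions made for the example.

```python
import hashlib
from pathlib import Path


def snapshot(root):
    """Map each file under root (as a relative POSIX path) to a SHA-256 digest of its contents."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def summarize_changes(old, new):
    """Compare two snapshots and list the added, modified, and removed document paths."""
    return {
        "added": sorted(set(new) - set(old)),
        "modified": sorted(k for k in new if k in old and new[k] != old[k]),
        "removed": sorted(set(old) - set(new)),
    }


# Hypothetical snapshots taken at two points in time (digests shortened for readability).
before = {"index.html": "aa11", "news.html": "bb22", "old.html": "cc33"}
after = {"index.html": "aa11", "news.html": "dd44", "new.html": "ee55"}
summary = summarize_changes(before, after)
print(summary)
```

A robot receiving such a summary would request only the paths under `added` and `modified` and drop index entries for those under `removed`, instead of re-crawling every document on the server.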

Design and Implementation of Web-Based Network Management System (웹 기반 네트워크 관리 시스템 설계 및 구현)

김형길;이수영;김명균
- Proceedings of the Korea Multimedia Society Conference / 2000.11a / pp.549-552 / 2000
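This paper describes the design and implementation of WNMS (Web-based Network Management System). WNMS provides three main functions: network configuration management, a MIB (Management Information Base) browser, and traffic monitoring. When the management server starts, network configuration management discovers networks and network devices, allowing the administrator to grasp the overall network configuration at a glance. The MIB browser lets the administrator directly retrieve or manipulate specific management information (MIB node values) of an SNMP agent. Traffic monitoring displays the traffic of the networks and network devices found by configuration management as real-time graphs.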

Design of Chatting Architecture that Handle Large-Scale Traffic (대용량 트래픽 처리를 위한 채팅 구조 설계)

Hong, Seong-Mun;Lee, Yoon-jae;Ko, Se-Young;Jung, Seung-Woo
- Proceedings of the Korea Information Processing Society Conference / 2019.10a / pp.13-16 / 2019
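Web service traffic fluctuates widely. To prepare for traffic that changes in real time, a service must provision its servers assuming peak traffic. However, peak traffic differs greatly from average traffic, so such provisioning wastes many resources. To cope with traffic that changes in real time, we designed the system using a distributed system architecture, an in-memory cache, a messaging queue, and similar components. We also designed it to store and retrieve messages effectively using the in-memory cache and NoSQL.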
https://doi.org/10.3745/PKIPS.y2019m10a.13

An Implementation of The multimedia Information searching System Using Extended object-Oriented Middleware(CORBA/JAVA) on Web Envionments (웹 환경에서 확장된 객체지향형 미들웨어(CORBA/JAVA)를 이용한 멀티미디어 정보 검색 시스템 구현)

Lee, Won-Jung;Ahn, Gil-Sou;Joo, Su-Chong
- The Transactions of the Korea Information Processing Society / v.5 no.7 / pp.1847-1854 / 1998
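Recently, many researchers have sought to satisfy diverse requirements for dynamic rather than static services in order to provide efficient Internet service in the Web (World Wide Web) environment. CGI was proposed as one method of interworking among distributed applications for obtaining information from servers on the web. With this method, however, the server cannot overcome the heavy processing burden and network traffic load caused by static binding between service objects. For this reason, this paper uses an extended object-oriented middleware instead of conventional CGI so that distributed resources can be supported efficiently on the web. The target system was implemented on the web: the client modules connect to the CORBA middleware to request information from an arbitrary server, and the server module receives client requests through the CORBA/JAVA middleware and searches for multimedia information within the web server.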

