• Title/Summary/Keyword: Web of Data

Search Result 5,562, Processing Time 0.036 seconds

Design and Implementation of a Search Engine based on Apache Spark (아파치 스파크 기반 검색엔진의 설계 및 구현)

  • Park, Ki-Sung;Choi, Jae-Hyun;Kim, Jong-Bae;Park, Jae-Won
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.1
    • /
    • pp.17-28
    • /
    • 2017
  • Recently, a study on data has been actively conducted because the value of the data has become more useful. Web crawler that is program of data collection recently spotlighted because it can take advantage of the various fields. Web crawler can be defined as a tool to analyze the web pages and collects the URL by traversing the web server in an automated manner. For the treatment of Big-data, distributed Web crawler is widely used which is based on the Hadoop MapReduce. But, it is difficult to use and has constraints on the performance. Apache spark that is the In-memory computing platform is an alternative to MapReduce. The search engine which is one of the main purposes of web crawler displays the information you search by keyword gathered by web crawler. If search engines implement a spark-based web crawler instead of traditional MapReduce-based web crawler, it would be a more rapid data collection.

A Designing for Successful Learning on the Web

  • Ahn, Jeong-Yong;Han, Kyung-Soo;Han, Beom-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.1083-1090
    • /
    • 2003
  • Web-based learning is currently an active area of research and a considerable number of studies have been conducted on its application in the learning environment. However, in spite of many advances in the research and development of the educational contents, questions about how the environment affects learning remains largely unanswered. In this article, we propose a Web-based learning environment to improve the educational effect. The goal of this article is not to provide a complete system to support Web-based learning but rather to describe some meaningful strategies and fundamental design concepts that utilize information technologies to support teaching and learning.

  • PDF

Representation of Process Plant Equipment Using Ontology and ISO 15926 (온톨로지와 ISO 15926을 이용한 공정 플랜트 기자재의 표현)

  • Mun, Du-Hwan;Kim, Byung-Chul;Han, Soon-Hung
    • Korean Journal of Computational Design and Engineering
    • /
    • v.14 no.1
    • /
    • pp.1-9
    • /
    • 2009
  • ISO 15926 is an international standard for the representation of process plant lifecycle data. However, it is not easy to implement the part 2-data model and the part 4-initial reference data because of their complexity in terms of data structure and shortages of related development toolkits. To overcome this problem, ISO 15926-7(part 7) is under development. ISO 15926-7 specifies implementation methods for sharing and exchange of process plant lifecycle data, which is based on semantic web technologies such as OWL, Web Services, and SPARQL. For the application of ISO 15926-7, this paper discusses how to represent technical specifications of process plant equipment by defining user-defined reference data and object information model with an example of reactor coolant pumps located in the reactor coolant system of an APR 1400 nuclear power plant.

Behavior analysis of entrance applicants using web log data (웹 로그데이터를 이용한 대학입시 지원자 행태 분석)

  • Choi, Seung-Bae;Kang, Chang-Wan;Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.3
    • /
    • pp.493-504
    • /
    • 2009
  • The web log data analysis is to analysis traces which visitors remain while they drop by a web-site. Ultimately it can help to obtain a lot of useful information that can efficiently manage homepage and perform CRM(customer relationship management) using obtained information. In this paper, we provide a basic information to manage efficiently homepage of D university and to establish strategy for invitation of new pupil, as analyzing web log data for D university.

  • PDF

A STUDY ON BIM-BASED 5D SIMULATION IN WEB ENVIRONMENT

  • Jae-Bok Lim;Jae-Hong Ahn;Ju-Hyung Kim;Jae-Jun Kim
    • International conference on construction engineering and project management
    • /
    • 2013.01a
    • /
    • pp.169-172
    • /
    • 2013
  • Building Information Modeling (BIM) is an effective decision-making platform that helps to save project cost and enhance quality of construction. By generating and linking a wide variety of objects data, BIM can be effectively utilized, and it should be ensured that object properties maintain consistency throughout the project period of design, estimates, construction, maintenance and repair. This study examined how to utilize BIM data in a construction project, by linking cost and schedule data in web environment, to better utilize the information and maintain consistency of the BIM information. To do so, the model integrated WBS data and CBS data, linked them with BIM model to realize 5D simulation in web environment. As a result, cost and schedule data could be simultaneously acquired, and object properties-cost, schedule, location-as well. These are expected to contribute to developing a BIM-based automatic data-processing system in web environment.

  • PDF

Integration of Gear Design Data using XML in the Web-based Environment (웹 기반 환경에서 XML을 이용한 기어 설계 데이터의 통합)

  • 정태형;박승현
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2001.04a
    • /
    • pp.627-630
    • /
    • 2001
  • XML is suitable to integrate various forms of engineering design data since it possesses the characteristics of both documents and data. In this research a web-based design system has been developed, which integrates various gear design data in the form of XML. The system generates XML document containing gear design data and transforms gear design data in the relational database into XML document form automatically. The XML documents are transmitted to gear modeler agent through SOAP, and then the agent is automatically executed and generates CAD model files and VRML files. The designer can check the generated VRML model of gear immediately in the web service.

  • PDF

An Exploratory Study on the Semantic Network Analysis of Food Tourism through the Big Data (빅데이터를 활용한 음식관광관련 의미연결망 분석의 탐색적 적용)

  • Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.23 no.4
    • /
    • pp.22-32
    • /
    • 2017
  • The purpose of this study was to explore awareness of food tourism using big data analysis. For this, this study collected data containing 'food tourism' keywords from google web search, google news, and google scholar during one year from January 1 to December 31, 2016. Data were collected by using SCTM (Smart Crawling & Text Mining), a data collecting and processing program. From those data, degree centrality and eigenvector centrality were analyzed by utilizing packaged NetDraw along with UCINET 6. The result showed that the web visibility of 'core service' and 'social marketing' was high. In addition, the web visibility was also high for destination, such as rural, place, ireland and heritage; 'socioeconomic circumstance' related words, such as economy, region, public, policy, and industry. Convergence of iterated correlations showed 4 clustered named 'core service', 'social marketing', 'destinations' and 'social environment'. It is expected that this diagnosis on food tourism according to changes in international business environment by using these web information will be a foundation of baseline data useful for establishing food tourism marketing strategies.

Shoring STEP Data over Internet using WWW (WWW를 이용한 제품정보의 공유)

  • Choi, Young;Shin, Ha-Yong;Park, Myung-Jin;Lee, Jong-Gap
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.23 no.3
    • /
    • pp.597-608
    • /
    • 1997
  • Life cycle product data is very important yet difficult to handle for manufacturing companies. Shoring and exchanging product data over world-wide-web is a part of key technology to implement PDM or CALS. STEP is widely accepted as a standard to represent the life-cycle product model data. Described in this paper is a web browser plug-in that can graphically display and explore product data represented by STEP over internet. By the use of the plug-in (named "npSTEP"), a product model data stored in STEP format on a web server can be displayed on a commonly used web client (browser), such as Netscape navigator, without any format conversion process. Furthermore one can explore the components or attributes of the product model data in hierarchical manner.

  • PDF

Spatial Index based on Main Memory for Web CIS (Web GIS를 위한 주기억 장치 기반 공간 색인)

  • 김진덕;진교홍
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2001.10a
    • /
    • pp.191-194
    • /
    • 2001
  • The availability of the inexpensive, large main memories coupled with the demand for faster response time are bringing a new perspective to database technology. The Web GIS used by u unspecified number of general public in the internet needs high speed response time and frequent data retrieval for spatial analysis rather than data update. Therefore, it is appropriate to use main memory as a underlying storage structures for the Web GIS data. In this paper, we propose a data representation method based on relative coordinates and the size of the MBR. The method is able to compress the spatial data widely used in the Web GIS into smaller volume of memory. We also propose a memory resident spatial index with simple mechanism for processing point and region queries. The performance test shows that the index is suitable for managing the skewed data in terms of the size of the index and the number of the MBR intersection check operations.

  • PDF

Merchandise Management Using Web Mining in Business To Customer Electronic Commerce (기업과 소비자간 전자상거래에서의 웹 마이닝을 이용한 상품관리)

  • 임광혁;홍한국;박상찬
    • Journal of Intelligence and Information Systems
    • /
    • v.7 no.1
    • /
    • pp.97-121
    • /
    • 2001
  • Until now, we have believed that one of advantages of cyber market is that it can virtually display and sell goods because it does not necessary maintain expensive physical shops and inventories. But, in a highly competitive environment, business model that does away with goods in stock must be modified. As we know in the case of AMAZON, leading companies already consider merchandise management as a critical success factor in their business model. That is, a solution to compete against one's competitors in a highly competitive environment is merchandise management as in the traditional retail market. Cyber market has not only past sales data but also web log data before sales data that contains information of path that customer search and purchase on cyber market as compared with traditional retail market. So if we can correctly analyze the characteristics of before sales patterns using web log data, we can better prepare for the potential customers and effectively manage inventories and merchandises. We introduce a systematic analysis method to extract useful data for merchandise management - demand forecasting, evaluating & selecting - using web mining that is the application of data mining techniques to the World Wide Web. We use various techniques of web mining such as clustering, mining association rules, mining sequential patterns.

  • PDF