• Title/Summary/Keyword: Web Mining

Search Result 549

An Efficient Search Method of Product Reviews using Opinion Mining Techniques (오피니언 마이닝 기술을 이용한 효율적 상품평 검색 기법)

  • Yune, Hong-June;Kim, Han-Joon;Chang, Jae-Young
    • Journal of KIISE: Computing Practices and Letters / v.16 no.2 / pp.222-226 / 2010
  • With the continuously increasing volume of e-commerce transactions, it is now common to buy products and evaluate them on the World Wide Web. Product reviews are very useful to customers because they can make better decisions based on the indirect experience obtainable through these reviews. However, since online shopping malls do not provide ranked results, it is not easy for users to read all the relevant review documents effectively. Product reviews include subjective and emotional opinions. Thus, review search differs from general web search in terms of ranking strategy. In this paper, we propose an effective method of ranking reviews that reflects the user's intention by using opinion mining techniques. The proposed method analyzes product reviews using the query words and the sentiment polarity of subjective opinions. Through diverse experiments, we show that our proposed method outperforms conventional ones.
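The idea of ranking reviews by both query words and sentiment polarity can be illustrated with a minimal sketch. This is not the authors' method: the tiny lexicon, the additive scoring formula, and the names `score_review` and `rank_reviews` are all assumptions made for illustration.

```python
import re

# Hypothetical toy sentiment lexicon (an assumption, not taken from the paper).
POSITIVE = {"good", "great", "excellent", "love", "fast"}
NEGATIVE = {"bad", "poor", "terrible", "slow", "broken"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def score_review(review, query_terms, desired_polarity):
    """Return (relevance, score) for one review.

    relevance: fraction of tokens that match a query term.
    score: relevance plus the review's polarity, signed by the polarity
    the user is looking for (+1 for positive, -1 for negative opinions).
    """
    tokens = tokenize(review)
    if not tokens:
        return 0.0, 0.0
    relevance = sum(tokens.count(t) for t in query_terms) / len(tokens)
    pos = sum(1 for t in tokens if t in POSITIVE)
    neg = sum(1 for t in tokens if t in NEGATIVE)
    polarity = (pos - neg) / len(tokens)
    return relevance, relevance + desired_polarity * polarity


def rank_reviews(reviews, query, desired_polarity=1):
    """Rank reviews that mention the query, best match first."""
    query_terms = tokenize(query)
    scored = []
    for review in reviews:
        relevance, score = score_review(review, query_terms, desired_polarity)
        if relevance > 0:  # drop reviews that never mention the query
            scored.append((score, review))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [review for _, review in scored]


if __name__ == "__main__":
    sample = [
        "The battery is great and charging is fast",
        "The battery is terrible and slow",
        "Nice box",
    ]
    for review in rank_reviews(sample, "battery", desired_polarity=1):
        print(review)
```

Passing `desired_polarity=-1` ranks the same reviews for a user who is looking for complaints rather than praise.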

Research Trends Investigation Using Text Mining Techniques: Focusing on Social Network Services (텍스트마이닝을 활용한 연구동향 분석: 소셜네트워크서비스를 중심으로)

  • Yoon, Hyejin;Kim, Chang-Sik;Kwahk, Kee-Young
    • Journal of Digital Contents Society / v.19 no.3 / pp.513-519 / 2018
  • The objective of this study was to examine research trends in social network services. The abstracts of 308 articles published between 1994 and 2016 were extracted from the Web of Science database. Time series analysis and topic modeling, a text mining technique, were applied. The topic modeling results showed that the research fell mainly into 20 topics: trust, support, satisfaction model, organization governance, mobile system, internet marketing, college student effect, opinion diffusion, customer, information privacy, health care, web collaboration, method, learning effectiveness, knowledge, individual theory, child support, algorithm, media participation, and context system. The time series regression results indicated that trust, support satisfaction model, and remains of the topics were hot topics. This study also provided suggestions for future research.

Nonlinear stability of the upper chords in half-through truss bridges

  • Wen, Qingjie;Yue, Zixiang;Liu, Zhijun
    • Steel and Composite Structures / v.36 no.3 / pp.307-319 / 2020
  • The upper chords in half-through truss bridges are prone to buckling due to a lack of upper transverse connections. Taking into account geometric and material nonlinearity, a nonlinear finite-element analysis of a simply supported truss bridge was carried out to show the effects of different types of initial imperfections. A half-wave initial imperfection proved to be effective in the nonlinear buckling analysis. A parametric analysis of initial imperfections was also conducted, revealing that the upper chords have the greatest impact on buckling, followed by the bottom chords and the vertical and diagonal web members, whereas initial imperfections of the transverse beams have almost no effect on buckling. Moreover, using the influence surface method, the combined effects of initial imperfections were compared to demonstrate that initial imperfections of the upper chords play a leading role. Furthermore, the equivalent effective length coefficients of the upper chord were derived to be 0.2~0.28 by different methods, which implies that the vertical and diagonal web members still provide effective constraints for the upper chord despite the lack of upper transverse connections between the two upper chords. Therefore, the geometrical and material nonlinear finite-element method is effective in the buckling analysis due to its higher precision. Based on the nonlinear analysis and the installation deviations of members, an initial imperfection of l/500 is recommended for the nonlinear analysis of half-through truss bridges when no initial imperfection investigation is available.

Ontology and Text Mining-based Advanced Historical People Finding Service (온톨로지와 텍스트 마이닝 기반 지능형 역사인물 검색 서비스)

  • Jeong, Do-Heon;Hwang, Myunggwon;Cho, Minhee;Jung, Hanmin;Yoon, Soyoung;Kim, Kyungsun;Kim, Pyung
    • Journal of Internet Computing and Services / v.13 no.5 / pp.33-43 / 2012
  • The Semantic Web is used to construct advanced information services by exploiting semantic relationships between entities. Text mining can be applied to generate semantic relationships from unstructured data resources. In this study, we propose an ontology schema guideline, ontology instance generation, disambiguation of identical names by text mining, and an advanced historical people finding service based on reasoning. Various relationships among historical events, organizations, and people, which are created by domain experts, are linked to the literature of the National Institute of Korean History (NIKH). This improves the effectiveness of user access and enables an advanced people finding service based on relationships. To distinguish between people with the same name, we compare the structure, edges, and nodes of their personal social networks. To provide additional information, external resources including a thesaurus and the web are also linked to all related internal resources.

Recommending System of Products based on Data mining Technique (데이터 마이닝 기법을 이용한 상품 추천 시스템)

  • Jung, Min-A.;Park, Kyung-Woo;Cho, Sung-Eui
    • Journal of the Korea Institute of Information and Communication Engineering / v.10 no.3 / pp.608-613 / 2006
  • There are many e-shopping malls owing to the revitalization of e-commerce. A product recommendation system is needed to save customers time and effort. In this paper, we propose a system that applies classification, one of the data mining techniques, to the analysis of customer log data. The log data contain records of user access and product purchases. The proposed system operates in two phases. The first phase consists of a data filter module and a module that extracts associations among web pages. The second phase consists of a personalization module and a rule generation module. Customers can easily identify the recommended sites because the proposed system presents a ranking of the recommended web pages. As a result, the proposed system can recommend products to customers efficiently.

Web Document-based Associate Knowledge Extraction Method : Applying to Bioinformatics (웹 도큐먼트 기반 연관 지식 추출 기법 : 생명정보분야에의 적용)

  • 문현정;김교정
    • Journal of Internet Computing and Services / v.2 no.5 / pp.9-19 / 2001
  • In this paper, we develop an associate knowledge extraction method for automatically finding and expanding user preference knowledge from a web document database. To reflect user interests or preferences, an agent explores the example documents and extracts information relevant to the central term that carries the user's intent. To do so, we apply an association rule mining method to the extraction of relevant objects in the web documents. In addition, to assign weights to the extracted relevant information, we present an associate tag block-based weighting method. We applied the above associate knowledge extraction method to bioinformatics to find related keywords.
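The association-rule step can be illustrated with a small sketch that finds terms related to a central term using support and confidence. It is only an illustration under stated assumptions: the function name `related_terms`, the whitespace tokenization, and the thresholds are hypothetical, and the tag block-based weighting described in the abstract is not modeled.

```python
from collections import Counter


def related_terms(documents, central_term, min_support=0.5, min_confidence=0.6):
    """Find terms associated with central_term via rules central_term -> term.

    support    = share of all documents containing both terms
    confidence = share of documents with central_term that also contain term
    Returns (term, support, confidence) tuples, strongest rules first.
    """
    n = len(documents)
    # Each document becomes a set of lowercase terms (naive whitespace split).
    term_sets = [set(doc.lower().split()) for doc in documents]
    with_central = [s for s in term_sets if central_term in s]
    if not with_central:
        return []

    # Count how often every other term co-occurs with the central term.
    co_counts = Counter(
        term for s in with_central for term in s if term != central_term
    )

    rules = []
    for term, count in co_counts.items():
        support = count / n
        confidence = count / len(with_central)
        if support >= min_support and confidence >= min_confidence:
            rules.append((term, support, confidence))
    # Sort by confidence, then support, then term name for a stable order.
    return sorted(rules, key=lambda r: (-r[2], -r[1], r[0]))


if __name__ == "__main__":
    docs = [
        "gene protein expression",
        "gene protein sequence",
        "gene expression",
        "protein folding",
    ]
    for term, support, confidence in related_terms(docs, "gene"):
        print(f"gene -> {term}: support={support:.2f}, confidence={confidence:.2f}")
```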

Development of Decision Tree Program based on Web for Analyzing Clinical Information of Sasang Constitutional Medicine (사상체질 임상정보 분석을 위한 웹 기반의 의사결정 나무 프로그램 개발)

  • Jin, Hee-Jeong;Kim, Myoung-Geun;Kim, Jong-Yeol
    • Korean Journal of Oriental Medicine / v.14 no.3 / pp.81-87 / 2008
  • Sasang Constitutional Medicine (SCM) is a traditional medical theory based on constitutional medicine in Korea. It is most important that a person's SCM type is determined accurately before applying any Sasang treatment. To this end, many studies have been conducted to diagnose the SCM type using constitutional clinical data. The decision tree is a tree-structured data-mining methodology. Recently, in the Korean traditional medicine community, there have been several efforts to find diagnostic tools using the decision tree method. We therefore developed a web-based decision tree program for analyzing constitutional clinical information. It accepts various clinical data as input and offers a filtering function to select the clinical data to be used. Using this program, we can find useful factors that influence SCM types.

A Study of Main Contents Extraction from Web News Pages based on XPath Analysis

  • Sun, Bok-Keun
    • Journal of the Korea Society of Computer and Information / v.20 no.7 / pp.1-7 / 2015
  • Although data on the Internet can be used in various fields, such as a data source for IR (Information Retrieval), data mining, and knowledge information services, they contain a lot of unnecessary information. Removing the unnecessary data is a problem that must be solved before studying knowledge-based information services built on web page data. In this paper, we solve the problem through the implementation of XTractor (XPath Extractor). Since XPath is used to navigate the attribute data and the data elements in an XML document, the XPath analysis is carried out through XTractor. XTractor extracts the main text through HTML parsing, XPath grouping, and detection of the XPath that contains the main data. In the results, the recognition and precision rates were 97.9% and 93.9%, respectively, except for a few cases in a large amount of experimental data, confirming that the main text of news pages can be properly extracted.
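The three steps named in the abstract (HTML parsing, XPath grouping, and detecting the XPath that holds the main data) can be sketched roughly as follows. This is a simplified illustration and not the XTractor implementation: it uses only Python's standard `html.parser`, builds index-free tag paths rather than full XPath expressions, and assumes that the path carrying the most text is the main content. The names `PathTextCollector` and `extract_main_text` are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class PathTextCollector(HTMLParser):
    """Groups text by a simplified, index-free path of its enclosing tags."""

    VOID = {"br", "img", "hr", "meta", "link", "input"}  # tags with no end tag
    SKIP = {"script", "style"}  # text inside these is never page content

    def __init__(self):
        super().__init__()
        self.stack = []                  # currently open tags
        self.groups = defaultdict(list)  # path -> list of text fragments

    def handle_starttag(self, tag, attrs):
        if tag not in self.VOID:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed inner tags.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text and not (self.SKIP & set(self.stack)):
            self.groups["/" + "/".join(self.stack)].append(text)


def extract_main_text(html):
    """Return (path, text) for the path group holding the most characters."""
    parser = PathTextCollector()
    parser.feed(html)
    parser.close()
    if not parser.groups:
        return "", ""
    best = max(parser.groups, key=lambda p: sum(len(t) for t in parser.groups[p]))
    return best, " ".join(parser.groups[best])


if __name__ == "__main__":
    page = (
        "<html><body>"
        "<div><ul><li><a href='/'>Home</a></li><li><a href='/n'>News</a></li></ul></div>"
        "<div><p>First paragraph of the news story with enough words.</p>"
        "<p>Second paragraph continues the story in detail.</p></div>"
        "<div><span>Copyright</span></div>"
        "</body></html>"
    )
    path, text = extract_main_text(page)
    print(path)
    print(text)
```

A text-length heuristic is only one possible way to pick the main path; the abstract does not state which criterion XTractor uses.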

Profiling Green IT Leaders Quantitatively and Qualitatively

  • Kim, Yong Seog;Kwag, Seung Woog
    • Industrial Engineering and Management Systems / v.12 no.2 / pp.118-129 / 2013
  • In this study, we intend to identify key financial variables that can accurately distinguish Green IT leaders from Green IT followers. In particular, we build and compare single and meta-classifiers to identify the relationship between environmental performance and financial performance, while focusing on selecting and interpreting a final prediction model with a smaller set of financial performance indicators. Our experimental results demonstrate that several key variables representing the size, financial resources, operational efficiency, and risk-taking tendency of an organization can successfully identify Green IT leaders with approximately 90% accuracy. In addition, we find that Green IT leaders show a higher utilization rate of Web pages as a green marketing channel than Green IT followers, while the two groups share common layouts of Web publication for building green IT brands, with some differences.

Biotea-2-Bioschemas, facilitating structured markup for semantically annotated scholarly publications

  • Garcia, Leyla;Giraldo, Olga;Garcia, Alexander;Rebholz-Schuhmann, Dietrich
    • Genomics & Informatics / v.17 no.2 / pp.14.1-14.6 / 2019
  • The total number of scholarly publications grows day by day, making it necessary to explore and use simple yet effective ways to expose their metadata. Schema.org supports adding structured metadata to web pages via markup, making it easier for data providers but also for search engines to provide the right search results. Bioschemas is based on the standards of schema.org, providing new types, properties and guidelines for metadata, i.e., providing metadata profiles tailored to the Life Sciences domain. Here we present our proposed contribution to Bioschemas (from the project "Biotea"), which supports metadata contributions for scholarly publications via profiles and web components. Biotea comprises a semantic model to represent publications together with annotated elements recognized from the scientific text; our Biotea model has been mapped to schema.org following Bioschemas standards.
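As a rough illustration of the structured markup discussed in this abstract, the sketch below builds a JSON-LD description of a publication with plain schema.org vocabulary and wraps it in a script tag for embedding in a web page. It is an assumption-laden example: it uses the generic schema.org `ScholarlyArticle` type rather than an actual Bioschemas profile or the Biotea model, and the helper names `build_article_jsonld` and `to_script_tag` are hypothetical.

```python
import json


def build_article_jsonld(title, authors, journal, year):
    """Describe a publication as a schema.org ScholarlyArticle (JSON-LD dict)."""
    return {
        "@context": "https://schema.org",
        "@type": "ScholarlyArticle",
        "headline": title,
        "author": [{"@type": "Person", "name": name} for name in authors],
        "isPartOf": {"@type": "Periodical", "name": journal},
        "datePublished": str(year),
    }


def to_script_tag(data):
    """Wrap a JSON-LD dict in the script element used to embed it in HTML."""
    body = json.dumps(data, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"


if __name__ == "__main__":
    # Values taken from the citation in this listing.
    markup = build_article_jsonld(
        title="Biotea-2-Bioschemas, facilitating structured markup for "
              "semantically annotated scholarly publications",
        authors=["Leyla Garcia", "Olga Giraldo", "Alexander Garcia",
                 "Dietrich Rebholz-Schuhmann"],
        journal="Genomics & Informatics",
        year=2019,
    )
    print(to_script_tag(markup))
```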