• Title/Summary/Keyword: Mining sites

Search Result 259, Processing Time 0.027 seconds

Discovering News Keyword Associations Using Association Rule Mining (연관규칙 마이닝을 활용한 뉴스기사 키워드의 연관성 탐사)

  • Kim, Han-Joon;Chang, Jae-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.6
    • /
    • pp.63-71
    • /
    • 2011
  • The current Web portal sites provide significant keywords with high popularity or importance; specifically, user-friendly services such as tag clouds and associated word search are provided. However, in general, since news articles are classified only with their date and categories, it is not easy for users to find other articles related to some articles while reading news articles classified with categories. And the conventional associated keyword service has not satisfied users sufficiently because it depends only upon user queries. This paper proposes a way of searching news articles by utilizing the keywords tightly associated with users' queries. Basically, the proposed method discovers a set of keyword association patterns by using the association rule mining technique that extracts association patterns for keywords by focusing upon sentences containing some keywords. The method enables users to navigate the space of associated keywords hidden in large news articles.

Mining Association Rules from the Web Access Log of an Online News website (온라인 뉴스 웹사이트의 로그를 이용한 연관규칙 발견에 관한 연구)

  • Hwang, Hyunseok;Yoo, Keedong
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.18 no.2
    • /
    • pp.47-57
    • /
    • 2013
  • Today a lot of functional areas of a firm are operated on the Web. Online shopping malls analyze web log recording customers' activities on the web to connect them to business outcomes. Not only commercial websites, but online news sites also need to collect and analyze web logs to understand their news readers' interest. However, little research has been performed yet. In this research we mined the web access log of an online news website and conduct Market Basket Analysis to uncover the association rules among the categories of news articles. The research is composed of two stages: 1) Identifying the individual session of a visitor; 2) Mining association rule from news articles read by each session. We gather 7-day access logs two times. The results of log mining and meanings of association rules are suggested with managerial implications in conclusion section.

An Analysis of Research Trends in Computational Thinking using Text Mining Technique (텍스트 마이닝 기법을 활용한 컴퓨팅 사고력 연구 동향 분석)

  • Lee, Jaeho;Jang, Junhyung
    • Journal of The Korean Association of Information Education
    • /
    • v.23 no.6
    • /
    • pp.543-550
    • /
    • 2019
  • In 2006, Janet Wing defined computational thinking and operated SW education as a formal curriculum in the UK in 2013. This study collected related research papers by using computational thinking, which has recently increased in importance, and analyzed it using text mining. In the first, CONCOR analysis was conducted with the keyword of computational thinking. In the second, text mining of the components of computational thinking was selected by the repr23esentative academic journals at domestic and foreign. As a result of the two-time analysis, first, abstraction, algorithm, data processing, problem decomposition, and pattern recognition were the core of the study of computational thinking component. Second, research on convergence education centered on computational thinking and science and mathematics subjects was actively conducted. Third, research on computational thinking has been expanding since 2010. Research and development of the classification and definition of computational thinking and components and applying them to education sites should be conducted steadily.

An Efficient Algorithm for Mining Interactive Communication Sequence Patterns (대화형 통신 순서열 패턴의 마이닝을 위한 효율적인 알고리즘)

  • Haam, Deok-Min;Song, Ji-Hwan;Kim, Myoung-Ho
    • Journal of KIISE:Databases
    • /
    • v.36 no.3
    • /
    • pp.169-179
    • /
    • 2009
  • Communication log data consist of communication events such as sending and receiving e-mail or instance message and visiting web sites, etc. Many countries including USA and EU enforce the retention of these data on the communication service providers for the purpose of investigating or detecting criminals through the Internet. Because size of the retained data is very large, the efficient method for extracting valuable information from the data is needed for Law Enforcement Authorities to use the retained data. This paper defines the Interactive Communication Sequence Patterns(ICSPs) that is the important information when each communication event in communication log data consists of sender, receiver, and timestamp of this event. We also define a Mining(FDICSP) problem to discover such patterns and propose a method called Fast Discovering Interactive Communication Sequence Pattern(FDICSP) to solve this problem. FDICSP focuses on the characteristics of ICS to reduce the search space when it finds longer sequences by using shorter sequences. Thus, FDICSP can find Interactive Communication Sequence Patterns efficiently.

Comparative Analysis of Work-Life Balance Issues between Korea and the United States (워라밸 이슈 비교 분석: 한국과 미국)

  • Lee, So-Hyun;Kim, Minsu;Kim, Hee-Woong
    • The Journal of Information Systems
    • /
    • v.28 no.2
    • /
    • pp.153-179
    • /
    • 2019
  • Purpose This study collects the issues about work-life balance in Korea and United States and suggests the specific plans for work-life balance by the comparison and analysis. The objective of this study is to contribute to the improvement of people's life quality by understanding the concept of work-life balance that has become the issue recently and offering the detailed plans to be considered in respect of individual, corporate and governmental level for society of work-life balance. Design/methodology/approach This study collects work-life balance related issues through recruit sites in Korea and United States, compares and analyzes the collected data from the results of three text mining techniques such as LDA topic modeling, term frequency analysis and keyword extraction analysis. Findings According to the text mining results, this study shows that it is important to build corporate culture that support work-life balance in free organizational atmosphere especially in Korea. It also appears that there are the differences against whether work-life balance can be achieved and recognition and satisfaction about work-life balance along type of company or sort of working. In case of United States, it shows that it is important for them to work more efficiently by raising teamwork level among team members who work together as well as the role of the leaders who lead the teams in the organization. It is also significant for the company to provide their employees with the opportunity of education and training that enables them to improve their individual capability or skill. Furthermore, it suggests the roles of individuals, company and government and specific plans based on the analysis of text mining results in both countries.

Effects of mining activities on Nano-soil management using artificial intelligence models of ANN and ELM

  • Liu, Qi;Peng, Kang;Zeng, Jie;Marzouki, Riadh;Majdi, Ali;Jan, Amin;Salameh, Anas A.;Assilzadeh, Hamid
    • Advances in nano research
    • /
    • v.12 no.6
    • /
    • pp.549-566
    • /
    • 2022
  • Mining of ore minerals (sfalerite, cinnabar, and chalcopyrite) from the old mine has led in significant environmental effects as contamination of soils and plants and acidification of water. Also, nanoparticles (NP) have obtained global importance because of their widespread usage in daily life, unique properties, and rapid development in the field of nanotechnology. Regarding their usage in various fields, it is suggested that soil is the final environmental sink for NPs. Nanoparticles with excessive reactivity and deliverability may be carried out as amendments to enhance soil quality, mitigate soil contaminations, make certain secure land-software of the traditional change substances and enhance soil erosion control. Meanwhile, there's no record on the usage of Nano superior substances for mine soil reclamation. In this study, five soil specimens have been tested at 4 sites inside the region of mine (<100 m) to study zeolites, and iron sulfide nanoparticles. Also, through using Artificial Neural Network (ANN) and Extreme Learning Machine (ELM), this study has tried to appropriately estimate the mechanical properties of soil under the effect of these Nano particles. Considering the RMSE and R2 values, Zeolite Nano materials could enhance the mine soil fine through increasing the clay-silt fractions, increasing the water holding capacity, removing toxins and improving nutrient levels. Also, adding iron sulfide minerals to the soils would possibly exacerbate the soil acidity problems at a mining site.

Comparison of Carbon Storage between Forest Restoration of Abandoned Coal Mine and Natural Vegetation Lands (폐탄광 산림복원지와 자연식생지의 탄소저장량 비교)

  • Kim, So-Jin;Jung, Yu-Gyeong;Park, Ki-Hyung;Kim, Ju-Eun;Bae, Jeong-Hyeon;Kang, Won-Seok
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.26 no.5
    • /
    • pp.33-46
    • /
    • 2023
  • In this study, carbon storage in the aboveground biomass, litter layer, and soil layer was calculated for abandoned mining restoration areas to determine the level of carbon storage after the restoration project through comparison with the ecological reference. Five survey sites were selected for each abandoned mining restoration area in Boryeong-si, Chungcheongnam-do, and the ecological reference that can be a goal and model for the restoration project. The carbon storage in the restoration area was 0~21.3Mg C ha-1, the deciduous layer 3.3~6.0Mg C ha-1, and the soil layer(0-30cm) 8.3~35.1Mg C ha-1, showing a significant difference in carbon storage by target site. The total carbon storage was between 6.1 and 35.3% of the ecological reference, with restoration area ranging from 14.0 to 62.4 Mg C ha-1. The total carbon storage in the restoration area and the ecological reference differed the most in the aboveground biomass and was less than 12%. Based on these results, forest restoration area need to improve the carbon storage of forests through continuous management and monitoring so trees can grow and restore productivity in the early stages of the restoration project. The results of this study can be used as primary data for preparing future forest restoration indicators by identifying the storage of abandoned mining restoration areas.

Efficient Mining of Frequent Subgraph with Connectivity Constraint

  • Moon, Hyun-S.;Lee, Kwang-H.;Lee, Do-Heon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.267-271
    • /
    • 2005
  • The goal of data mining is to extract new and useful knowledge from large scale datasets. As the amount of available data grows explosively, it became vitally important to develop faster data mining algorithms for various types of data. Recently, an interest in developing data mining algorithms that operate on graphs has been increased. Especially, mining frequent patterns from structured data such as graphs has been concerned by many research groups. A graph is a highly adaptable representation scheme that used in many domains including chemistry, bioinformatics and physics. For example, the chemical structure of a given substance can be modelled by an undirected labelled graph in which each node corresponds to an atom and each edge corresponds to a chemical bond between atoms. Internet can also be modelled as a directed graph in which each node corresponds to an web site and each edge corresponds to a hypertext link between web sites. Notably in bioinformatics area, various kinds of newly discovered data such as gene regulation networks or protein interaction networks could be modelled as graphs. There have been a number of attempts to find useful knowledge from these graph structured data. One of the most powerful analysis tool for graph structured data is frequent subgraph analysis. Recurring patterns in graph data can provide incomparable insights into that graph data. However, to find recurring subgraphs is extremely expensive in computational side. At the core of the problem, there are two computationally challenging problems. 1) Subgraph isomorphism and 2) Enumeration of subgraphs. Problems related to the former are subgraph isomorphism problem (Is graph A contains graph B?) and graph isomorphism problem(Are two graphs A and B the same or not?). Even these simplified versions of the subgraph mining problem are known to be NP-complete or Polymorphism-complete and no polynomial time algorithm has been existed so far. The later is also a difficult problem. We should generate all of 2$^n$ subgraphs if there is no constraint where n is the number of vertices of the input graph. In order to find frequent subgraphs from larger graph database, it is essential to give appropriate constraint to the subgraphs to find. Most of the current approaches are focus on the frequencies of a subgraph: the higher the frequency of a graph is, the more attentions should be given to that graph. Recently, several algorithms which use level by level approaches to find frequent subgraphs have been developed. Some of the recently emerging applications suggest that other constraints such as connectivity also could be useful in mining subgraphs : more strongly connected parts of a graph are more informative. If we restrict the set of subgraphs to mine to more strongly connected parts, its computational complexity could be decreased significantly. In this paper, we present an efficient algorithm to mine frequent subgraphs that are more strongly connected. Experimental study shows that the algorithm is scaling to larger graphs which have more than ten thousand vertices.

  • PDF

3-dimensional Modeling and Mining Analysis for Open-pit Limestone Mine Stope Using a Rotary-wing Unmanned Aerial Vehicle (회전익 무인항공기를 이용한 노천석회석광산 채굴장 3차원 모델링 및 채굴량 분석)

  • Kang, Seong-Seung;Lee, Geon-Ju;Noh, Jeongdu;Jang, Hyeongdoo;Kim, Sun-Myung;Ko, Chin-Surk
    • The Journal of Engineering Geology
    • /
    • v.28 no.4
    • /
    • pp.701-714
    • /
    • 2018
  • The purpose of this study is to show the possibility of 3-dimensional modeling of open-pit limestone mine by using a rotary-wing unmanned aerial vehicle, a drone, and to estimate the amount of mining before and after mining of limestone by explosive blasting. Analysis of the image duplication of the mine has shown that it is possible to achieve high image quality. Analysis of each axis error at the shooting position after analyzing the distortions through camera calibration was shown the allowable range. As a result of estimating the amount of mining before and after explosive blasting, it was possible to estimate the amount of mining of a wide range quickly and accurately in a relatively short time. In conclusion, it is considered that the drone of a rotary-wing unmanned aerial vehicle can be usefully used for the monitoring of open-pit limestone mines and the estimation of the amount of mining. Furthermore, it is expected that this method will be utilized for periodic monitoring of construction sites and road slopes as well as open-pit mines in the future.

Recolonization of benthic macroinvertebrates after anthropogenic disturbance in natural streams, South Korea

  • Chun, Seung-Phil;Chon, Seung-Hoon;Lee, Seung-Oh;Im, Jang-Hyuk;Lee, Woo-Kyun;Kim, Myoung-Chul
    • Korean Journal of Environment and Ecology
    • /
    • v.29 no.2
    • /
    • pp.228-235
    • /
    • 2015
  • Stream ecosystems are closely related to many human activities. Therefore, streams are affected by anthropogenic disturbances such as riverine development and gravel-mining as well as deterioration of water quality. The goal of this study was to elucidate the recolonization process of the macroinvertebrate community after a small-scale anthropogenic disturbance. Field studies were conducted at three sites in a natural stream. The number of recolonizing species tended to increase slightly over time, exceeding the total species number of the control. Ephemeroptera contributed the most to shaping the recolonizing pattern of the entire community. From the result of changes in dominant species, the early recolonizers of each site were the species that showed more frequent occurrence particulary at each sites. But the late recolonizers are Chironomidae at all the sites commonly. This result implies that the actual differences exist among the recolonizing trends of each benthic macroinvertebrate taxon. Collector-gatherers and scrapers comprised about 70% of the recolonizing species. These results indicate that the recolonizing process of an aquatic community after an artificial disturbance depends on the environmental conditions(particularly substratum composition or organic pollution) of the habitat.