• Title/Summary/Keyword: User logs analysis

Search Result 66, Processing Time 0.026 seconds

An Analysis of Library User and Circulation Status based on Bigdata Logs A Case Study of National Library of Korea, Sejong (빅데이터 로그 기반 도서관 이용자 및 대출 현황 분석 - 국립세종도서관을 중심으로 -)

  • Kim, Tae-Young;Baek, Ji-Yeon;Oh, Hyo Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.49 no.2
    • /
    • pp.357-388
    • /
    • 2018
  • This study aims to analyze library user and circulation status based on the bigdata logs to identify characteristics by user group and propose methods for efficient management of library. The logs to be analyzed consist of user information, circulation information, service usage information registered at the National Library of Korea, Sejong. The user information logs contain 107,369 age data, 106,918 gender data, 106,838 residential data. The circulation information logs contain 536,083 circulation user data, 6,509,369 circulation count data, and the service usage information logs contain 82,813 data. For the analysis of characteristics by user group, the data were used for analyzing user status by age, gender, residence and circulation status by year, month, day. In addition, this study conducts FGI(Focus Group Interview) and linkage analysis with external data to identify factors for analysis results. Based on analysis results, improvement methods for helping library make effective decision-making were proposed. This study analyze empirically user and circulation status based on bigdata logs, and it has significance for being different form proceeding researches with less analysis data.

User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis (다계층 이원 네트워크를 활용한 사용자 관점의 이슈 클러스터링)

  • Kim, Jieun;Kim, Namgyu;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.93-107
    • /
    • 2014
  • In this paper, we report what we have observed with regard to user-perspective issue clustering based on multi-layered two-mode network analysis. This work is significant in the context of data collection by companies about customer needs. Most companies have failed to uncover such needs for products or services properly in terms of demographic data such as age, income levels, and purchase history. Because of excessive reliance on limited internal data, most recommendation systems do not provide decision makers with appropriate business information for current business circumstances. However, part of the problem is the increasing regulation of personal data gathering and privacy. This makes demographic or transaction data collection more difficult, and is a significant hurdle for traditional recommendation approaches because these systems demand a great deal of personal data or transaction logs. Our motivation for presenting this paper to academia is our strong belief, and evidence, that most customers' requirements for products can be effectively and efficiently analyzed from unstructured textual data such as Internet news text. In order to derive users' requirements from textual data obtained online, the proposed approach in this paper attempts to construct double two-mode networks, such as a user-news network and news-issue network, and to integrate these into one quasi-network as the input for issue clustering. One of the contributions of this research is the development of a methodology utilizing enormous amounts of unstructured textual data for user-oriented issue clustering by leveraging existing text mining and social network analysis. In order to build multi-layered two-mode networks of news logs, we need some tools such as text mining and topic analysis. We used not only SAS Enterprise Miner 12.1, which provides a text miner module and cluster module for textual data analysis, but also NetMiner 4 for network visualization and analysis. Our approach for user-perspective issue clustering is composed of six main phases: crawling, topic analysis, access pattern analysis, network merging, network conversion, and clustering. In the first phase, we collect visit logs for news sites by crawler. After gathering unstructured news article data, the topic analysis phase extracts issues from each news article in order to build an article-news network. For simplicity, 100 topics are extracted from 13,652 articles. In the third phase, a user-article network is constructed with access patterns derived from web transaction logs. The double two-mode networks are then merged into a quasi-network of user-issue. Finally, in the user-oriented issue-clustering phase, we classify issues through structural equivalence, and compare these with the clustering results from statistical tools and network analysis. An experiment with a large dataset was performed to build a multi-layer two-mode network. After that, we compared the results of issue clustering from SAS with that of network analysis. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The sample dataset contains 150 million transaction logs and 13,652 news articles of 5,000 panels over one year. User-article and article-issue networks are constructed and merged into a user-issue quasi-network using Netminer. Our issue-clustering results applied the Partitioning Around Medoids (PAM) algorithm and Multidimensional Scaling (MDS), and are consistent with the results from SAS clustering. In spite of extensive efforts to provide user information with recommendation systems, most projects are successful only when companies have sufficient data about users and transactions. Our proposed methodology, user-perspective issue clustering, can provide practical support to decision-making in companies because it enhances user-related data from unstructured textual data. To overcome the problem of insufficient data from traditional approaches, our methodology infers customers' real interests by utilizing web transaction logs. In addition, we suggest topic analysis and issue clustering as a practical means of issue identification.

An Optimized User Behavior Prediction Model Using Genetic Algorithm On Mobile Web Structure

  • Hussan, M.I. Thariq;Kalaavathi, B.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.5
    • /
    • pp.1963-1978
    • /
    • 2015
  • With the advancement of mobile web environments, identification and analysis of the user behavior play a significant role and remains a challenging task to implement with variations observed in the model. This paper presents an efficient method for mining optimized user behavior prediction model using genetic algorithm on mobile web structure. The framework of optimized user behavior prediction model integrates the temporary and permanent register information and is stored immediately in the form of integrated logs which have higher precision and minimize the time for determining user behavior. Then by applying the temporal characteristics, suitable time interval table is obtained by segmenting the logs. The suitable time interval table that split the huge data logs is obtained using genetic algorithm. Existing cluster based temporal mobile sequential arrangement provide efficiency without bringing down the accuracy but compromise precision during the prediction of user behavior. To efficiently discover the mobile users' behavior, prediction model is associated with region and requested services, a method called optimized user behavior Prediction Model using Genetic Algorithm (PM-GA) on mobile web structure is introduced. This paper also provides a technique called MAA during the increase in the number of models related to the region and requested services are observed. Based on our analysis, we content that PM-GA provides improved performance in terms of precision, number of mobile models generated, execution time and increasing the prediction accuracy. Experiments are conducted with different parameter on real dataset in mobile web environment. Analytical and empirical result offers an efficient and effective mining and prediction of user behavior prediction model on mobile web structure.

Research on Efficient Live Evidence Analysis System Based on User Activity Using Android Logging System (안드로이드 로그 시스템을 이용한 효율적인 사용자 행위기반 라이브 증거수집 및 분석 시스템 연구)

  • Hong, Il-Young;Lee, Sang-Jin
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.22 no.1
    • /
    • pp.67-80
    • /
    • 2012
  • Recently as the number of smartphone user is growing rapidly, android is also getting more interest in digital forensic. However, there is not enough research on digital data acquisition and analysis based on android platform's unique characteristics so far. Android system stores all the related recent systemwide logs from the system components to applications in volatile memory, and therefore, the logs can potentially serve as important evidences. In this paper, we propose a digital data acquisition and analysis system for android which extracts meaningful information based on the correlation of android logs and user activities from a device at runtime. We also present an efficient search scheme to facilitate realtime analysis on site. Finally, we demonstrate how the proposed system can be used to reconstruct the sequence of user activities in a more intuitive manner, and show that the proposed search scheme can reduce overall search and analysis time approximately 10 times shorter than the normal regular search method.

A Study on the Analysis of Current Status and Improvements of the Children and Youth Services in the Library based on Bigdata: - A Case Study of National Library of Korea, Sejong - (빅데이터 기반 도서관 어린이청소년서비스 현황분석 및 개선방안 - 국립세종도서관을 중심으로 -)

  • Baek, Ji-Yeon;Kim, Tae-Young;Yang, Dongmin;Oh, Hyo-Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.49 no.4
    • /
    • pp.295-328
    • /
    • 2018
  • This study aims to analyze circulation status of children's material and participation in culture program based on the bigdata to identify the current status of children and youth services and suggest ways to improve the services. The logs to be analyzed consist of children's material information, circulation count information, circulation user information registered at the National Library of Korea, Sejong. The children's material information logs contain 77,297 data, circulation count information logs contain 4,160,484 data, circulation user information logs contain 189,060 data The current status analysis of children and youth services was conducted in various ways, including analysis of circulation status and culture program by subject, age, and residential area. Based on analysis results, improvement methods of children and youth services were proposed in terms of books, users and residences. This study analyze empirically current status of children and youth services based on bigdata logs, and it has significance for being different form proceeding researches. We expect this study to be used as an empirical basis for the establishment of operational strategies in the future.

An Analysis of Library Culture Program Management based on Users' Participation Logs: A Case Study of National Library of Korea, Sejong (이용자 참여 로그 기반 도서관 문화프로그램 운영현황 분석: 국립세종도서관 사례를 통해)

  • Choi, Doo-Won;Gang, Ju-Yeon;Yang, Dongmin;Lee, Hyunju;Oh, Hyo-Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.49 no.1
    • /
    • pp.293-320
    • /
    • 2018
  • This study aims to analyze library culture-program management and propose methods for improving the program management. To achieve this research goal, this study examines current state of culture-program management and user's participation by analyzing users' participation logs of the National Library of Korea, Sejong. The users' participation logs have been accumulated from May, 2015, to December, 2017, and the logs contain 722 program data, 24,816 program participation data, and 6,729 users data. The data were used for analyzing of program management, users' characteristics, and changes based on timelines. Based on the analysis results, culture-program management problems were pointed out and future improvement methods for solving the problems were proposed. This study analyzes culture-program management in view of users using real users' participation logs, and it has significance for being different from preceding researches focusing on culture-program providers.

Analysis of Behavior Patterns from Human and Web Crawler Events Log on ScienceON (ScienceON 웹 로그에 대한 인간 및 웹 크롤러 행위 패턴 분석)

  • Poositaporn, Athiruj;Jung, Hanmin;Park, Jung Hoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.6-8
    • /
    • 2022
  • Web log analysis is one of the essential procedures for service improvement. ScienceON is a representative information service that provides various S&T literature and information, and we analyze its logs for continuous improvement. This study aims to analyze ScienceON web logs recorded in May 2020 and May 2021, dividing them into humans and web crawlers and performing an in-depth analysis. First, only web logs corresponding to S (search), V (detail view), and D (download) types are extracted and normalized to 658,407 and 8,727,042 records for each period. Second, using the Python 'user_agents' library, the logs are classified into humans and web crawlers, and third, the session size was set to 60 seconds, and each session is analyzed. We found that web crawlers, unlike humans, show relatively long for the average behavior pattern per session, and the behavior patterns are mainly for V patterns. As the future, the service will be improved to quickly detect and respond to web crawlers and respond to the behavioral patterns of human users.

  • PDF

Information Seeking Behavior of the NAVER Users via Query Log Analysis (질의 로그 분석을 통한 네이버 이용자의 검색 형태 연구)

  • Lee, Joon-Ho;Park, So-Yeon;Kwon, Hyuk-Sung
    • Journal of the Korean Society for information Management
    • /
    • v.20 no.2
    • /
    • pp.27-41
    • /
    • 2003
  • Query logs are online records that capture user interactions with information retrieval systems and all the search processes. Query log analysis offers ad advantage of providing reasonable and unobtrusive means of collecting search information from a large number of users. In this paper, query logs of NAVER, a major Korean Internet search service, were analyzed to investigate the information seeking behabior of NAVER users. The query logs were collected over one week from various collecions such as comprehensive search, directory search and web ducument searc. It is expected that this study could contribute to the development and implementation of more effective web search systems and services.

User Participation Evaluation of A Scholarly Information Site (학술정보사이트의 이용자 참여형 평가)

  • Park, Min-Soo
    • Journal of the Korean Society for information Management
    • /
    • v.28 no.4
    • /
    • pp.85-97
    • /
    • 2011
  • The purpose of this study was to develop a methodology of user participation evaluation of a scholarly information site in the field of science and technology and to enhance the site by applying a set of testing protocols. Experiments were conducted in a laboratory setting. Data from multiple sources, including eyetracking, search logs and post surveys, were collected and analyzed quantitatively. Based on the results of eyetracking, the contents and images were reorganized after removing unessential elements in the site. The resulting data from the search logs showed that the users were able to finish the tasks more quickly with the revised user interface. The results of the data analysis of post surveys indicated an overall improvement in the revised website compared to the original one.

Using Transaction Logs to Better Understand User Search Session Patterns in an Image-based Digital Library (이미지 기반 디지털 도서관에서 이용자 검색 패턴의 효과적 이해를 위한 트랜잭션 로그 데이터 분석)

  • Han, Hye-Jung;Joo, Soohyung;Wolfram, Dietmar
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.25 no.1
    • /
    • pp.19-37
    • /
    • 2014
  • Server transaction logs containing complete click-through data from a digital library of primarily image-based documents were analyzed to better understand user search session behavior. One month of data was analyzed using descriptive statistics and network analysis methods. The findings reveal iterative search behaviors centered on result views and evaluation and topical areas of focus for the search sessions. The study is novel in its combined analytical techniques and use of click-through data for image collections.