• Title/Summary/Keyword: Automated analysis system

Search Result 846, Processing Time 0.032 seconds

Social Network Analysis for the Effective Adoption of Recommender Systems (추천시스템의 효과적 도입을 위한 소셜네트워크 분석)

  • Park, Jong-Hak;Cho, Yoon-Ho
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.305-316
    • /
    • 2011
  • Recommender system is the system which, by using automated information filtering technology, recommends products or services to the customers who are likely to be interested in. Those systems are widely used in many different Web retailers such as Amazon.com, Netfix.com, and CDNow.com. Various recommender systems have been developed. Among them, Collaborative Filtering (CF) has been known as the most successful and commonly used approach. CF identifies customers whose tastes are similar to those of a given customer, and recommends items those customers have liked in the past. Numerous CF algorithms have been developed to increase the performance of recommender systems. However, the relative performances of CF algorithms are known to be domain and data dependent. It is very time-consuming and expensive to implement and launce a CF recommender system, and also the system unsuited for the given domain provides customers with poor quality recommendations that make them easily annoyed. Therefore, predicting in advance whether the performance of CF recommender system is acceptable or not is practically important and needed. In this study, we propose a decision making guideline which helps decide whether CF is adoptable for a given application with certain transaction data characteristics. Several previous studies reported that sparsity, gray sheep, cold-start, coverage, and serendipity could affect the performance of CF, but the theoretical and empirical justification of such factors is lacking. Recently there are many studies paying attention to Social Network Analysis (SNA) as a method to analyze social relationships among people. SNA is a method to measure and visualize the linkage structure and status focusing on interaction among objects within communication group. CF analyzes the similarity among previous ratings or purchases of each customer, finds the relationships among the customers who have similarities, and then uses the relationships for recommendations. Thus CF can be modeled as a social network in which customers are nodes and purchase relationships between customers are links. Under the assumption that SNA could facilitate an exploration of the topological properties of the network structure that are implicit in transaction data for CF recommendations, we focus on density, clustering coefficient, and centralization which are ones of the most commonly used measures to capture topological properties of the social network structure. While network density, expressed as a proportion of the maximum possible number of links, captures the density of the whole network, the clustering coefficient captures the degree to which the overall network contains localized pockets of dense connectivity. Centralization reflects the extent to which connections are concentrated in a small number of nodes rather than distributed equally among all nodes. We explore how these SNA measures affect the performance of CF performance and how they interact to each other. Our experiments used sales transaction data from H department store, one of the well?known department stores in Korea. Total 396 data set were sampled to construct various types of social networks. The dependant variable measuring process consists of three steps; analysis of customer similarities, construction of a social network, and analysis of social network patterns. We used UCINET 6.0 for SNA. The experiments conducted the 3-way ANOVA which employs three SNA measures as dependant variables, and the recommendation accuracy measured by F1-measure as an independent variable. The experiments report that 1) each of three SNA measures affects the recommendation accuracy, 2) the density's effect to the performance overrides those of clustering coefficient and centralization (i.e., CF adoption is not a good decision if the density is low), and 3) however though the density is low, the performance of CF is comparatively good when the clustering coefficient is low. We expect that these experiment results help firms decide whether CF recommender system is adoptable for their business domain with certain transaction data characteristics.

Development of an Automated Gangform Climbing System for Apartment Housing Construction - Structural Stability and Tower Crane Lifting Load Analysis - (공동주택 전용 갱폼 인양 자동화 기술의 개발 - 구조적 안정성 및 타워크레인 양중부하 분석 -)

  • Lee, Jeong-Ho;Yang, Sang-Hoon;Kim, Young-Suk
    • Korean Journal of Construction Engineering and Management
    • /
    • v.13 no.4
    • /
    • pp.48-59
    • /
    • 2012
  • Gangform, compared to the traditional forms, is a systemized form which can reduce construction duration and cost by the advantage of using it repeatedly. However, transportation and climbing process of the Gangform is highly dependant on the performance of tower crane. Gangform climbing process takes one day out of six to seven days of a structural work cycle. Tower cranes can not be used in other lifting works when they lift the Gangform during the structural work cycle, causing the delay in the construction project. Numerous efforts and researches have been done in domestic and international industry to solve such limitations of Gangform climbing process. Especially, "A Study on the Development of Automatic Gangform Climbing System for Apartment Housing Construction"has suggested a conceptual model which can climb the Gangform system without a tower crane. In this paper, the technical and economical feasibilities of previously proposed Automatic Gangform climbing system are examined by evaluating its structural stability and lifting load reduction effect.

A Study of Big data-based Machine Learning Techniques for Wheel and Bearing Fault Diagnosis (차륜 및 차축베어링 고장진단을 위한 빅데이터 기반 머신러닝 기법 연구)

  • Jung, Hoon;Park, Moonsung
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.1
    • /
    • pp.75-84
    • /
    • 2018
  • Increasing the operation rate of components and stabilizing the operation through timely management of the core parts are crucial for improving the efficiency of the railroad maintenance industry. The demand for diagnosis technology to assess the condition of rolling stock components, which employs history management and automated big data analysis, has increased to satisfy both aspects of increasing reliability and reducing the maintenance cost of the core components to cope with the trend of rapid maintenance. This study developed a big data platform-based system to manage the rolling stock component condition to acquire, process, and analyze the big data generated at onboard and wayside devices of railroad cars in real time. The system can monitor the conditions of the railroad car component and system resources in real time. The study also proposed a machine learning technique that enabled the distributed and parallel processing of the acquired big data and automatic component fault diagnosis. The test, which used the virtual instance generation system of the Amazon Web Service, proved that the algorithm applying the distributed and parallel technology decreased the runtime and confirmed the fault diagnosis model utilizing the random forest machine learning for predicting the condition of the bearing and wheel parts with 83% accuracy.

Development of an Edge-based Point Correlation Algorithm Avoiding Full Point Search in Visual Inspection System (전탐색 회피에 의한 고속 에지기반 점 상관 알고리즘의 개발)

  • Kang, Dong-Joong;Kim, Mun-Jo;Kim, Min-Sung;Lee, Eung-Joo
    • The KIPS Transactions:PartB
    • /
    • v.11B no.3
    • /
    • pp.327-336
    • /
    • 2004
  • For visual inspection system in real industrial environment, it is one of most important tasks to design fast and stable pattern matching algorithm. This paper presents an edge-based point correlation algorithm avoiding full search in visual inspection system. Conventional algorithms based on NGC(normalized gray-level correlation) have to overcome some difficulties for applying to automated inspection system in factory environment. First of all, NGC algorithms need high time complexity and thus high performance hardware to satisfy real-time process. In addition, lighting condition in realistic factory environments if not stable and therefore intensity variation from uncontrolled lights gives many roubles for applying directly NGC as pattern matching algorithm in this paper, we propose an algorithm to solve these problems from using thinned and binarized edge data and skipping full point search with edge-map analysis. A point correlation algorithm with the thinned edges is introduced with image pyramid technique to reduce the time complexity. Matching edges instead of using original gray-level pixel data overcomes NGC problems and pyramid of edges also provides fast and stable processing. All proposed methods are preyed from experiments using real images.

Analysis of Meteorological Factors on Yield of Chinese Cabbage and Radish in Winter Cropping System (월동작형 배추와 무의 생산량에 영향을 미치는 기상요인 분석)

  • Kim, In-Gyum;Park, Ki-Jun;Kim, Baek-Jo
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.15 no.2
    • /
    • pp.59-66
    • /
    • 2013
  • Among many factors, especially meteorological conditions can impact agricultural productivities. This study was conducted to analyze the relationships between crop yield and meteorological factors. We collected meteorological data (i.e., temperature and precipitation) from the Automated Weather System (AWS) of Korea Meteorological Administration (KMA) and the yield data of Chinese cabbage and Radish from local Nonghyup (NCAF:National Agricultural Cooperative Federation) and Farmers' Corporate Association. The agricultural data were classified into two groups. These groups are comprised of the farmers who produced a crop under 30 kg per $3.3m^2$ and over 30k g per $3.3m^2$ respectively. The daily meteorological data were calculated from the average value for ten days. Based on the regression analysis, we concluded that the yield of Chinese cabbage (Haenam) was related to average temperature, minimum temperature, precipitation, and number of days with precipitation, whereas that of Radish (Jeju) was related to average temperature, maximum temperature, and minimum temperature. The result suggests that these meteorological data can be used more effectively for the prediction of crop yield.

The Meaning and Usefulness of Simulation Method for Business Process Reengineering -Focused on the Korean Supreme Court BPR Project (1994-2003)-

  • Hong, Sung-wan;Roh, Tae-hoon;Kang, Sung-min;Lee, Jung-woo;Kang, Ga-na
    • Proceedings of the Korea Society for Simulation Conference
    • /
    • 2001.10a
    • /
    • pp.170-202
    • /
    • 2001
  • Simulation is used to reduce a risk involved in the new project and decision-making in an organization and to save cost and time by forecasting different situations. The objectives of this research are to acknowledge the need of simulation through the real life sample and to encourage the use of the simulation method in the future consulting project by continuously making the necessary improvements. This research analyzed the effectiveness of the simulation based on the sample use of simulation method in 1994 and 1997 for the BPR project of certification issuance process at the Supreme Court. In order to evaluate the value of the proposed simulation model, we examined the gap, which existed between the simulation result and the operational data collected by visiting the actual sites where AROS (Automated Registry Office System: automation system developed by LG-EDS Systems) is being utilized. We also identified the causes for the existing gap. According to the analysis result, (1) the gap came from the status change of thinking that the concentration of certification issuance request has eased after the computerization, (2) the gap existed in the operational process because they failed to consider the situational factors of each registry office in the simulation model, and (3) lastly the gap came from the difficulty of formulating the mathematical model for predicting the complex and diverse behavior pattern of individuals requesting the certification issuance. In order to narrow the existing gaps, we made a proposal to improve the certification issuance process where software of certification issuance vending machine was upgraded in order to help the people to use the service conveniently, more part time workers were hared when there was a overload of certification issuance request, and the quality of the certification Issuance vending machine is improved, In this research, we examined an efficient way of resource allocation based on the simulation conducted in 1994 and 1997. By reflecting changes since the simulation of 1994 and allocating the clerk and machine based on the predicted results of the simulation, we maximized the efficiency of the certification issuance process. In conclusion, this research examined the future usability of simulation method based on the analysis result and identified the key issues to consider when using the simulation method in the future consulting project.

  • PDF

Application of Terrestrial LiDAR for Displacement Detecting on Risk Slope (위험 경사면의 변위 검출을 위한 지상 라이다의 활용)

  • Lee, Keun-Wang;Park, Joon-Kyu
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.1
    • /
    • pp.323-328
    • /
    • 2019
  • In order to construct 3D geospatial information about the terrain, current measurement using a total station, remote sensing, GNSS(Global Navigation Satellite System) have been used. However, ground survey and GNSS survey have time and economic disadvantages because they have to be surveyed directly in the field. In case of using aerial photographs and satellite images, these methods have the disadvantage that it is difficult to obtain the three-dimensional shape of the terrain. The terrestrial LiDAR can acquire 3D information of X, Y, Z coordinate and shape obtained by scanning innumerable laser pulses at densely spaced intervals on the surface of the object to be observed at high density, and the processing can also be automated. In this study, terrestrial LiDAR was used to analyze slope displacement. Study area slopes were selected and data were acquired using LiDAR in 2016 and 2017. Data processing has been used to generate slope cross section and slope data, and the overlay analysis of the generated data identifies slope displacements within 0.1 m and suggests the possibility of using slope LiDAR on land to manage slopes. If periodic data acquisition and analysis is performed in the future, the method using the terrestrial lidar will contribute to effective risk slope management.

Research on regional spatial information analysis platform about NTIS raw data (국가과학기술지식 원시데이터에 관한 지역 공간정보 분석 플랫폼 연구)

  • Lim, Jung-Sun;Kim, Sanggook;Bae, Seoung Hun;Kim, Kwang-Hoon;Won, Dong-Kyu
    • Journal of Cadastre & Land InformatiX
    • /
    • v.50 no.2
    • /
    • pp.21-35
    • /
    • 2020
  • Due to the coronavirus pandemic and diplomatic disputes, governments are actively developing a policy to revitalize·reshore manufacturing and to diversify international cooperations. In order to develop such a policy, it is very important to compare and analyze domestic·international geospatial information. Over the decade, the US·EC governments have conducted a series of national researches to build data-based tools that can monitor·analyze regional geospatial information driven by government R&D investments. In the case of the EC system, it can compare geospatial information in domestic and international(including Korea) regions. Compared to US·EC cases, Korean examples of national researches with available data analplatform need future improvements. Current study is investigating an automated analysis methodologies using "National Institute of Science and Technology Information (NTIS)" DB, which was national security data until recently. Research on data-mining regional geospatial information can contribute to support policy fields that need to discover new issues in response to unexpected social problems such as recently faced corona and trade disputes.

Analysis of Georeferencing Accuracy in 3D Building Modeling Using CAD Plans (CAD 도면을 활용한 3차원 건축물 모델링의 Georeferencing 정확도 분석)

  • Kim, Ji-Seon;Yom, Jae-Hong;Lee, Dong-Cheon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.25 no.2
    • /
    • pp.117-131
    • /
    • 2007
  • Representation of building internal space is an active research area as the need for more geometrically accurate and visually realistic increases. 3 dimensional representation is common ground of research for disciplines such as computer graphics, architectural design and engineering and Geographic Information System (GIS). In many cases CAD plans are the starting point of reconstruction of 3D building models. The main objectives of building reconstruction in GIS applications are visualization and spatial analysis. Hence, CAD plans need to be preprocessed and edited to adapt to the data models of GIS SW and then georeferenced to enable spatial analysis. This study automated the preprocessing of CAD data using AutoCAD VBA (Visual Basic Application), and the processed data was topologically restructured for further analysis in GIS environment. Accuracy of georeferencing CAD data was also examined by comparing the results of coordinate transformation by using digital maps and GPS measurements as the sources of ground control points. The reconstructed buildings were then applied to visualization and network modeling.

Estimation and assessment of long-term drought outlook information using the long-term forecasting data (장기예보자료를 활용한 장기 가뭄전망정보 산정 및 평가)

  • So, Jae-Min;Oh, Taesuk;Bae, Deg-Hyo
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.10
    • /
    • pp.691-701
    • /
    • 2017
  • The objective of this study is to evaluate the long-term drought outlook information based on long-term forecast data for the 2015 drought event. In order to estimate the Standardized Precipitation Index (SPI) for different durations (3-, 6-, 9-, 12-months), we used the observation precipitation of 59 Automated Synoptic Observing System (ASOS) sites, forecast and hindcast data of GloSea5. The Receiver Operating Characteristic (ROC) analysis and statistical analysis (Correlation Coefficient, CC; Root Mean Square Error, RMSE) were used to evaluate the utilization of drought outlook information for the forecast lead-times (1~6months). As a result of ROC analysis, ROC scores of SPI(3), SPI(6), SPI(9) and SPI(12) were estimated to be over 0.70 until the 2-, 3-, 4- and 5-months. The CC and RMSE values of SPI(3), SPI(6), SPI(9) and SPI(12) for forecast lead-time were estimated as (0.60, 0.87), (0.72, 0.95), (0.75, 0.95) and (0.77, 0.89) until the 2-, 4-, 5- and 6-months respectively.