• Title/Summary/Keyword: development on standard model


A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems / v.26 no.1 / pp.1-21 / 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data, which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, text mining has been employed to discover new market and/or technology opportunities and to support rational decision making by business participants. Market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been continuous demand in various fields for market information at the level of specific products. However, such information has generally been provided at the industry level or in broad categories based on classification standards, making it difficult to obtain specific and appropriate information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than those previously offered. We applied the Word2Vec algorithm, a neural-network-based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows. First, the data related to product information are collected, refined, and restructured into a form suitable for applying the Word2Vec model. Next, the preprocessed data are embedded into a vector space by Word2Vec, and the product groups are derived by extracting similar product names based on cosine similarity. Finally, the sales data for the extracted products are summed to estimate the market size of each product group. As experimental data, product names from Statistics Korea's microdata (345,103 cases) were mapped into a multidimensional vector space by Word2Vec training.
We performed parameter optimization for training and then applied a vector dimension of 300 and a window size of 15 as the optimized parameters for further experiments. We employed the index words of the Korean Standard Industry Classification (KSIC) as a product name dataset to cluster product groups more efficiently. Product names similar to the KSIC index words were extracted based on cosine similarity. The market size of the extracted products, treated as one product category, was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For performance verification, the results were compared with the actual market sizes of some items; the Pearson correlation coefficient was 0.513. Our approach has several advantages over previous studies. First, text mining and machine learning techniques were applied to market size estimation for the first time, overcoming the limitations of traditional methods that rely on sampling or require multiple assumptions. In addition, the level of market category can be adjusted easily and efficiently according to the purpose of information use by changing the cosine similarity threshold. Furthermore, the approach has high potential for practical application since it can resolve unmet needs for detailed market size information in the public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support programs conducted by governmental institutions, as well as in business strategy consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic word embedding module can be advanced by imposing a proper order on the preprocessed dataset or by combining another measure such as Jaccard similarity with Word2Vec.
Also, the product group clustering method can be replaced with other types of unsupervised machine learning algorithms. Our group is currently working on subsequent studies, and we expect them to further improve the performance of the basic model conceptually proposed in this study.
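
The grouping and summation steps described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the three-dimensional vectors, product names, sales figures, and the function name `estimate_market_size` are all hypothetical stand-ins for trained 300-dimensional Word2Vec embeddings and the Statistics Korea microdata.

```python
import numpy as np

# Hypothetical toy embeddings standing in for trained Word2Vec vectors
# (the study used 300 dimensions; 3 are used here for readability).
embeddings = {
    "laptop": np.array([0.9, 0.1, 0.0]),
    "notebook computer": np.array([0.8, 0.2, 0.1]),
    "tablet pc": np.array([0.7, 0.3, 0.2]),
    "instant noodles": np.array([0.0, 0.9, 0.4]),
}

# Hypothetical sales per product name, in arbitrary units.
sales = {
    "laptop": 120.0,
    "notebook computer": 80.0,
    "tablet pc": 50.0,
    "instant noodles": 200.0,
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def estimate_market_size(index_word, threshold):
    """Group products similar to the index word and sum their sales."""
    anchor = embeddings[index_word]
    group = [
        name
        for name, vector in embeddings.items()
        if cosine_similarity(anchor, vector) >= threshold
    ]
    return group, sum(sales[name] for name in group)


narrow_group, narrow_size = estimate_market_size("laptop", 0.95)
broad_group, broad_size = estimate_market_size("laptop", 0.90)
```

With these toy values, the 0.95 threshold groups only "laptop" and "notebook computer", while lowering it to 0.90 also pulls in "tablet pc". This mirrors the abstract's point that the cosine similarity threshold controls how broad the market category is.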

CAS 500-1/2 Image Utilization Technology and System Development: Achievement and Contribution (국토위성정보 활용기술 및 운영시스템 개발: 성과 및 의의)

  • Yoon, Sung-Joo;Son, Jonghwan;Park, Hyeongjun;Seo, Junghoon;Lee, Yoojin;Ban, Seunghwan;Choi, Jae-Seung;Kim, Byung-Guk;Lee, Hyun jik;Lee, Kyu-sung;Kweon, Ki-Eok;Lee, Kye-Dong;Jung, Hyung-sup;Choung, Yun-Jae;Choi, Hyun;Koo, Daesung;Choi, Myungjin;Shin, Yunsoo;Choi, Jaewan;Eo, Yang-Dam;Jeong, Jong-chul;Han, Youkyung;Oh, Jaehong;Rhee, Sooahm;Chang, Eunmi;Kim, Taejung
    • Korean Journal of Remote Sensing / v.36 no.5_2 / pp.867-879 / 2020
  • As the era of space technology utilization approaches, the CAS (Compact Advanced Satellite) 500-1/2 satellites are scheduled for launch during 2021 to acquire high-resolution images. Accordingly, increased image usability and processing efficiency have been emphasized as key design concepts of the CAS 500-1/2 ground station. In this regard, the "CAS 500-1/2 Image Acquisition and Utilization Technology Development" project has been carried out to develop core technologies and processing systems for collecting, processing, managing, and distributing CAS 500-1/2 data. In this paper, we introduce the results of this project. We developed an operation system that generates precision images automatically using a GCP (Ground Control Point) chip DB (Database) and a DEM (Digital Elevation Model) DB covering the entire Korean peninsula. We also developed a system that produces ortho-rectified images indexed to 1:5,000 map grids, and hence laid a foundation for an ARD (Analysis Ready Data) system. In addition, we linked various application software to the operation system to systematically produce mosaic images, DSM (Digital Surface Model)/DTM (Digital Terrain Model), spatial feature thematic maps, and change detection thematic maps. The major contributions of the developed system and technologies are that precision images are generated automatically using a GCP chip DB for the first time in Korea and that various utilization product technologies are incorporated into the operation system of a satellite ground station. The developed operation system has been installed at the Korea Land Observation Satellite Information Center of the NGII (National Geographic Information Institute). We expect the system to contribute greatly to the center's work and to provide a standard for future ground station systems of earth observation satellites.

Analysis of Changes in Forest According to Urban Expansion Pattern and Morphological Features - Focused on Seoul and Daegu - (도시의 공간 확장 및 형태적 특징에 따른 산림녹지의 변화 분석 - 서울, 대구를 중심으로 -)

  • Ryu, Jieun;Hwang, Jinhoo;Lee, Junhee;Chung, Hye-In;Lee, Kyung-il;Choi, Yu-Young;Zhu, Yongyan;Sung, Min-Jun;Jang, Raeik;Sung, Hyun-Chan;Jeon, Seongwoo;Kang, Jin-Yung
    • Korean Journal of Remote Sensing / v.33 no.5_3 / pp.835-854 / 2017
  • Government regulations and policies are important means of restraining the indiscriminate expansion of urban areas. Depending on the standards set by those means, the proportion of forest green space encroached upon by growing urban space clearly fluctuates. In this study, we interpreted the changes in urban areas and green areas caused by urban expansion by decade from 1996, focusing on Seoul and Daegu, two highly developed cities in South Korea. The purpose of this study is to analyze the spatial expansion and morphological characteristics of urban land cover using satellite imagery (1996, 2006, 2016) together with the urban expansion intensity index (UEII) and the GUIDOS program. Ultimately, this study compares the changes in the proportion of forest due to urban expansion, analyzed annually by forest elevation, slope, and the area of single forest patches. In Seoul, expansion from urban space into suburban areas was comparatively rapid, which led to forest fragmentation and a gradual decline in single-patch area. However, above certain thresholds of elevation (from the DEM, Digital Elevation Model) and slope, development regulations meant that there was little decrease in forest area from anthropogenic development. The urban and suburban areas of Daegu have expanded slowly since 1996, whereas green forests have greatly increased through forest conservation campaigns. In this way, government policies and regulations could control the quantitative and morphological expansion of cities caused by development and preserve forest spaces. Therefore, government regulations and policies should be appropriately utilized for sustainable cities.
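
The abstract names the urban expansion intensity index (UEII) without giving its formula. A commonly used definition, assumed here rather than taken from the paper, normalizes the growth in urban area by the total area of the spatial unit and the length of the period: UEII = (U_end - U_start) / (A_total × Δt) × 100. The sketch below applies that assumed definition; the function name and the input values are hypothetical and are not the Seoul or Daegu figures.

```python
def urban_expansion_intensity_index(urban_start, urban_end, total_area, years):
    """Average yearly urban growth as a percentage of the unit's total area.

    Assumed definition: (urban_end - urban_start) / (total_area * years) * 100.
    All areas must use the same unit (for example km^2).
    """
    if total_area <= 0 or years <= 0:
        raise ValueError("total_area and years must be positive")
    return (urban_end - urban_start) / (total_area * years) * 100.0


# Hypothetical spatial unit: urban land grows from 200 to 250 km^2
# over 10 years inside a 500 km^2 unit.
ueii = urban_expansion_intensity_index(200.0, 250.0, 500.0, 10)
```

Here the index is about 1.0, meaning that on average 1% of the unit's total area was converted to urban land each year under this definition.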

Classification of Landscape Type on Land and Evaluation of Site-suitability Based on It (토지의 경관유형분류와 이에 기초한 입지타당성 평가)

  • Ra, Jung-Hwa;Ku, Ji-Na;Lee, Hyun-Taek;Cho, Hyun-Ju
    • Journal of the Korean Institute of Landscape Architecture / v.39 no.5 / pp.57-75 / 2011
  • The purpose of this study is to find ways of evaluating the suitability of sites being considered for the development of different types of parks in the vicinity of Yangmock-myun, Kyoung Buk, where a large project (about 14.0 km²) has been planned. The results are as follows. Three surveys were performed to select the assessment indicators. The first survey analyzed the importance of 23 assessment indicators selected through a review of the existing literature and on-the-spot research. The second survey selected assessment indicators for each park type. The third survey computed additive values of the selected assessment indicators for each park type, using a method that standardizes the average importance of the indicators so that their sum equals 10. These additive values were then multiplied by the grade of each indicator to make a final evaluation. The site suitability of the park types was evaluated twice. The purpose of the first evaluation was to determine how well each type met the minimum requirements targeted for all landscape types. The minimum requirements were derived from a relative comparison between the value rating standards of the assessment indicators that scored above medium in the importance analysis and the results of field research. The second evaluation assessed the target sites that met the minimum requirements. The value ratings of the second assessment indicators were quantitatively divided into grades 1 to 3, and the evaluation scores were added up, weighted by the additive value of each assessment indicator. The evaluation score for each park type was then rated on a scale of 1 to 3 (from lowest to highest) according to its average.
Since the site-suitability evaluation model for park types in this study focused only on the 'face' aspect of space, additional analysis is needed to refine the model so that it incorporates the overall impact of space, network connections, and other factors, considering the 'spot', 'line', and 'face' aspects of space.
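
The weighting scheme in the abstract above (average importances rescaled so that they sum to 10, then multiplied by each indicator's grade of 1 to 3) can be sketched as follows. The indicator names, importance values, and grades are hypothetical and chosen only to make the arithmetic easy to follow; the paper's actual 23 indicators are not listed in the abstract.

```python
def standardize_weights(importance, total=10.0):
    """Rescale average importance values so that the weights sum to `total`."""
    importance_sum = sum(importance.values())
    return {name: value * total / importance_sum for name, value in importance.items()}


def suitability_score(weights, grades):
    """Weighted sum of indicator grades (each grade is 1, 2, or 3)."""
    return sum(weights[name] * grades[name] for name in weights)


# Hypothetical average importance from a survey, per indicator.
importance = {"accessibility": 4.0, "slope": 3.0, "vegetation": 3.0}
# Hypothetical grades assigned to one candidate site.
grades = {"accessibility": 3, "slope": 2, "vegetation": 1}

weights = standardize_weights(importance)
score = suitability_score(weights, grades)
```

Because the weights sum to 10 and grades run from 1 to 3, the score always falls between 10 and 30, which makes sites and park types directly comparable. In this toy case the weights are 4, 3, and 3, and the score is 21.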

Analysis of Design Live Load of Railway Bridge Through Statistical Analysis of WIM Data for High-speed Rail (고속철도 WIM 데이터에 대한 통계분석을 통한 철도교량 설계활하중 분석)

  • Park, Sumin;Yeo, Inho;Paik, Inyeol
    • Journal of the Computational Structural Engineering Institute of Korea / v.28 no.6 / pp.589-597 / 2015
  • In this paper, the live load model for the design of high-speed railway bridges is analyzed by statistical and probabilistic methods, and the safety level provided by the load factors of the load combination is examined. This study is part of the development of a limit state design method for railway bridges, and it uses train data collected from the Gyeongbu high-speed railway over about one month. Four different statistical methods are applied to estimate the design load corresponding to the bridge design life, and the results are compared. To examine the safety level provided by the design load combination for railway bridges, reliability indexes are determined and the results are analyzed. The load effect from the current design live load for high-speed rail bridges, which is 0.75 times the standard train load, turned out to be at least 22-30% greater than that from the load estimated from the measured data. Judged on the basis of the ultimate limit state, the reliability analysis indicates that the safety factors could possibly be reduced further.

Estimation of Human Lower-Extremity Muscle Force Under Uncertainty While Rising from a Chair (의자에서 일어서는 동작 시 불확실성을 고려한 인체 하지부 근력 해석)

  • Jo, Young Nam;Kang, Moon Jeong;Chae, Je Wook;Yoo, Hong Hee
    • Transactions of the Korean Society of Mechanical Engineers A / v.38 no.10 / pp.1147-1155 / 2014
  • Biomechanical models are often used to predict muscle and joint forces in the human body. For estimation of muscle forces, the body and muscle properties have to be known. However, these properties are difficult to measure and differ from person to person. Therefore, it is necessary to predict the change in muscle forces depending on the body and muscle properties. The objective of the present study is to develop a numerical procedure for estimating the muscle forces in the human lower extremity under uncertainty of body and muscle properties during rising motion from a seated position. The human lower extremity is idealized as a multibody system in which eight Hill-type muscle force models are employed. Each model has four degrees of freedom and is constrained in the sagittal plane. The eight muscle forces are determined by minimizing the metabolic energy consumption during the rising motion. Uncertainty analysis is performed using a first-order reliability method. The one-standard-deviation range of agonistic muscle forces is calculated to be about 150-300 N.

Analysis of City Size Distribution and Spatial Structure - with Korean Metropolitan Statistical Areas (MSA) (한국 도시의 규모분포와 도시공간구조 분석 - 광역도시통계권을 중심으로)

  • Kim, Dong-Soo;Huh, Mun-Gu;Lee, Doo-Hee
    • Journal of the Economic Geographical Society of Korea / v.11 no.4 / pp.549-563 / 2008
  • The purpose of this research is to identify the urban structure of Korea. Although there is research on urbanization, there is little on the urban structure of the Korean economy. This paper discusses two issues: the measurement of inter-city and intra-city structure in the newly defined Korean Metropolitan Statistical Areas (MSAs). First, the city size rank rule, widely known as Zipf’s Law, is used to illustrate the Korean inter-city structure. The city size rank rule indicates whether Korean MSAs are balanced or not. In general, Korea's population is heavily concentrated in the Seoul MSA. It could be that the Seoul MSA is too big, that the Busan MSA is too small, or both. If this is a primacy problem, a decentralization policy is necessary. On the other hand, if it is a second-city problem, development policies for the Busan MSA and Daegu MSA are more important. Next, the Korean intra-city structure is discussed. The evolution of the MSAs explains intra-city structure through analysis of the population density function and the housing price function. Some large MSAs such as Seoul and Busan have experienced urban sprawl, while other MSAs have experienced urban concentration. The population density gradient with respect to distance, computed with ArcGIS, shows the growth rate of a city. According to the Spatial Mismatch Index between population and employment, the Ulsan MSA, Gwangju MSA, and Suwon-Hwaseong-Osan MSA are more mismatched, while the Daejeon MSA and Incheon MSA are less mismatched. Therefore, these analyses of Korean urban structure are meaningful for developing regional policy.
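
The city size rank rule mentioned above is usually checked by regressing log rank on log size: under Zipf's Law the fitted exponent is close to 1. The sketch below is one common way to run that check and is not taken from the paper; the function name `rank_size_exponent` and the synthetic populations are hypothetical, and the paper's MSA data are not reproduced here.

```python
import numpy as np


def rank_size_exponent(populations):
    """Estimate alpha in log(rank) = c - alpha * log(size) by least squares."""
    sizes = np.sort(np.asarray(populations, dtype=float))[::-1]  # largest first
    ranks = np.arange(1, len(sizes) + 1)
    slope, _intercept = np.polyfit(np.log(sizes), np.log(ranks), 1)
    return -slope


# Synthetic cities that follow Zipf's Law exactly: size is proportional to 1 / rank.
synthetic_populations = [1_000_000 / rank for rank in range(1, 11)]
alpha = rank_size_exponent(synthetic_populations)
```

For the synthetic data the estimated exponent is 1 up to floating-point error. With real data, a largest city that sits far above the fitted line would point toward the primacy problem the abstract describes, and a second city far below the line toward the second-city problem.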


Development of Survey Framework for Prevailing Wage in the Construction Industry (건설분야 적정임금 산정을 위한 임금조사 프레임워크 개발)

  • Lee, Ju-hyun;Baek, Seung-Ho
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.1 / pp.138-147 / 2020
  • The construction industry is one of the most representative job-creating sectors, but it has been pointed out that the overall quality of its jobs is low because of the nature of an order-based production industry, with problems such as an unstable employment structure and an aging workforce. Accordingly, the government plans to implement a "prevailing wage system" that guarantees a minimum wage for construction site workers. In reality, however, only the market wage could be used for construction cost estimation because there was no standard for the prevailing wage. A comparative analysis of the prevailing wage and the market wage was performed. This paper proposes a framework for estimating a reasonable prevailing wage in the construction industry. The results showed that the prevailing wage was estimated to be 4.7% lower than the market wage when the proposed framework was applied to the case of carpenters. This suggests that the proposed model could be used as an alternative to the market wage, considering the original purpose of the prevailing wage. This study provides basic data for the scientific analysis of wages and should ultimately help estimate a reliable prevailing wage in the future.

Development of ICT-based road safety integrated facilities for pedestrian crossing (ICT기반 횡단보도용 교통안전 통합시설물 개발)

  • Cho, Choong-Yuen;Yim, Hong-Kyu;Lee, Min-Jae
    • Journal of the Korea Academia-Industrial cooperation Society / v.18 no.12 / pp.93-99 / 2017
  • The rate of traffic accidents that occurred in Korea last year was 10 per 100,000 people, ranking it 6th among the 35 OECD member countries. The accident rate of children with disabilities and elderly people is also high. The purpose of this study is to introduce traffic safety facilities developed to reduce traffic accidents in non-urban areas in Korea, based on an analysis of the related literature, of accident factors using traffic accident analysis system data, and of traffic accident characteristics. Traffic safety integrated facilities for ICT-based pedestrian crossings are subject to cross-sectional coverage of child protection zones. The smart safety fence prevents vehicles from parking illegally and informs pedestrians that a vehicle is approaching the pedestrian crossing. The smart bump is designed to warn drivers who are not aware of pedestrians. In order to standardize the appropriate form and size of the traffic safety facilities for pedestrian crossings, we constructed a standard model for each type, considering the road function, press classification, power, number of lanes, geometric form, etc. As a result, the rate of traffic accidents involving vulnerable people was reduced. In addition, maintenance costs are expected to be reduced through the use of a solar power supply and compatibility with existing installed safety fences.

Feasibility Study of Developing Ship Engineering Control System based on DDS Middle-ware (DDS 미들웨어 기반의 선박 통합기관감시제어체계 개발 가능성 연구)

  • Seongwon Oh
    • Journal of the Korean Society of Marine Environment & Safety / v.29 no.6 / pp.653-658 / 2023
  • In systems where many sensors and actuators are connected, such as the combat management system of a naval ship or a civilian smart city, the DDS (Data Distribution Service) middle-ware is mainly used to transmit large amounts of data. It is scalable and can respond effectively to future increases in the sensors or equipment connected to the system. The engineering control system (ECS), which plays a role as important as that of the combat management system of a naval ship, still uses a server-client model with industrial protocols such as Modbus and CAN (Controller Area Network) bus to transmit data, which is unfavorable in terms of scalability. As automation and unmanned systems advance, more sensors and actuators are expected to be added, necessitating substantial program modification. DDS can effectively address such situations. The purpose of this study is to confirm the feasibility of developing an integrated monitoring and control system for a ship using OpenDDS, which follows the OMG (Object Management Group) standard, among the DDS middle-ware used in combat management systems. To achieve this goal, field equipment simulators and an ECS server were configured, and field equipment data input/output and simulation were performed using DDS. The ECS prototype successfully handled data transmission, confirming that DDS is capable of serving as the middle-ware for the ECS of a ship.
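
The scalability argument above rests on the publish/subscribe pattern: publishers and subscribers share only a topic name, so adding a sensor does not require changing the consumers. The sketch below illustrates that pattern with a small in-process bus. It is not the OpenDDS API and does not model DDS discovery, QoS, or networking; the class `TopicBus`, the topic names, and the sample fields are all hypothetical.

```python
from collections import defaultdict


class TopicBus:
    """Minimal in-process publish/subscribe bus keyed by topic name."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, callback):
        """Register a callback to receive every sample published on `topic`."""
        self._subscribers[topic].append(callback)

    def publish(self, topic, sample):
        """Deliver `sample` to all current subscribers of `topic`."""
        for callback in self._subscribers[topic]:
            callback(sample)


bus = TopicBus()
received = []

# A hypothetical ECS monitoring screen subscribes to engine temperature.
bus.subscribe("engine/temperature", received.append)

# Hypothetical field equipment simulators publish without knowing who listens.
bus.publish("engine/temperature", {"sensor_id": 1, "celsius": 82.5})
bus.publish("engine/rpm", {"sensor_id": 2, "rpm": 900})  # no subscriber yet
```

Adding a new sensor here only means publishing on a new topic; existing subscribers are untouched, which is the property that a server-client design with hard-wired request handling lacks.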