A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)
-
- Journal of Intelligence and Information Systems
- /
- v.26 no.1
- /
- pp.1-21
- /
- 2020
With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, it has been employed to discover new market and/or technology opportunities and support rational decision making of business participants. The market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been a continuous demand in various fields for specific product level-market information. However, the information has been generally provided at industry level or broad categories based on classification standards, making it difficult to obtain specific and proper information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than that of previously offered. We applied Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows: First, the data related to product information is collected, refined, and restructured into suitable form for applying Word2Vec model. Next, the preprocessed data is embedded into vector space by Word2Vec and then the product groups are derived by extracting similar products names based on cosine similarity calculation. Finally, the sales data on the extracted products is summated to estimate the market size of the product groups. As an experimental data, text data of product names from Statistics Korea's microdata (345,103 cases) were mapped in multidimensional vector space by Word2Vec training. We performed parameters optimization for training and then applied vector dimension of 300 and window size of 15 as optimized parameters for further experiments. We employed index words of Korean Standard Industry Classification (KSIC) as a product name dataset to more efficiently cluster product groups. The product names which are similar to KSIC indexes were extracted based on cosine similarity. The market size of extracted products as one product category was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For the performance verification, the results were compared with actual market size of some items. The Pearson's correlation coefficient was 0.513. Our approach has several advantages differing from the previous studies. First, text mining and machine learning techniques were applied for the first time on market size estimation, overcoming the limitations of traditional sampling based- or multiple assumption required-methods. In addition, the level of market category can be easily and efficiently adjusted according to the purpose of information use by changing cosine similarity threshold. Furthermore, it has a high potential of practical applications since it can resolve unmet needs for detailed market size information in public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support program conducted by governmental institutions, as well as business strategies consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic-based word embedding module can be advanced by giving a proper order in the preprocessed dataset or by combining another algorithm such as Jaccard similarity with Word2Vec. Also, the methods of product group clustering can be changed to other types of unsupervised machine learning algorithm. Our group is currently working on subsequent studies and we expect that it can further improve the performance of the conceptually proposed basic model in this study.
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70
This study was peformed to document the association between nutrient intakes, body mass index (BMI), waist/hip ratio (WHR), and a major risk factor for chronic diseases. A three-day dietary intake survey, using a 24 hour recall method, was obtained from 187 subjects aged 46 to 84 (mean age 65.3) living in Wando island area. The average daily mean energy intakes were 1869.0 kcal for male and 1943.9 kcal for female, respectively. Daily intakes of protein for male and female were 28.0 and 30.4 g, and those of fat were 31.5 and 28.51 g, respectively Carbohydrate dependency was decreased with age. Protein dependency was increased with age. The mean intakes of energy, protein, Vit. A, Vit. D, Vit. E, Ca, Zn did not meet Korean RDA for elderly. The level of serum triglyceride was higher in males than in females and showed the tendency to increase with age in both sexes, whereas HDL-cholesterol tended to decrease with age in both sexes. The levels of serum total-cholesterol and LDL-cholesterol were significantly higher in males than in females, particularly in the age of
There is increasing interest in freshly cut products, that is, foods produced without washing and cutting. In this study, the quality of freshly cut sliced Deodeok was compared with that of what based on its washing methods. In bubble washing, the Deodeok rises to the water surface apace and is broken into centimeter sizes. Microbubble washing calls for the production of a great number of 0.1 mm-sized bubbles in anions-bearing water and their passing through a trumpet-shaped hole at a high pressure. To compare the product deterioration rates of the specimens, they were stored at
Three dimensional analysis of malocclusion and craniofacial deformation is essential for the successful orthodontic treatment. But the orthodontists are not familiar with diagnosis and treatment plane based on lateral cephalometric analysis. Since orthodontists do not posses a sufficient knowledge in standard value of posteroanterior cephalometric anaysis and of clinical importance for transverse jaw growth. In this study male(n=130) and female(n=171) aged from 6 to 16 and diagnosed as Class I malocclusion were selected to analysis width of cranium, maxilla and mandible on the posteroanterior cephalogram. The changes as a function of chronologic age and cervical vertebrae maturity index(CVXI) were examined. The Proper regression model was selected by sex with polynominal regression models and method of variable selection. Mean of each measurements and 70% confidence interval of individual measurement according to age was assesed and a graphs were made. Results are as follows :1. All the measurements for the width are gradually incresed as increase in chronologic age and CVMI. From the total amount of change between age 6 and 16, there is a tendency that mandibular width is broader than maxillary width and the width of male is broader than female. 2. There is no statistically significant sexual difference in Mx-Mn difference, Mx-Mn width differential, Mx/Mn ratio according to age and CVMI. 3. Mean of each measurement and 70% confidence interval of individual measurement according to age and sex were assessed and graphs were made for maxillary width, mandibular width, Mx-Mn difference, Mx/Mn ratio. 4. The width of maxilla and mandible in Korean children are broader than Western children during growth period.
This study, as a basic research to manage a Chinese Medicine Health Promotion Center by way of showing an example, is a before and after experiment research for simple group to verify a difference with cholesterol, health status and perception of health in order to confirm a effectiveness of diet and regimen according to the 4th status of physical constitution. Research object was chosen of 42 persons who operate a physical constitutional dietary regimen among them after selecting professors and clinical nurses (55 persons) majoring in the science of nursing who participated in Chinese Medicine-oriented Nurse Training Course from Aug. of 2001 to Feb. of 2002 all over the country. Diagnostic tools for physical constitution was used of the questionary that is currently consisted of physical constitution grouping test in Eastern & Western Diagnose Center of K Medical Center, and rating of health status was used of the tool that standardized CMI(Cornell Medical Index) to be available for Korean, and perception measurement for health status was used of a visual analogue scale for the health status that each one perceive personally, and physiological status was measured of cholesterol in blood. Analysis for the collected data was carried out by percentage,
The purpose of this study is to investigate the total polyphenol and antioxidant activities of radish buds (Raphanus sativus L.) based on sprouting periods and extraction solvents in order to present basic data that are needed for using the radish buds as functional food material. The antioxidant activities were assessed by using various antioxidant models (DPPH, TBARS, Rancimat method, POV). The total polyphenol contents according to the extraction solvents were 84.11 and 296.51 mg/g, and the ethanol extract on day 4 of showed the highest value as 296.51 mg/g. As for DPPH radical-scavenging activity, on the day 4 of sprouting, water extracts indicated the highest scavenging activity by 86.67%, and the acetone extracts indicated a rather low scavenging activity as 77.23%. As for TBARS measurement of the radish bud extracts on day 4 of sprouting the extract of 70% ethanol was highest (71.48%). On day 8 of sprouting the TBARS value was increased and the methanol extract was highest (78.99%). As for the oxidative induction period on day 4 of sprouting in Rancimat measurement, the methanol extract was highest (6.07 hours) on day 4 and the antioxidant index was 1.16. On day 12 of sprouting, the general oxidative induction period tended to be reduced to 5.25 to 5.91 hours. In the peroxide value measurement on day 4 of sprouting and beginning of the storage, the extracts showed no difference between 3.02 meq/kg oil and 4.12 meq/kg oil, and on the day 60 of storage, the water extract (43.83 meq/kg oil) and the methanol extract (45.42 meq/kg oil) were lowest with higher antioxidant effect. In conclusion, the radish bud extract with higher total polyphenol contents and antioxidant activities may serve as functional material for food additives, such as natural antioxidants and food preserving agents.
The increase of the supply of medical service and the increase of hospitals have intensified the competition of hospitals, and the advancement towards internationalization in the opening of medical industry has triggered the infinite competition of medical profession. In addition, the high expectation of customers and quality improvement in the medical care in accordance with the improvement of overall income, and the change of active role of medical consumers according to the popularization and the improvement of rights awareness reflect the customer needs and choice in the medical service. Customers wanted to receive the kind and pleasant service under the up-to-date medical service. Therefore, as a solution, hospital coordinators were emerged for the purpose of smooth treatment and customer satisfaction by generalizing all service of hospital. Accordingly, this thesis attempted to investigate the effect of hospital coordinator education curriculum on the education satisfaction and the quality of medical service. In order to solve the purpose of this study, I, author reviewed the existing literatures, established hypothesis, and verified hypothesis by using the variety of statistics techniques such as reliability, validity, frequency analysis, and regression analysis. The verification of hypothesis is as followings: First, among education training factors of hospital coordinators, the quality of instructor significantly affects the satisfaction of hospital coordinator education training. Second, among training factors of hospital coordinator, the attitude of trainee significantly affects the training satisfaction of hospital coordinator. Third, among education training factors of hospital coordinator, education course significantly affects the training satisfaction of hospital coordinator education. As the qualities of instructor are better equipped, the satisfaction of education becomes higher. It indicates that the education method of instructors is important as an index to represent the qualities of instructor such as the appropriateness of education method, preparation, passion, visual materials, the adequacy of education procession, and specialized knowledge, and it has important effect on the satisfaction of education. In order to enhance the satisfaction of hospital coordinator education, the creation of education environment, making trainee concentrate on the education, is required by appropriately allocating programs, arousing interest in education, based on the attitude of trainee, discussion, and preliminary programs, preparation, ahead of enforcement of education. Fourth, the satisfaction of hospital coordinator education training significantly affects the reliability among the qualities of medical service. Fifth, satisfaction of hospital coordinator education training significantly affects hospitality I kindness among the qualities of medical service. If the education satisfaction of trainee is high, it is effective in the practical application such as dealing with complaints, the duty performance for the patients, and so on in offering the medical service, related to reliability and furthermore, we can find the positive change in the attitude change of medical professions related to the reliability of hospital coordinator. In addition, in the process of offering medical services such as the kind explanation on the duty, rapid response to the customers inquiry, and tidy uniform, practical effect was verified. Sixth, the education training factor of hospital coordinator significantly affects the reliability among the quality of medical service. Seventh, the education training factors of hospital coordinator significantly affect hospitality/kindness. In the education of hospital coordinator, the methods to attract the interest of trainee by emphasizing reliability should be sought and for gaining the practical effect of hospital coordinator education, the sufficient preparation and investigation on the education curriculum should be prerequisite and under this condition, intensified discussion on the instructor and education course is needed. In the design of education course, more education hours and subjects should be allocated in the part of hospitality in order to improve the practical application of hospitality. Therefore, it is meaningful in a sense that this study newly approached the components of hospital coordinator education and the need to modify the quality components of medical service in accordance with the study subjects was raised. This study also finds its meaning in that it provides basic materials for the study of future hospital coordinator education by suggesting the system development model of hospital coordinator education through preliminary study of education training. In addition, this study is meaningful in the aspect that it suggested the direction of education training by showing how the hospital coordinator education training would applied to the hospital coordinator course of the Continuing Education Center at Pusan and Kyungnam National University to some extent. Since all investigation of this study was approached from the side of hospital coordinator, the thoughts of patients who are beneficiaries of medical service, and care givers cannot be identified. Therefore, the satisfaction of patients and care givers through the experience of medical service, which is the essential prerequisite of medical service, should be importantly considered and investigated. Accordingly, The study of comparing and analyzing the views of both patients and care givers should be carried out in the future.
With significant influences of old industrial complex in September 2009, Ministry of Land, Infrastructure and Transport chose the 4 districts for the first pilot project. In December 2014, the second pilot project districts were established. In addition, there were 10 districts in April 2016 and 5 districts in April 2016 as the third pilot project and 5 districts in March 2017 as the fourth pilot project. In order to promote smooth business operation of the recycling business, we introduced the effective area designation and special system as stipulated in Article 39.12-13 of the Industrial Location and Development Act revised in May 2015. The effective area, It is a method that can promote propagation and diffusion of the rehabilitation business through visualization by making effective the promotion of the rehabilitation business and by promoting the business in consideration of the geographical feature of the region and industry group, The setting of the unreasonable effective area is based on the criteria and classification of the plan and the objective promotion method according to the individual characteristics of the aged industrial park because the delay of the rehabilitation business and the possibility of the increase of many problems are presented Be sure to Data Envelopment Analysis (DEA) and the old industrial complex database were constructed and utilized to classify the types of recycling projects. Therefore, in this study, it is necessary to strengthen the competitiveness of aged industrial complex by examining the correlation between the diagnosis of 83 aged industrial complex sites and the rehabilitation projects supported by the Ministry of Land, and the types of business promotion for aged industrial parks. It can be used as a guideline for the feasibility of the project.