• Title/Summary/Keyword: Classification Problem

Search Result 1,735, Processing Time 0.029 seconds

Comparison of Models for Stock Price Prediction Based on Keyword Search Volume According to the Social Acceptance of Artificial Intelligence (인공지능의 사회적 수용도에 따른 키워드 검색량 기반 주가예측모형 비교연구)

  • Cho, Yujung;Sohn, Kwonsang;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.103-128
    • /
    • 2021
  • Recently, investors' interest and the influence of stock-related information dissemination are being considered as significant factors that explain stock returns and volume. Besides, companies that develop, distribute, or utilize innovative new technologies such as artificial intelligence have a problem that it is difficult to accurately predict a company's future stock returns and volatility due to macro-environment and market uncertainty. Market uncertainty is recognized as an obstacle to the activation and spread of artificial intelligence technology, so research is needed to mitigate this. Hence, the purpose of this study is to propose a machine learning model that predicts the volatility of a company's stock price by using the internet search volume of artificial intelligence-related technology keywords as a measure of the interest of investors. To this end, for predicting the stock market, we using the VAR(Vector Auto Regression) and deep neural network LSTM (Long Short-Term Memory). And the stock price prediction performance using keyword search volume is compared according to the technology's social acceptance stage. In addition, we also conduct the analysis of sub-technology of artificial intelligence technology to examine the change in the search volume of detailed technology keywords according to the technology acceptance stage and the effect of interest in specific technology on the stock market forecast. To this end, in this study, the words artificial intelligence, deep learning, machine learning were selected as keywords. Next, we investigated how many keywords each week appeared in online documents for five years from January 1, 2015, to December 31, 2019. The stock price and transaction volume data of KOSDAQ listed companies were also collected and used for analysis. As a result, we found that the keyword search volume for artificial intelligence technology increased as the social acceptance of artificial intelligence technology increased. In particular, starting from AlphaGo Shock, the keyword search volume for artificial intelligence itself and detailed technologies such as machine learning and deep learning appeared to increase. Also, the keyword search volume for artificial intelligence technology increases as the social acceptance stage progresses. It showed high accuracy, and it was confirmed that the acceptance stages showing the best prediction performance were different for each keyword. As a result of stock price prediction based on keyword search volume for each social acceptance stage of artificial intelligence technologies classified in this study, the awareness stage's prediction accuracy was found to be the highest. The prediction accuracy was different according to the keywords used in the stock price prediction model for each social acceptance stage. Therefore, when constructing a stock price prediction model using technology keywords, it is necessary to consider social acceptance of the technology and sub-technology classification. The results of this study provide the following implications. First, to predict the return on investment for companies based on innovative technology, it is most important to capture the recognition stage in which public interest rapidly increases in social acceptance of the technology. Second, the change in keyword search volume and the accuracy of the prediction model varies according to the social acceptance of technology should be considered in developing a Decision Support System for investment such as the big data-based Robo-advisor recently introduced by the financial sector.

Development of Diameter Distribution Change and Site Index in a Stand of Robinia pseudoacacia, a Major Honey Plant (꿀샘식물 아까시나무의 지위지수 도출 및 직경분포 변화)

  • Kim, Sora;Song, Jungeun;Park, Chunhee;Min, Suhui;Hong, Sunghee;Yun, Junhyuk;Son, Yeongmo
    • Journal of Korean Society of Forest Science
    • /
    • v.111 no.2
    • /
    • pp.311-318
    • /
    • 2022
  • We conducted this study to derive the site index, which is a criterion for the planting of Robinia pseudoacacia, a honey plant, and to investigate the diameter distribution change by derived site index. We applied the Chapman-Richards equation model to estimate the site index of the Robinia pseudoacacia stand. The site index was distributed within the range of 16-22 when the base age was 30 years. The fitness index of the site index estimation model was low, but we judged that there was no problem in the application because the residual distribution of the equation had not shifted to one side. We used the Weibull diameter distribution function to determine the diameter distribution of the Robinia pseudoacacia stand by site index. We used the mean diameter and the dominant tree height as independent variables to present the diameter distribution, and our analysis procedure was to estimate and recover the parameters of the Weibull diameter distribution function. We used the mean diameter and the dominant tree height of the Robinia pseudoacacia stand to show distribution by diameter class, and the fitness index for dbh distribution estimation was about 80.5%. As a result of schematizing the diameter distribution by site indices as a 30-year-old, we found that the higher the site index, the more the curve of the diameter distribution moved to the right. This suggests that if the plantation were to be established in a high site index stand, considering the suitable trees on the site, the growth of Robinia pseudoacacia woul d become active, and not onl y the production of wood but al so the production of honey would increase. We therefore anticipate that the site index classification table and curve of this Robinia pseudoacacia stand will become the standard for decision making in the plantation and management of this tree.

A Basis Study on the Optimal Design of the Integrated PM/NOx Reduction Device (일체형 PM/NOx 동시저감장치의 최적 설계에 대한 기초 연구)

  • Choe, Su-Jeong;Pham, Van Chien;Lee, Won-Ju;Kim, Jun-Soo;Kim, Jeong-Kuk;Park, Hoyong;Lim, In Gweon;Choi, Jae-Hyuk
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.28 no.6
    • /
    • pp.1092-1099
    • /
    • 2022
  • Research on exhaust aftertreatment devices to reduce air pollutants and greenhouse gas emissions is being actively conducted. However, in the case of the particulate matters/nitrogen oxides (PM/NOx) simultaneous reduction device for ships, the problem of back pressure on the diesel engine and replacement of the filter carrier is occurring. In this study, for the optimal design of the integrated device that can simultaneously reduce PM/NOx, an appropriate standard was presented by studying the flow inside the device and change in back pressure through the inlet/outlet pressure. Ansys Fluent was used to apply porous media conditions to a diesel particulate filter (DPF) and selective catalytic reduction (SCR) by setting porosity to 30%, 40%, 50%, 60%, and 70%. In addition, the ef ect on back pressure was analyzed by applying the inlet velocity according to the engine load to 7.4 m/s, 10.3 m/s, 13.1 m/s, and 26.2 m/s as boundary conditions. As a result of a computational fluid dynamics analysis, the rate of change for back pressure by changing the inlet velocity was greater than when inlet temperature was changed, and the maximum rate of change was 27.4 mbar. This was evaluated as a suitable device for ships of 1800kW because the back pressure in all boundary conditions did not exceed the classification standard of 68mbar.

Transfer Learning using Multiple ConvNet Layers Activation Features with Principal Component Analysis for Image Classification (전이학습 기반 다중 컨볼류션 신경망 레이어의 활성화 특징과 주성분 분석을 이용한 이미지 분류 방법)

  • Byambajav, Batkhuu;Alikhanov, Jumabek;Fang, Yang;Ko, Seunghyun;Jo, Geun Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.205-225
    • /
    • 2018
  • Convolutional Neural Network (ConvNet) is one class of the powerful Deep Neural Network that can analyze and learn hierarchies of visual features. Originally, first neural network (Neocognitron) was introduced in the 80s. At that time, the neural network was not broadly used in both industry and academic field by cause of large-scale dataset shortage and low computational power. However, after a few decades later in 2012, Krizhevsky made a breakthrough on ILSVRC-12 visual recognition competition using Convolutional Neural Network. That breakthrough revived people interest in the neural network. The success of Convolutional Neural Network is achieved with two main factors. First of them is the emergence of advanced hardware (GPUs) for sufficient parallel computation. Second is the availability of large-scale datasets such as ImageNet (ILSVRC) dataset for training. Unfortunately, many new domains are bottlenecked by these factors. For most domains, it is difficult and requires lots of effort to gather large-scale dataset to train a ConvNet. Moreover, even if we have a large-scale dataset, training ConvNet from scratch is required expensive resource and time-consuming. These two obstacles can be solved by using transfer learning. Transfer learning is a method for transferring the knowledge from a source domain to new domain. There are two major Transfer learning cases. First one is ConvNet as fixed feature extractor, and the second one is Fine-tune the ConvNet on a new dataset. In the first case, using pre-trained ConvNet (such as on ImageNet) to compute feed-forward activations of the image into the ConvNet and extract activation features from specific layers. In the second case, replacing and retraining the ConvNet classifier on the new dataset, then fine-tune the weights of the pre-trained network with the backpropagation. In this paper, we focus on using multiple ConvNet layers as a fixed feature extractor only. However, applying features with high dimensional complexity that is directly extracted from multiple ConvNet layers is still a challenging problem. We observe that features extracted from multiple ConvNet layers address the different characteristics of the image which means better representation could be obtained by finding the optimal combination of multiple ConvNet layers. Based on that observation, we propose to employ multiple ConvNet layer representations for transfer learning instead of a single ConvNet layer representation. Overall, our primary pipeline has three steps. Firstly, images from target task are given as input to ConvNet, then that image will be feed-forwarded into pre-trained AlexNet, and the activation features from three fully connected convolutional layers are extracted. Secondly, activation features of three ConvNet layers are concatenated to obtain multiple ConvNet layers representation because it will gain more information about an image. When three fully connected layer features concatenated, the occurring image representation would have 9192 (4096+4096+1000) dimension features. However, features extracted from multiple ConvNet layers are redundant and noisy since they are extracted from the same ConvNet. Thus, a third step, we will use Principal Component Analysis (PCA) to select salient features before the training phase. When salient features are obtained, the classifier can classify image more accurately, and the performance of transfer learning can be improved. To evaluate proposed method, experiments are conducted in three standard datasets (Caltech-256, VOC07, and SUN397) to compare multiple ConvNet layer representations against single ConvNet layer representation by using PCA for feature selection and dimension reduction. Our experiments demonstrated the importance of feature selection for multiple ConvNet layer representation. Moreover, our proposed approach achieved 75.6% accuracy compared to 73.9% accuracy achieved by FC7 layer on the Caltech-256 dataset, 73.1% accuracy compared to 69.2% accuracy achieved by FC8 layer on the VOC07 dataset, 52.2% accuracy compared to 48.7% accuracy achieved by FC7 layer on the SUN397 dataset. We also showed that our proposed approach achieved superior performance, 2.8%, 2.1% and 3.1% accuracy improvement on Caltech-256, VOC07, and SUN397 dataset respectively compare to existing work.

Spatial Distribution of Aging District in Taejeon Metropolitan City (대전광역시 노령화 지구의 공간적 분포 패턴)

  • Jeong, Hwan-Yeong;Ko, Sang-Im
    • Journal of the Korean association of regional geographers
    • /
    • v.6 no.2
    • /
    • pp.1-19
    • /
    • 2000
  • This study is to investigate and analyze regional patterns of aging in Taejeon Metropolitan city-the overpopulated area of Choong-Cheong Province-by cohort analysis method. According to the population structure transition caused by rapid social and economic changes, Korea has made a rapid progress in population aging since 1970. This trend is so rapid that we should prepare for and cope with aging society. It is not only slow to cope with it in our society, but also there are few studies on population aging of the geographical field in Korea. The data of this study are the reports of Population and Housing Censuses in 1975 and 1985 and General Population and Housing Censuses with 10% sample survey in 1995 taken by National Statistical Office. The research method is to sample as the aging district the area with high aged population rate where the populations over 60 reside among total population during the years of 1975, 1985, 1995 and to sample the special districts of decreasing population where the population decreases very much and the special districts of increasing population in which the population increases greatly, presuming that the reason why aged population rate increases is that non-elderly population high in mobility moves out. It is then verified and ascertained whether it is true or not with cohort analysis method by age. Finally regional patterns in the city are found through the classification and modeling by type based on the aging district, the special districts of decreasing population, and the special districts of increasing population. The characteristics of the regional patterns show that there is social population transition and that non-elderly population moves out. The aging district with the high aged population rate is divided into high-level keeping-up type, relative falling type below the average of Taejeon city in aging progress, and relative rising type above the average of the city. This district can be found at both the central area of the city and the suburbs because Taejeon city has the characteristic of over-bounded city. But it cannot be found at the new built-up area with the in-migration of large population. The special districts of decreasing population where the population continues to decrease can be said to be the population doughnuts found at the CBD and its neighboring inner area. On the other hand, the special districts of increasing population where the population continues to increase are located at the new built-up area of the northern part in Taejeon city. The special districts of decreasing population are overlapping with the aging district and higher in aged population rate by the out-migration of non-elderly population. The special districts of increasing population are not overlapping with the aging district and lower in aged population rate by the in-migration of non-elderly population. To clarify the distribution map of the aging district, the special districts of decreasing and increasing population and the aging district are divided into four groups such as the special districts of decreasing population group-the same one as the aging district, the special districts of decreasing population group, the special districts of increasing population group, and the other district. With the cohort analysis method by age used to investigate the definite increase and decrease of aging population through population transition of each group, it is found that the progress of population aging is closely related to the social population fluctuation, especially that aged population rate is higher with the out-migration of non-elderly population. This is to explain each model of CBD, inner area, and the suburbs after modeling the aging district, the special districts of decreasing population, and the special districts of increasing population in Taejeon city. On the assumption that the city area is a concentric circle, it is possible to divide it into three areas such as CBD(A), the inner area(B), and the suburbs(C). The special districts of increasing and decreasing population in the city are divided into three districts-the special districts of decreasing population(a), the special districts of increasing population(b), and the others(c). The aging district of this city is divided into the aging district($\alpha$) and the others($\beta$). And then modeling these districts, it is probable to find regional patterns in the city. $Aa{\alpha}$ and $Ac{\beta}$ patterns are found in the CBD, in which $Aa{\alpha}$ is the special district of decreasing population and is higher in aged population rate because of aged population low in mobility staying behind and out-migration of non-elderly population. $Ba{\alpha}$, $Ba{\beta}$, $Bb{\beta}$, and $Bc{\beta}$ patterns are found in the inner area, in which neighboring area $Ba{\alpha}$ pattern is located. $Bb{\beta}$ pattern is located at the new developing area of newly built apartment complex. $Cb{\beta}$, $Cc{\alpha}$, and $Cc{\beta}$ patterns are found in the suburbs, among which $Cc{\alpha}$ pattern is highest in population aging. It is likely that the $Cc{\beta}$ under housing land readjustment on a large scale will be the $Cb{\beta}$ pattern. As analyzed above, marriage and out-migration of new family, non-elderly population, with house purchase are main factors in accelerating population aging in the central area of the city. Population aging is responsible for the great increase of aged population with longer life expectancy by the low death rate, the out-migration of non-elderly population, and the age group of new aged population in the suburbs. It is necessary to investigate and analyze the regional patterns of population aging at the time when population problems caused by aging as well as longer life expectancy are now on the increase. I hope that this will help the future study on population aging of the geographical field in Korea. As in the future population aging will be a major problem in our society, local autonomy should make a plan for the problem to the extent that population aging progresses by regional groups and inevitably prepare for it.

  • PDF

CLINICAL CHARACTERISTICS OF CHILD AND ADOLESCENT PSYCHIATRIC INPATIENTS WITH MOOD DISORDER (입원한 기분장애 소아청소년의 임상특성 - 주요 우울증과 양극성장애의 우울삽화 비교를 중심으로 -)

  • Cho, Su-Chul;Paik, Ki-Chung;Lee, Kyung-Kyu;Kim, Hyun-Woo;Hong, Kang-E;Lim, Myung-Ho
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.11 no.2
    • /
    • pp.209-220
    • /
    • 2000
  • The purpose of this study is to find out the characteristics of depressive episode about major depression and bipolar disorder in child and adolescent. The subjects of this study were 34 major depression patients and 17 bipolar disorder patients hospitalized at child and adolescent psychiatry in OO university children's hospital from 1st March 1993 to 31st October 1999. The method of this study is to review socio-demographic characteristics, diagnostic classification, chief problems and symptoms at admission, frequency of symptoms, maternal pregnancy problem history, childhood developmental history, coexisting psychiatric disorders, family psychopathology and family history and therapeutic response through their chart. 1) The ratio of male was higher than that of female in major depressive disorder while they are similar in manic episode, bipolar disorder. 2) Average onset age of bipolar disorder was 14 years 1 month and it was 12 years 8 months in the case of major depression As a result, average onset age of major depression is lower than that of bipolar disorder. 3) The patients complained of vegetative symptoms than somatic symptoms in both bipolar disorder and depressive disorder. Also, the cases of major depression developed more suicide idea symptom while the case of bipolar disorder developed more aggressive symptoms. In the respect of psychotic symptoms, delusion was more frequently shown in major depression, but halucination was more often shown in bipolar disorder. 4) Anxiety disorder coexisted most frequently in two groups. And there coexisted symptoms such as somartoform disorder, mental retardation and personality disorder in both cases. 5) The influence of family loading was remarkable in both cases. Above all, the development of major depression had to do with child abuse history and inappropriate care of family. It is apparent that there are distinctive differences between major depression and bipolar disorder in child and adolescent through the study, just as in adult cases. Therefore the differences of clinical characteristics between two disorders is founded in coexisting disorders and clinical symptoms including onset age, somatic symptoms and vegetative symptoms.

  • PDF

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

A Study on the Born Global Venture Corporation's Characteristics and Performance ('본글로벌(born global)전략'을 추구하는 벤처기업의 특성과 성과에 관한 연구)

  • Kim, Hyung-Jun;Jung, Duk-Hwa
    • Journal of Global Scholars of Marketing Science
    • /
    • v.17 no.3
    • /
    • pp.39-59
    • /
    • 2007
  • The international involvement of a firm has been described as a gradual development process "a process in which the enterprise gradually increases its international involvement in many studies. This process evolves in the interplay between the development of knowledge about foreign markets and operations on one hand and increasing commitment of resources to foreign markets on the other." On the basis of Uppsala internationalization model, many studies strengthen strong theoretical and empirical support. According to the predictions of the classic stages theory, the internationalization process of firms have been recognized and characterized gradual evolution to foreign markets, so called stage theory: indirect & direct export, strategic alliance and foreign direct investment. However, termed "international new ventures" (McDougall, Shane, and Oviatt 1994), "born globals" (Knight 1997; Knight and Cavusgil 1996; Madsen and Servais 1997), "instant internationals" (Preece, Miles, and Baetz 1999), or "global startups" (Oviatt and McDougall 1994) have been used and come into spotlight in internationalization study of technology intensity venture companies. Recent researches focused on venture company have suggested the phenomenons of 'born global' firms as a contradiction to the stages theory. Especially the article by Oviatt and McDougall threw the spotlight on international entrepreneurs, on international new ventures, and on their importance in the globalising world economy. Since venture companies have, by definition. lack of economies of scale, lack of resources (financial and knowledge), and aversion to risk taking, they have a difficulty in expanding their market to abroad and pursue internalization gradually and step by step. However many venture companies have pursued 'Born Global Strategy', which is different from process strategy, because corporate's environment has been rapidly changing to globalization. The existing studies investigate that (1) why the ventures enter into overseas market in those early stage, even in infancy, (2) what make the different international strategy among ventures and the born global strategy is better to the infant ventures. However, as for venture's performance(growth and profitability), the existing results do not correspond each other. They also, don't include marketing strategy (differentiation, low price, market breadth and market pioneer) that is important factors in studying of BGV's performance. In this paper I aim to delineate the appearance of international new ventures and the phenomenons of venture companies' internationalization strategy. In order to verify research problems, I develop a resource-based model and marketing strategies for analyzing the effects of the born global venture firms. In this paper, I suggested 3 research problems. First, do the korean venture companies take some advantages in the aspects of corporate's performances (growth, profitability and overall market performances) when they pursue internationalization from inception? Second, do the korean BGV have firm specific assets (foreign experiences, foreign orientation, organizational absorptive capacity)? Third, What are the marketing strategies of korean BGV and is it different from others? Under these problems, I test then (1) whether the BGV that a firm started its internationalization activity almost from inception, has more intangible resources(foreign experience of corporate members, foreign orientation, technological competences and absorptive capacity) than any other venture firms(Non_BGV) and (2) also whether the BGV's marketing strategies-differentiation, low price, market diversification and preemption strategy are different from Non_BGV. Above all, the main purpose of this research is that results achieved by BGV are indeed better than those obtained by Non_BGV firms with respect to firm's growth rate and efficiency. To do this research, I surveyed venture companies located in Seoul and Deajeon in Korea during November to December, 2005. I gather the data from 200 venture companies and then selected 84 samples, which have been founded during 1999${\sim}$2000. To compare BGV's characteristics with those of Non_BGV, I also had to classify BGV by export intensity over 50% among five or six aged venture firms. Many other researches tried to classify BGV and Non_BGV, but there were various criterion as many as researchers studied on this topic. Some of them use time gap, which is time difference of establishment and it's first internationalization experience and others use export intensity, ration of export sales amount divided by total sales amount. Although using a mixed criterion of prior research in my case, I do think this kinds of criterion is subjective and arbitrary rather than objective, so I do mention my research has some critical limitation in the classification of BGV and Non_BGV. The first purpose of research is the test of difference of performance between BGV and Non_BGV. As a result of t-test, the research show that there are statistically efficient difference not only in the growth rate (sales growth rate compared to competitors and 3 years averaged sales growth rate) but also in general market performance of BGV. But in case of profitability performance, the hypothesis that is BGV is more profit (return on investment(ROI) compared to competitors and 3 years averaged ROI) than Non-BGV was not supported. From these results, this paper concludes that BGV grows rapidly and gets a high market performance (in aspect of market share and customer loyalty) but there is no profitability difference between BGV and Non_BGV. The second result is that BGV have more absorptive capacity especially, knowledge competence, and entrepreneur's international experience than Non_BGV. And this paper also found BGV search for product differentiation, exemption strategy and market diversification strategy while Non_BGV search for low price strategy. These results have never been dealt with other existing studies. This research has some limitations. First limitation is concerned about the definition of BGV, as I mentioned above. Conceptually speaking, BGV is defined as company pursue internationalization from inception, but in empirical study, it's very difficult to classify between BGV and Non_BGV. I tried to classify on the basis of time difference and export intensity, this criterions are so subjective and arbitrary that the results are not robust if the criterion were changed. Second limitation is concerned about sample used in this research. I surveyed venture companies just located in Seoul and Daejeon and also use only 84 samples which more or less provoke sample bias problem and generalization of results. I think the more following studies that focus on ventures located in other region, the better to verify the results of this paper.

  • PDF

The way to make training data for deep learning model to recognize keywords in product catalog image at E-commerce (온라인 쇼핑몰에서 상품 설명 이미지 내의 키워드 인식을 위한 딥러닝 훈련 데이터 자동 생성 방안)

  • Kim, Kitae;Oh, Wonseok;Lim, Geunwon;Cha, Eunwoo;Shin, Minyoung;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.1-23
    • /
    • 2018
  • From the 21st century, various high-quality services have come up with the growth of the internet or 'Information and Communication Technologies'. Especially, the scale of E-commerce industry in which Amazon and E-bay are standing out is exploding in a large way. As E-commerce grows, Customers could get what they want to buy easily while comparing various products because more products have been registered at online shopping malls. However, a problem has arisen with the growth of E-commerce. As too many products have been registered, it has become difficult for customers to search what they really need in the flood of products. When customers search for desired products with a generalized keyword, too many products have come out as a result. On the contrary, few products have been searched if customers type in details of products because concrete product-attributes have been registered rarely. In this situation, recognizing texts in images automatically with a machine can be a solution. Because bulk of product details are written in catalogs as image format, most of product information are not searched with text inputs in the current text-based searching system. It means if information in images can be converted to text format, customers can search products with product-details, which make them shop more conveniently. There are various existing OCR(Optical Character Recognition) programs which can recognize texts in images. But existing OCR programs are hard to be applied to catalog because they have problems in recognizing texts in certain circumstances, like texts are not big enough or fonts are not consistent. Therefore, this research suggests the way to recognize keywords in catalog with the Deep Learning algorithm which is state of the art in image-recognition area from 2010s. Single Shot Multibox Detector(SSD), which is a credited model for object-detection performance, can be used with structures re-designed to take into account the difference of text from object. But there is an issue that SSD model needs a lot of labeled-train data to be trained, because of the characteristic of deep learning algorithms, that it should be trained by supervised-learning. To collect data, we can try labelling location and classification information to texts in catalog manually. But if data are collected manually, many problems would come up. Some keywords would be missed because human can make mistakes while labelling train data. And it becomes too time-consuming to collect train data considering the scale of data needed or costly if a lot of workers are hired to shorten the time. Furthermore, if some specific keywords are needed to be trained, searching images that have the words would be difficult, as well. To solve the data issue, this research developed a program which create train data automatically. This program can make images which have various keywords and pictures like catalog and save location-information of keywords at the same time. With this program, not only data can be collected efficiently, but also the performance of SSD model becomes better. The SSD model recorded 81.99% of recognition rate with 20,000 data created by the program. Moreover, this research had an efficiency test of SSD model according to data differences to analyze what feature of data exert influence upon the performance of recognizing texts in images. As a result, it is figured out that the number of labeled keywords, the addition of overlapped keyword label, the existence of keywords that is not labeled, the spaces among keywords and the differences of background images are related to the performance of SSD model. This test can lead performance improvement of SSD model or other text-recognizing machine based on deep learning algorithm with high-quality data. SSD model which is re-designed to recognize texts in images and the program developed for creating train data are expected to contribute to improvement of searching system in E-commerce. Suppliers can put less time to register keywords for products and customers can search products with product-details which is written on the catalog.

The Reserch on Actual Condition of Crime of Arson Which Occurs in Korea and Its Countermeasures (방화범죄의 실태와 그 대책 - 관심도와 동기의 다양화에 대한 대응 -)

  • Choi, Jong-Tae
    • Korean Security Journal
    • /
    • no.1
    • /
    • pp.371-408
    • /
    • 1997
  • This article is the reserch on actual condition of crime of arson which occurs in Korea and its countermeasures. The the presented problem in this article are that (1) we have generally very low rate concern about the crime of arson contrary to realistic problems of rapid increase of crime of arson (2) as such criminal motives became so diverse as to the economic or criminal purpose unlike characteristic and mental deficiency of old days, and to countermeasure these problems effectively it presentation the necessity of systemantic research. Based on analysis of reality of arson, the tendency of this arson in Korea in the ratio of increase is said to be higher than those in violence crime or general fire rate. and further its rate is far more greater than those of the U.S.A. and Japan. Arson is considered to be a method of using fire as crime and in case of presently residence to be the abject, it is a public offense crime which aqccompany fatality in human life. This is the well It now fact to all of us. And further in order to presentation to the crime of arson, strictness of criminal law (criminal law No, 164 and 169, and fire protection law No. 110 and 111) and classification of arsonist as felony are institutionary reinforced to punish with certainty of possibility, Therefore, as tendency of arson has been increased compared to other nations, it is necessary to supplement strategical policy to bring out overall concerns of the seriousness of risk and damage of arson, which have been resulted from the lack of understanding. In characteristics analysis of crime of arson, (1) It is now reveald that, in the past such crime rate appeared far more within the boundary of town or city areas in the past, presently increased rate of arsons in rural areas are far more than in the town or small city areas, thereby showing characteristics of crime of arson extending nation wide. (2) general timetable of arson shows that night more than day time rate, and reveald that is trait behavior in secrecy.(3) arsonists are usually arrested at site or by victim or report of third person(82,9%).Investigation activities or self surrenders rate only 11.2%. The time span of arrest is normally the same day of arson and at times it takes more than one year to arrest. This reveals its necessity to prepare for long period of time for arrest, (4) age rate of arson is in their thirties mostly as compared to homicide, robbery and adultery, and considerable numbers of arsons are in old age of over fifties. It reveals age rate is increased (5) Over half of the arsonists are below the junior high school (6) the rate of convicts by thier records is based on first offenders primarily and secondly more than 4 time convicts. This apparently shows necessity of effective correctional education policy for their social assimilation together with re-investigation of human education at the primary and secondary education system in thier life. The examples of motivation for arosnits, such as personal animosity, fury, monetary swindle, luscious purpose and other aims of destroying of proof, and other social resistance, violence including ways of threatening, beside the motives of individual defects, are diverse and arsonic suicide and specifically suicidal accompany together keenly manifested. When we take this fact with the criminal theory, it really reveals arsons of crime are increasing and its casualities are serious and a point as a way of suicide is the anomie theory of Durkheim and comensurate with the theory of that of Merton, Specifically in the arson of industrial complex, it is revealed that one with revolutionary motive or revolting motive would do the arsonic act. For the policy of prevention of arsons, professional research work in organizational cooperation for preventive activities is conducted in municipal or city wise functions in the name of Parson Taskforces and beside a variety of research institutes in federal government have been operating effectively to countermeasure in many fields of research. Franch and Sweden beside the U.S. set up a overall operation of fire prevention research funtions and have obtained very successful result. Japan also put their research likewise for countermeasure. In this research as a way of preventive fire policy, first, it is necessary to accomodate the legal preventitive activities for fire prevention in judicial side and as an administrative side, (1) precise statistic management of crime of arson (2) establishment of professional research functions or a corporate (3) improvement of system for cooperative structural team for investigation of fires and menpower organization of professional members. Secondly, social mentality in individual prospect, recognition of fires by arson and youth education of such effect, educational program for development and practical promotion. Thirdly, in view of environmental side, the ways of actual performance by programming with the establishment of cooperative advancement in local social function elements with administrative office, habitants, school facilities and newspapers measures (2) establishment of personal protection where weak menpowers are displayed in special fire prevention measures. These measures are presented for prevention of crime of arson. The control of crime and prevention shall be prepared as a means of self defence by the principle of self responsibility Specifically arsonists usually aims at the comparatively weak control of fire prevention is prevalent and it is therefore necessary to prepare individual facilities with their spontaneous management of fire prevention instead of public municipal funtures of local geverment. As Clifford L. Karchmer asserted instead of concerns about who would commit arson, what portion of area would be the target of the arson. It is effective to minister spontaveously the fire prevention measure in his facility with the consideration of characteristics of arson. On the other hand, it is necessary for the concerned personnel of local goverment and groups to distribute to the local society in timely manner for new information about the fire prevention, thus contribute to effective result of fire prevention result. In consideration of these factors, it is inevitable to never let coincide with the phemonemon of arsons in similar or mimic features as recognized that these could prevail just an epedemic as a strong imitational attitude. In processing of policy to encounter these problems, it is necessary to place priority of city policy to enhancement of overall concerns toward the definitive essense of crime of arson.

  • PDF