• Title/Summary/Keyword: 처리

Search Result 101,827, Processing Time 0.107 seconds

A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)

  • Yang, Yunseok;Lee, Hyun Jun;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.25-38
    • /
    • 2019
  • Selecting high-quality information that meets the interests and needs of users among the overflowing contents is becoming more important as the generation continues. In the flood of information, efforts to reflect the intention of the user in the search result better are being tried, rather than recognizing the information request as a simple string. Also, large IT companies such as Google and Microsoft focus on developing knowledge-based technologies including search engines which provide users with satisfaction and convenience. Especially, the finance is one of the fields expected to have the usefulness and potential of text data analysis because it's constantly generating new information, and the earlier the information is, the more valuable it is. Automatic knowledge extraction can be effective in areas where information flow is vast, such as financial sector, and new information continues to emerge. However, there are several practical difficulties faced by automatic knowledge extraction. First, there are difficulties in making corpus from different fields with same algorithm, and it is difficult to extract good quality triple. Second, it becomes more difficult to produce labeled text data by people if the extent and scope of knowledge increases and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, problem definition for automatic knowledge extraction is not easy because of ambiguous conceptual characteristics of knowledge. So, in order to overcome limits described above and improve the semantic performance of stock-related information searching, this study attempts to extract the knowledge entity by using neural tensor network and evaluate the performance of them. Different from other references, the purpose of this study is to extract knowledge entity which is related to individual stock items. Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous researches and to enhance the effectiveness of the model. From these processes, this study has the following three significances. First, A practical and simple automatic knowledge extraction method that can be applied. Second, the possibility of performance evaluation is presented through simple problem definition. Finally, the expressiveness of the knowledge increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and objective performance evaluation method are also presented. The empirical study to confirm the usefulness of the presented model, experts' reports about individual 30 stocks which are top 30 items based on frequency of publication from May 30, 2017 to May 21, 2018 are used. the total number of reports are 5,600, and 3,074 reports, which accounts about 55% of the total, is designated as a training set, and other 45% of reports are designated as a testing set. Before constructing the model, all reports of a training set are classified by stocks, and their entities are extracted using named entity recognition tool which is the KKMA. for each stocks, top 100 entities based on appearance frequency are selected, and become vectorized using one-hot encoding. After that, by using neural tensor network, the same number of score functions as stocks are trained. Thus, if a new entity from a testing set appears, we can try to calculate the score by putting it into every single score function, and the stock of the function with the highest score is predicted as the related item with the entity. To evaluate presented models, we confirm prediction power and determining whether the score functions are well constructed by calculating hit ratio for all reports of testing set. As a result of the empirical study, the presented model shows 69.3% hit accuracy for testing set which consists of 2,526 reports. this hit ratio is meaningfully high despite of some constraints for conducting research. Looking at the prediction performance of the model for each stocks, only 3 stocks, which are LG ELECTRONICS, KiaMtr, and Mando, show extremely low performance than average. this result maybe due to the interference effect with other similar items and generation of new knowledge. In this paper, we propose a methodology to find out key entities or their combinations which are necessary to search related information in accordance with the user's investment intention. Graph data is generated by using only the named entity recognition tool and applied to the neural tensor network without learning corpus or word vectors for the field. From the empirical test, we confirm the effectiveness of the presented model as described above. However, there also exist some limits and things to complement. Representatively, the phenomenon that the model performance is especially bad for only some stocks shows the need for further researches. Finally, through the empirical study, we confirmed that the learning method presented in this study can be used for the purpose of matching the new text information semantically with the related stocks.

Individual Thinking Style leads its Emotional Perception: Development of Web-style Design Evaluation Model and Recommendation Algorithm Depending on Consumer Regulatory Focus (사고가 시각을 바꾼다: 조절 초점에 따른 소비자 감성 기반 웹 스타일 평가 모형 및 추천 알고리즘 개발)

  • Kim, Keon-Woo;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.171-196
    • /
    • 2018
  • With the development of the web, two-way communication and evaluation became possible and marketing paradigms shifted. In order to meet the needs of consumers, web design trends are continuously responding to consumer feedback. As the web becomes more and more important, both academics and businesses are studying consumer emotions and satisfaction on the web. However, some consumer characteristics are not well considered. Demographic characteristics such as age and sex have been studied extensively, but few studies consider psychological characteristics such as regulatory focus (i.e., emotional regulation). In this study, we analyze the effect of web style on consumer emotion. Many studies analyze the relationship between the web and regulatory focus, but most concentrate on the purpose of web use, particularly motivation and information search, rather than on web style and design. The web communicates with users through visual elements. Because the human brain is influenced by all five senses, both design factors and emotional responses are important in the web environment. Therefore, in this study, we examine the relationship between consumer emotion and satisfaction and web style and design. Previous studies have considered the effects of web layout, structure, and color on emotions. In this study, however, we excluded these web components, in contrast to earlier studies, and analyzed the relationship between consumer satisfaction and emotional indexes of web-style only. To perform this analysis, we collected consumer surveys presenting 40 web style themes to 204 consumers. Each consumer evaluated four themes. The emotional adjectives evaluated by consumers were composed of 18 contrast pairs, and the upper emotional indexes were extracted through factor analysis. The emotional indexes were 'softness,' 'modernity,' 'clearness,' and 'jam.' Hypotheses were established based on the assumption that emotional indexes have different effects on consumer satisfaction. After the analysis, hypotheses 1, 2, and 3 were accepted and hypothesis 4 was rejected. While hypothesis 4 was rejected, its effect on consumer satisfaction was negative, not positive. This means that emotional indexes such as 'softness,' 'modernity,' and 'clearness' have a positive effect on consumer satisfaction. In other words, consumers prefer emotions that are soft, emotional, natural, rounded, dynamic, modern, elaborate, unique, bright, pure, and clear. 'Jam' has a negative effect on consumer satisfaction. It means, consumer prefer the emotion which is empty, plain, and simple. Regulatory focus shows differences in motivation and propensity in various domains. It is important to consider organizational behavior and decision making according to the regulatory focus tendency, and it affects not only political, cultural, ethical judgments and behavior but also broad psychological problems. Regulatory focus also differs from emotional response. Promotion focus responds more strongly to positive emotional responses. On the other hand, prevention focus has a strong response to negative emotions. Web style is a type of service, and consumer satisfaction is affected not only by cognitive evaluation but also by emotion. This emotional response depends on whether the consumer will benefit or harm himself. Therefore, it is necessary to confirm the difference of the consumer's emotional response according to the regulatory focus which is one of the characteristics and viewpoint of the consumers about the web style. After MMR analysis result, hypothesis 5.3 was accepted, and hypothesis 5.4 was rejected. But hypothesis 5.4 supported in the opposite direction to the hypothesis. After validation, we confirmed the mechanism of emotional response according to the tendency of regulatory focus. Using the results, we developed the structure of web-style recommendation system and recommend methods through regulatory focus. We classified the regulatory focus group in to three categories that promotion, grey, prevention. Then, we suggest web-style recommend method along the group. If we further develop this study, we expect that the existing regulatory focus theory can be extended not only to the motivational part but also to the emotional behavioral response according to the regulatory focus tendency. Moreover, we believe that it is possible to recommend web-style according to regulatory focus and emotional desire which consumers most prefer.

A Study on the Funerary Mean of the Vertical Plate Armour from the 4th Century - Mainly Based on the Burial Patterns Shown by the Ancient Tombs No.164 and No.165 in Bokcheon-dong - (종장판갑(縱長板甲) 부장의 다양성과 의미 - 부산 복천동 164·165호분 출토 자료를 중심으로 -)

  • Lee, Yu Jin
    • Korean Journal of Heritage: History & Science
    • /
    • v.44 no.3
    • /
    • pp.178-199
    • /
    • 2011
  • The ancient tombs found in Bokcheon-dong, Busan originate from the time between the $4^{th}$ and $5^{th}$ centuries, the period of the Three Nations. They are known as the tombs where the Vertical Plate Armour was mainly buried. In 2006, two units of the Vertical Plate Armour were additionally investigated in the tombs No.164 and No.165 which had been constructed at the end of the eastern slope near the hill of the group of ancient tombs in Bokcheon-dong. Throughout this study, the contents of the two units of the Vertical Plate Armour, whose preservation process has been completed, have been arranged, while the group of constructed ancient tombs in Bokcheon-dong from the $4^{th}$ century has been observed through the consideration of the burial pattern. The units of the Vertical Plate Armour from the tombs No.164 and No.165 can be classified as the IIa-typed armor showing the Gyeongju and Ulsan patterns, according to the attribute of the manufacturing technology. Also, they can be chronologically recorded as those from the early period of Stage II among the three stages regarding the chronological recording of the Vertical Plate Armour. While more than two units of the Vertical Plate Armour were buried in the largesized tomb on the top of the hill of the group of ancient tombs, one unit of the Vertical Plate Armour was buried in the small-sized tomb. By considering such a trend, it can be said that in the stage of burying the armor showing the Gyeongju and Ulsan patterns (I-type and IIa-type), different units of the Vertical Plate Armour were buried according to the size of the tomb. However, as the armor showing the Busan pattern (IIb-type) was settled, only one unit was buried. Meanwhile, the tombs No.164 and No.165 can be included in the wooden chamber tomb showing the Gyeongju pattern, which is a slender rectangular wooden chamber tomb with the aspect ratio of more than 1:3. However, according to the trend shown by the buried earthenware, it can be said that there seem to be common types and patterns shared with the earthenware which has been found in the area of Gimhae and is called the one showing the Geumgwan Gaya pattern. In other words, there seem to be close relationships between the subject tombs and the tomb No.3 in Gujeong-dong and the tomb No.55 in Sara-ri, Gyeongju, regarding the types of armor and tombs and the arrangement of buried artifacts. However, the buried earthenware shows a relationship with the areas of Busan and Gimhae. By considering the combined trend of the Gyeongju and Gimhae elements found in one tomb, it is possible to assume that the group of constructed ancient tombs in Bokcheon-dong used to be actively related with both areas. It has been thought that the Vertical Plate Armour used to be the exclusive property of the upper hierarchy until now, since it was buried in the large-sized tomb located on the top of the hill of the group of ancient tombs in Bokcheondong. However, as shown in case of the tombs No.164 and No.165, it has been verified that the Vertical Plate Armour was also buried in the small-sized tomb in terms of such factors as locations, sizes, the amount of buried artifacts and the qualitative aspect. Therefore, it is impossible to discuss the hierarchical characteristic of the tomb just based on the buried units of the Vertical Plate Armour. Also, it is difficult to assume that armor used to symbolize the domination of the military forces. The hierarchical characteristic of the group of constructed ancient tombs in Bokcheon-dong from the $4^{th}$ century can be verified according to the location and size of each tomb. As are sult, the re seem to be some differences regarding the buried units of the vertical plate armour. However, it would be necessary to carry out amore multilateral examination in order to find out whether the burial of the vertical plate armour could be regarded as the artifact which symbolizes the status or class of the deceased.

Query-based Answer Extraction using Korean Dependency Parsing (의존 구문 분석을 이용한 질의 기반 정답 추출)

  • Lee, Dokyoung;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.161-177
    • /
    • 2019
  • In this paper, we study the performance improvement of the answer extraction in Question-Answering system by using sentence dependency parsing result. The Question-Answering (QA) system consists of query analysis, which is a method of analyzing the user's query, and answer extraction, which is a method to extract appropriate answers in the document. And various studies have been conducted on two methods. In order to improve the performance of answer extraction, it is necessary to accurately reflect the grammatical information of sentences. In Korean, because word order structure is free and omission of sentence components is frequent, dependency parsing is a good way to analyze Korean syntax. Therefore, in this study, we improved the performance of the answer extraction by adding the features generated by dependency parsing analysis to the inputs of the answer extraction model (Bidirectional LSTM-CRF). The process of generating the dependency graph embedding consists of the steps of generating the dependency graph from the dependency parsing result and learning the embedding of the graph. In this study, we compared the performance of the answer extraction model when inputting basic word features generated without the dependency parsing and the performance of the model when inputting the addition of the Eojeol tag feature and dependency graph embedding feature. Since dependency parsing is performed on a basic unit of an Eojeol, which is a component of sentences separated by a space, the tag information of the Eojeol can be obtained as a result of the dependency parsing. The Eojeol tag feature means the tag information of the Eojeol. The process of generating the dependency graph embedding consists of the steps of generating the dependency graph from the dependency parsing result and learning the embedding of the graph. From the dependency parsing result, a graph is generated from the Eojeol to the node, the dependency between the Eojeol to the edge, and the Eojeol tag to the node label. In this process, an undirected graph is generated or a directed graph is generated according to whether or not the dependency relation direction is considered. To obtain the embedding of the graph, we used Graph2Vec, which is a method of finding the embedding of the graph by the subgraphs constituting a graph. We can specify the maximum path length between nodes in the process of finding subgraphs of a graph. If the maximum path length between nodes is 1, graph embedding is generated only by direct dependency between Eojeol, and graph embedding is generated including indirect dependencies as the maximum path length between nodes becomes larger. In the experiment, the maximum path length between nodes is adjusted differently from 1 to 3 depending on whether direction of dependency is considered or not, and the performance of answer extraction is measured. Experimental results show that both Eojeol tag feature and dependency graph embedding feature improve the performance of answer extraction. In particular, considering the direction of the dependency relation and extracting the dependency graph generated with the maximum path length of 1 in the subgraph extraction process in Graph2Vec as the input of the model, the highest answer extraction performance was shown. As a result of these experiments, we concluded that it is better to take into account the direction of dependence and to consider only the direct connection rather than the indirect dependence between the words. The significance of this study is as follows. First, we improved the performance of answer extraction by adding features using dependency parsing results, taking into account the characteristics of Korean, which is free of word order structure and omission of sentence components. Second, we generated feature of dependency parsing result by learning - based graph embedding method without defining the pattern of dependency between Eojeol. Future research directions are as follows. In this study, the features generated as a result of the dependency parsing are applied only to the answer extraction model in order to grasp the meaning. However, in the future, if the performance is confirmed by applying the features to various natural language processing models such as sentiment analysis or name entity recognition, the validity of the features can be verified more accurately.

A Study on the Spatial Structure of Eupchi(邑治) and Landscape Architecture of Provincial Government Office(地方官衙) in the Late Joseon Dynasty through 'Sukchunjeahdo(宿踐諸衙圖)' - Focused on the Youngyuhyun Pyeongan Province and Sincheongun Hwanghae Province - (『숙천제아도(宿踐諸衙圖)』를 통해 본 조선시대 읍치(邑治)의 공간구조와 관아(官衙) 조경 - 평안도 영유현과 황해도 신천군을 중심으로 -)

  • Shin, Sang sup;Lee, Seung yoen
    • Korean Journal of Heritage: History & Science
    • /
    • v.49 no.2
    • /
    • pp.86-103
    • /
    • 2016
  • 'Sukchunjeahdo' illustration-book, which was left by Han, Pil-gyo(韓弼敎 : 1807~1878)in the late Joseon Dynasty, includes pictorial record paintings containing government offices, Eupchi, and Feng Shui condition drawn by Gyehwa(界畵) method Sabangjeondomyobeop(四方顚倒描法) and is the rare historical material that help to understand spatial structure and landscape characteristics. Youngyuhyun(永柔縣) and Sincheongun(信川郡) town, the case sites of this study, show Feng Shui foundation structure and placement rules of government offices in the Joseon Period are applied such as 3Dan 1Myo(三壇一廟 : Sajikdan, Yeodan, Seonghwangdan, Hyanggyo), 3Mun 3Jo(三門三朝 : Oeah, Dongheon, Naeah) and Jeonjohuchim(前朝後寢) etc. by setting the upper and lower hierarchy of the north south central axis. The circulation system is the pattern that roads are segmented around the marketplace of the entrance of the town and the structure is that heading to the north along the internal way leads to the government office and going out to the main street leads to the major city. Baesanimsu(背山臨水 : Mountain in backward and water in front) foundation, back hill pine forest, intentionally created low mountains and town forest etc. showed landscape aesthetics well suited for the environmental comfort condition such as microclimate control, natural disaster prevention, psychological stability reflecting color constancy principle etc. and tower pavilions were built throughout the scenic spot, reflecting life philosophy and thoughts of contemporaries such as physical and mental discipline, satisfied at the reality of poverty, returning to nature etc. For government office landscape, shielding and buffer planting, landscape planting etc. were considered around Gaeksa(客舍), Dongheon(東軒), Naeah(內衙) backyard and deciduous tree s and flowering trees were cultivated as main species and in case of Gaeksa, tiled pavilions and pavilions topped with poke weed in tetragonal pond were introduced to Dongheon and Naeah and separate pavilions were built for the purpose of physical and mental discipline and military training such as archery. Back hill pine tree forest formed back landscape and zelkova, pear trees, willow trees, old pine trees, lotus, flowering trees etc. were cultivated as gardening trees and Feng-Shui forest with willow trees as its main species was created for landscape and practical purposes. On the other hand, various cultural landscape elements etc. were introduced such as pavilions, pond serving as fire protection water(square and circle), stone pagoda and stone Buddha, fountains and wells, monument houses, flagpoles etc. In case of Sincheongun town forest(邑藪), Manhagwan(挽河觀), Moonmujeong(文武井), Sangjangdae(上場岱) and Hajangdae(下場岱) Market place, Josanshup<(造山藪 : Dongseojanglim(東西長林)>, Namcheon(南川) etc. were combined and community cultural park with the nature of modern urban park was operated. In this context, government office landscape shows the garden management aspect where square pond and pavilions, flowering trees are harmonized around side pavilion and backyard. Also, environmental design technique not biased to aesthetics and ideological moral philosophy and comprehensively considering functionality (shielding and fire prevention, microclimate control, etc.) and environmental soundness etc. is working.

Expanded Uses and Trend of Domestic and International Research of Rose of Sharon(Hibiscus syriacus L.) as Korean National Flower since the Protection of New Plant Variety (식물신품종보호제도 이후 나라꽃 무궁화의 국내외 연구동향 및 확대 이용 방안)

  • Kang, Ho Chul;Kim, Dong Yeob;Wang, Yae Ga;Ha, Yoo Mi
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.47 no.5
    • /
    • pp.49-65
    • /
    • 2019
  • This study was carried out to investigate the domestic and international development of a new cultivar of the Rose of Sharon (Hibiscus syriacus L.), the Korean national flower, and the protection of the new plant variety. In addition, it will be used as basic data for the expansion of domestic distribution, promoting oversea export, and expanding the range of landscape architectural use. A total of 97 varieties received plant variety protection rights from the Korea Seed & Variety Service from 2004 to 2018. The selection criteria were plants having unique flowers, growth habits, and variegated leaves. Some cultivars with unique features, such as flower size, shape, and red eyes were available for focus planting. Plant varieties with tall and strong growth patterns have been highly valuable for street and focus planting. Cultivars with dwarf stems and compact branches are utilized for pot planting and bonsai. The protected cultivars were mostly single flower varieties, with two semi-double flowers. There were 57 cultivars of pink flowers with red eyes and 21 cultivars of white flowers with red eyes. There were 61 cultivars developed by crossing, 23 cultivars through interspecific hybridization and 7 cultivars developed through radiation treatment and mutation. The Hibiscus cultivars registered to the United States Patent and Trademark Office (USPTO) consisted of seven cultivars each from the United States, the United Kingdom, and the Netherlands, four from South Korea, and three from Belgium. The Hibiscus cultivars registered to the European Community Plant Variety Office (CPVO) consisted of 16 cultivars from France, 9 from the Netherlands, 5 from the UK and 1 from Belgium. The cultivars that received both plant patent and plant breeder rights in the United States and Canada were 'America Irene Scott', 'Antong Two', 'CARPA', 'DVPazurri', 'Gandini Santiago', 'Gandini van Aart', 'ILVO347', 'ILVOPS', 'JWNWOOD 4', 'Notwood3', 'RWOODS5', 'SHIMCR1', 'SHIMRR38', 'SHIMRV24', and 'THEISSHSSTL'. 'SHIMCR1' and 'SHIMRV24' acquired both domestic plant protection rights and overseas plant patents. The 14 cultivars that received both US plant patents and European protection rights were 'America Irene Scott', 'Bricutts', 'DVPAZURRI', 'Gandini Santiago', 'Gandini van Aart', 'JWNWOOD4', 'MINDOUB1', 'MINDOUR1', 'MINDOUV5', 'NOTWOOD3', 'RWOODS5', 'RWOODS6', 'Summer Holiday', and 'Summer Night'. The cultivars that obtained US patents consisted of 18 cultivars (52.9%) with double flowers, 4 cultivars (11.8%) with semi-double flowers, and 12 cultivars (35.3%) with single flowers. The cultivars that obtained European new variety protection rights, consisted of 11 cultivars (34.3%) with double flowers, 12 cultivars (21.9%) with semi-double flowers, and 14 cultivars (43.8%) with single flowers. In the future, new cultivars of H. syriacus need to be developed in order to expand domestic distribution and export abroad. In addition, when developing new cultivars, it is required to develop cultivars with shorter branches for use in flower beds, borders, hedges, and pot planting.

An Examination into the Illegal Trade of Cultural Properties (문화재(文化財)의 국제적 불법 거래(不法 去來)에 관한 고찰)

  • Cho, Boo-Keun
    • Korean Journal of Heritage: History & Science
    • /
    • v.37
    • /
    • pp.371-405
    • /
    • 2004
  • International circulation of cultural assets involves numerous countries thereby making an approach based on international law essential to resolving this problem. Since the end of the $2^{nd}$ World War, as the value of cultural assets evolved from material value to moral and ethical values, with emphasis on establishing national identities, newly independent nations and former colonial states took issue with ownership of cultural assets which led to the need for international cooperation and statutory provisions for the return of cultural assets. UNESCO's 1954 "Convention for the Protection of Cultural Property in the Event of Armed Conflict" as preparatory measures for the protection of cultural assets, the 1970 "Convention on the Means of Prohibiting and Preventing the Illicit Import and Transfer of Ownership of Cultural Property" to regulate transfer of cultural assets, and the 1995 "Unidroit Convention on Stolen or Illegally Exported Cultural Objects" which required the return of illegally acquired cultural property are examples of international agreements established on illegal transfers of cultural assets. In addition, the UN agency UNESCO established the Division of Cultural Heritage to oversee cultural assets related matters, and the UN since its 1973 resolution 3187, has continued to demonstrate interest in protection of cultural assets. The resolution 3187 affirms the return of cultural assets to the country of origin, advises on preventing illegal transfers of works of art and cultural assets, advises cataloguing cultural assets within the respective countries and, conclusively, recommends becoming a member of UNESCO, composing a forum for international cooperation. Differences in defining cultural assets pose a limitation on international agreements. While the 1954 Convention states that cultural assets are not limited to movable property and includes immovable property, the 1970 Convention's objective of 'Prohibiting and preventing the illicit import, export and transfer of ownership of cultural property' effectively limits the subject to tangible movable cultural property. The 1995 Convention also has tangible movable cultural property as its subject. On this point, the two conventions demonstrate distinction from the 1954 Convention and the 1972 Convention that focuses on immovable cultural property and natural property. The disparity in defining cultural property is due to the object and purpose of the convention and does not reflect an inherent divergence. In the case of Korea, beginning with the 1866 French invasion, 36 years of Japanese colonial rule, military rule and period of economic development caused outflow of numerous cultural assets to foreign countries. Of course, it is neither possible nor necessary to have all of these cultural properties returned, but among those that have significant value in establishing cultural and historical identity or those that have been taken symbolically as a demonstration of occupational rule can cause issues in their return. In these cases, the 1954 Convention and the ratification of the first legislation must be actively considered. In the return of cultural property, if the illicit acquisition is the core issue, it is a simple matter of following the international accords, while if it rises to the level of diplomatic discussions, it will become a political issue. In that case, the country requesting the return must convince the counterpart country. Realizing a response to the earnest need for preventing illicit trading of cultural assets will require extensive national and civic societal efforts in the East Asian area to overcome its current deficiencies. The most effective way to prevent illicit trading of cultural property is rapid circulation of information between Interpol member countries, which will require development of an internet based communication system as well as more effective deployment of legislation to prevent trading of illicitly acquired cultural property, subscription to international conventions and cataloguing collections.

An Analytical Approach Using Topic Mining for Improving the Service Quality of Hotels (호텔 산업의 서비스 품질 향상을 위한 토픽 마이닝 기반 분석 방법)

  • Moon, Hyun Sil;Sung, David;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.21-41
    • /
    • 2019
  • Thanks to the rapid development of information technologies, the data available on Internet have grown rapidly. In this era of big data, many studies have attempted to offer insights and express the effects of data analysis. In the tourism and hospitality industry, many firms and studies in the era of big data have paid attention to online reviews on social media because of their large influence over customers. As tourism is an information-intensive industry, the effect of these information networks on social media platforms is more remarkable compared to any other types of media. However, there are some limitations to the improvements in service quality that can be made based on opinions on social media platforms. Users on social media platforms represent their opinions as text, images, and so on. Raw data sets from these reviews are unstructured. Moreover, these data sets are too big to extract new information and hidden knowledge by human competences. To use them for business intelligence and analytics applications, proper big data techniques like Natural Language Processing and data mining techniques are needed. This study suggests an analytical approach to directly yield insights from these reviews to improve the service quality of hotels. Our proposed approach consists of topic mining to extract topics contained in the reviews and the decision tree modeling to explain the relationship between topics and ratings. Topic mining refers to a method for finding a group of words from a collection of documents that represents a document. Among several topic mining methods, we adopted the Latent Dirichlet Allocation algorithm, which is considered as the most universal algorithm. However, LDA is not enough to find insights that can improve service quality because it cannot find the relationship between topics and ratings. To overcome this limitation, we also use the Classification and Regression Tree method, which is a kind of decision tree technique. Through the CART method, we can find what topics are related to positive or negative ratings of a hotel and visualize the results. Therefore, this study aims to investigate the representation of an analytical approach for the improvement of hotel service quality from unstructured review data sets. Through experiments for four hotels in Hong Kong, we can find the strengths and weaknesses of services for each hotel and suggest improvements to aid in customer satisfaction. Especially from positive reviews, we find what these hotels should maintain for service quality. For example, compared with the other hotels, a hotel has a good location and room condition which are extracted from positive reviews for it. In contrast, we also find what they should modify in their services from negative reviews. For example, a hotel should improve room condition related to soundproof. These results mean that our approach is useful in finding some insights for the service quality of hotels. That is, from the enormous size of review data, our approach can provide practical suggestions for hotel managers to improve their service quality. In the past, studies for improving service quality relied on surveys or interviews of customers. However, these methods are often costly and time consuming and the results may be biased by biased sampling or untrustworthy answers. The proposed approach directly obtains honest feedback from customers' online reviews and draws some insights through a type of big data analysis. So it will be a more useful tool to overcome the limitations of surveys or interviews. Moreover, our approach easily obtains the service quality information of other hotels or services in the tourism industry because it needs only open online reviews and ratings as input data. Furthermore, the performance of our approach will be better if other structured and unstructured data sources are added.

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.57-73
    • /
    • 2021
  • Maintenance and prevention of failure through anomaly detection of ICT infrastructure is becoming important. System monitoring data is multidimensional time series data. When we deal with multidimensional time series data, we have difficulty in considering both characteristics of multidimensional data and characteristics of time series data. When dealing with multidimensional data, correlation between variables should be considered. Existing methods such as probability and linear base, distance base, etc. are degraded due to limitations called the curse of dimensions. In addition, time series data is preprocessed by applying sliding window technique and time series decomposition for self-correlation analysis. These techniques are the cause of increasing the dimension of data, so it is necessary to supplement them. The anomaly detection field is an old research field, and statistical methods and regression analysis were used in the early days. Currently, there are active studies to apply machine learning and artificial neural network technology to this field. Statistically based methods are difficult to apply when data is non-homogeneous, and do not detect local outliers well. The regression analysis method compares the predictive value and the actual value after learning the regression formula based on the parametric statistics and it detects abnormality. Anomaly detection using regression analysis has the disadvantage that the performance is lowered when the model is not solid and the noise or outliers of the data are included. There is a restriction that learning data with noise or outliers should be used. The autoencoder using artificial neural networks is learned to output as similar as possible to input data. It has many advantages compared to existing probability and linear model, cluster analysis, and map learning. It can be applied to data that does not satisfy probability distribution or linear assumption. In addition, it is possible to learn non-mapping without label data for teaching. However, there is a limitation of local outlier identification of multidimensional data in anomaly detection, and there is a problem that the dimension of data is greatly increased due to the characteristics of time series data. In this study, we propose a CMAE (Conditional Multimodal Autoencoder) that enhances the performance of anomaly detection by considering local outliers and time series characteristics. First, we applied Multimodal Autoencoder (MAE) to improve the limitations of local outlier identification of multidimensional data. Multimodals are commonly used to learn different types of inputs, such as voice and image. The different modal shares the bottleneck effect of Autoencoder and it learns correlation. In addition, CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of data. In general, conditional input mainly uses category variables, but in this study, time was used as a condition to learn periodicity. The CMAE model proposed in this paper was verified by comparing with the Unimodal Autoencoder (UAE) and Multi-modal Autoencoder (MAE). The restoration performance of Autoencoder for 41 variables was confirmed in the proposed model and the comparison model. The restoration performance is different by variables, and the restoration is normally well operated because the loss value is small for Memory, Disk, and Network modals in all three Autoencoder models. The process modal did not show a significant difference in all three models, and the CPU modal showed excellent performance in CMAE. ROC curve was prepared for the evaluation of anomaly detection performance in the proposed model and the comparison model, and AUC, accuracy, precision, recall, and F1-score were compared. In all indicators, the performance was shown in the order of CMAE, MAE, and AE. Especially, the reproduction rate was 0.9828 for CMAE, which can be confirmed to detect almost most of the abnormalities. The accuracy of the model was also improved and 87.12%, and the F1-score was 0.8883, which is considered to be suitable for anomaly detection. In practical aspect, the proposed model has an additional advantage in addition to performance improvement. The use of techniques such as time series decomposition and sliding windows has the disadvantage of managing unnecessary procedures; and their dimensional increase can cause a decrease in the computational speed in inference.The proposed model has characteristics that are easy to apply to practical tasks such as inference speed and model management.

SANET-CC : Zone IP Allocation Protocol for Offshore Networks (SANET-CC : 해상 네트워크를 위한 구역 IP 할당 프로토콜)

  • Bae, Kyoung Yul;Cho, Moon Ki
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.87-109
    • /
    • 2020
  • Currently, thanks to the major stride made in developing wired and wireless communication technology, a variety of IT services are available on land. This trend is leading to an increasing demand for IT services to vessels on the water as well. And it is expected that the request for various IT services such as two-way digital data transmission, Web, APP, etc. is on the rise to the extent that they are available on land. However, while a high-speed information communication network is easily accessible on land because it is based upon a fixed infrastructure like an AP and a base station, it is not the case on the water. As a result, a radio communication network-based voice communication service is usually used at sea. To solve this problem, an additional frequency for digital data exchange was allocated, and a ship ad-hoc network (SANET) was proposed that can be utilized by using this frequency. Instead of satellite communication that costs a lot in installation and usage, SANET was developed to provide various IT services to ships based on IP in the sea. Connectivity between land base stations and ships is important in the SANET. To have this connection, a ship must be a member of the network with its IP address assigned. This paper proposes a SANET-CC protocol that allows ships to be assigned their own IP address. SANET-CC propagates several non-overlapping IP addresses through the entire network from land base stations to ships in the form of the tree. Ships allocate their own IP addresses through the exchange of simple requests and response messages with land base stations or M-ships that can allocate IP addresses. Therefore, SANET-CC can eliminate the IP collision prevention (Duplicate Address Detection) process and the process of network separation or integration caused by the movement of the ship. Various simulations were performed to verify the applicability of this protocol to SANET. The outcome of such simulations shows us the following. First, using SANET-CC, about 91% of the ships in the network were able to receive IP addresses under any circumstances. It is 6% higher than the existing studies. And it suggests that if variables are adjusted to each port's environment, it may show further improved results. Second, this work shows us that it takes all vessels an average of 10 seconds to receive IP addresses regardless of conditions. It represents a 50% decrease in time compared to the average of 20 seconds in the previous study. Also Besides, taking it into account that when existing studies were on 50 to 200 vessels, this study on 100 to 400 vessels, the efficiency can be much higher. Third, existing studies have not been able to derive optimal values according to variables. This is because it does not have a consistent pattern depending on the variable. This means that optimal variables values cannot be set for each port under diverse environments. This paper, however, shows us that the result values from the variables exhibit a consistent pattern. This is significant in that it can be applied to each port by adjusting the variable values. It was also confirmed that regardless of the number of ships, the IP allocation ratio was the most efficient at about 96 percent if the waiting time after the IP request was 75ms, and that the tree structure could maintain a stable network configuration when the number of IPs was over 30000. Fourth, this study can be used to design a network for supporting intelligent maritime control systems and services offshore, instead of satellite communication. And if LTE-M is set up, it is possible to use it for various intelligent services.