• Title/Summary/Keyword: IT Techniques


Location and Construction Characteristics of Imdaejeong Wonlim based on Documentation (기문(記文)을 중심으로 고찰한 임대정원림(臨對亭園林)의 입지 및 조영 특성)

  • Rho, Jae-Hyun;Park, Tae-Hee;Shin, Sang-Sup;Kim, Hyoun-Wuk
    • Journal of the Korean Institute of Traditional Landscape Architecture, v.29 no.4, pp.14-26, 2011
  • Imdaejeong Wonlim is located on the northwestern edge of Sangsa Village in Sapyeong-ri, Daepyeong-myeon, Hwasun-gun, Jeollanam-do. It was planned by Sa-ae Minjuhyeon in 1862 on the basis of Gobanwon, built by Nam Eongi in the 16th century, against the backdrop of Mt. Bongjeong and facing Sapyeong Stream. As water flows from west to east in the shape of a crane, the site is regarded as propitious, standing for prosperity and happiness. The Wonlim surrounds Imdaejeong in multiple layers and consists of five districts: a front yard, where a landmark stone engraved with 'Janggujiso of Master Sa-ae' and junipers are harmoniously arranged; the internal garden of the upper pavilion, ranging from the pavilion to a square pond with a small island in the middle; Sugyeongwon of the lower pavilion, consisting of two ponds with a painting of three Taoist hermits; the forest of Mt. Bongjeong; and an external garden including Sapyeong Stream and farmland. The documentation and the on-site investigation indicate that Imdaejeong Wonlim was motivated by the Byeoseo Wonlim tradition, which realized the idea of 'going back to one's hometown after resignation'. It follows the motive of Janggujiso, a retreat intended to accomplish the ideal of 'training the mind and fostering innate nature' on a peaceful site surrounded by water and mountains, as well as the motive of Sesimcheo(洗心處), aimed at unity with the morality of Mother Nature. In addition, it implies various imaginary landscapes such as Pihangji, Eupcheongdang, the square pond with an island and the painting of three Taoist hermits, based on the notion that 'the farther a scent travels, the fresher it becomes', which originates from Aelyeonseol(愛蓮說). In terms of natural landscape treatment, diverse techniques are found in Imdaejeong Wonlim: a distant view of Mt. Bongjeong; a pulling view intended to capture the transparent beauty of moonlight; a circle view of natural and cultural scenery on every side; borrowed scenery of pastoral rural life adopted as an opposite view; a looked view of Sulyundaero; an overlooked view of the pond; static views in the pavilion and along the paths; close views of water spaces such as the stream and ponds; a mushroom-and-umbrella-like view of Imdaejeong; a vista of the pond surrounded by willows; an imaginary view of engraved letters meaning 'widen knowledge by studying objectives'; and a selected view comprising sunrise and sunset at the same time. At the beginning of construction, various plants appear to have been planted, though different from those found now, such as Ginkgo biloba, Phyllostachys spp., Salix spp., Pinus densiflora, Abies holophylla, Morus bombycis, Juglans mandschurica, Paulownia coreana, Prunus mume and Nelumbo nucifera. Generally, the planting reflected the dignity of Confucianism or bore the aspect of a semantic landscape implying Taoist taste and the idea of the phoenix, wishing for future prosperity. Furthermore, a diversity of planting methods was pursued, such as linear planting along the periphery of the pond, bosquet planting and circle planting around the pavilion, spot planting using green trees, solitary planting of a monumentally planted Paulownia coreana and opposite planting presenting Abies holophylla as yin and yang.

A Study on the Characteristics of Enterprise R&D Capabilities Using Data Mining (데이터마이닝을 활용한 기업 R&D역량 특성에 관한 탐색 연구)

  • Kim, Sang-Gook;Lim, Jung-Sun;Park, Wan
    • Journal of Intelligence and Information Systems, v.27 no.1, pp.1-21, 2021
  • As the global business environment changes, uncertainties in technology development and market needs increase, and competition among companies intensifies, interests and demands for R&D activities of individual companies are increasing. In order to cope with these environmental changes, R&D companies are strengthening R&D investment as one of the means to enhance the qualitative competitiveness of R&D while paying more attention to facility investment. As a result, facilities or R&D investment elements are inevitably a burden for R&D companies to bear future uncertainties. It is true that the management strategy of increasing investment in R&D as a means of enhancing R&D capability is highly uncertain in terms of corporate performance. In this study, the structural factors that influence the R&D capabilities of companies are explored in terms of technology management capabilities, R&D capabilities, and corporate classification attributes by utilizing data mining techniques, and the characteristics these individual factors present according to the level of R&D capabilities are analyzed. This study also showed cluster analysis and experimental results based on evidence data for all domestic R&D companies, and is expected to provide important implications for corporate management strategies to enhance R&D capabilities of individual companies. For each of the three viewpoints, detailed evaluation indexes were composed of 7, 2, and 4, respectively, to quantitatively measure individual levels in the corresponding area. In the case of technology management capability and R&D capability, the sub-item evaluation indexes that are being used by current domestic technology evaluation agencies were referenced, and the final detailed evaluation index was newly constructed in consideration of whether data could be obtained quantitatively. In the case of corporate classification attributes, the most basic corporate classification profile information is considered. 
In particular, in order to grasp the homogeneity of the R&D competency level, a comprehensive score for each company was given using detailed evaluation indicators of technology management capability and R&D capability, and the competency level was classified into five grades and compared with the cluster analysis results. In order to give the meaning according to the comparative evaluation between the analyzed cluster and the competency level grade, the clusters with high and low trends in R&D competency level were searched for each cluster. Afterwards, characteristics according to detailed evaluation indicators were analyzed in the cluster. Through this method of conducting research, two groups with high R&D competency and one with low level of R&D competency were analyzed, and the remaining two clusters were similar with almost high incidence. As a result, in this study, individual characteristics according to detailed evaluation indexes were analyzed for two clusters with high competency level and one cluster with low competency level. The implications of the results of this study are that the faster the replacement cycle of professional managers who can effectively respond to changes in technology and market demand, the more likely they will contribute to enhancing R&D capabilities. In the case of a private company, it is necessary to increase the intensity of input of R&D capabilities by enhancing the sense of belonging of R&D personnel to the company through conversion to a corporate company, and to provide the accuracy of responsibility and authority through the organization of the team unit. Since the number of technical commercialization achievements and technology certifications are occurring both in the case of contributing to capacity improvement and in case of not, it was confirmed that there is a limit in reviewing it as an important factor for enhancing R&D capacity from the perspective of management. 
Lastly, experience of utility model filing was identified as a factor with an important influence on R&D capability, which confirms the need to provide motivation that encourages utility model filings in order to enhance R&D capability. As such, the results of this study are expected to provide important implications for corporate management strategies to enhance individual companies' R&D capabilities.
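The comparison of five competency grades against cluster membership described in this abstract can be sketched as a simple cross-tabulation. This is only an illustration: the quintile-by-rank grading rule, the scores and the cluster labels below are all hypothetical assumptions, not the authors' data or method.

```python
from collections import Counter


def five_grades(scores):
    """Assign grade 1 (highest) to 5 (lowest) by rank quintile.

    Assumption: the abstract does not say how grade boundaries were set,
    so equal-sized rank groups are used here purely for illustration.
    """
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    grades = [0] * n
    for rank, i in enumerate(order):
        grades[i] = rank * 5 // n + 1
    return grades


def crosstab(clusters, grades):
    """Count how many companies fall in each (cluster, grade) cell."""
    return Counter(zip(clusters, grades))


# Hypothetical comprehensive scores and cluster labels for ten companies.
scores = [91, 85, 78, 66, 60, 52, 47, 33, 20, 12]
clusters = ["A", "A", "B", "B", "C", "C", "C", "D", "E", "E"]

grades = five_grades(scores)
table = crosstab(clusters, grades)
for (cluster, grade), count in sorted(table.items()):
    print(f"cluster {cluster}, grade {grade}: {count}")
```

A cluster whose members concentrate in grades 1 and 2 would correspond to a "high competency" cluster in the sense used above.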

The Prediction of Export Credit Guarantee Accident using Machine Learning (기계학습을 이용한 수출신용보증 사고예측)

  • Cho, Jaeyoung;Joo, Jihwan;Han, Ingoo
    • Journal of Intelligence and Information Systems, v.27 no.1, pp.83-102, 2021
  • The government recently announced various policies for developing big-data and artificial intelligence fields to provide a great opportunity to the public with respect to disclosure of high-quality data within public institutions. KSURE(Korea Trade Insurance Corporation) is a major public institution for financial policy in Korea, and thus the company is strongly committed to backing export companies with various systems. Nevertheless, there are still fewer cases of realized business model based on big-data analyses. In this situation, this paper aims to develop a new business model which can be applied to an ex-ante prediction for the likelihood of the insurance accident of credit guarantee. We utilize internal data from KSURE which supports export companies in Korea and apply machine learning models. Then, we conduct performance comparison among the predictive models including Logistic Regression, Random Forest, XGBoost, LightGBM, and DNN(Deep Neural Network). For decades, many researchers have tried to find better models which can help to predict bankruptcy since the ex-ante prediction is crucial for corporate managers, investors, creditors, and other stakeholders. The development of the prediction for financial distress or bankruptcy was originated from Smith(1930), Fitzpatrick(1932), or Merwin(1942). One of the most famous models is the Altman's Z-score model(Altman, 1968) which was based on the multiple discriminant analysis. This model is widely used in both research and practice by this time. The author suggests the score model that utilizes five key financial ratios to predict the probability of bankruptcy in the next two years. Ohlson(1980) introduces logit model to complement some limitations of previous models. Furthermore, Elmer and Borowski(1988) develop and examine a rule-based, automated system which conducts the financial analysis of savings and loans. 
Since the 1980s, researchers in Korea have also studied the prediction of financial distress or bankruptcy. Kim(1987) analyzes financial ratios and develops a prediction model. Han et al.(1995, 1996, 1997, 2003, 2005, 2006) construct prediction models using various techniques including artificial neural networks. Yang(1996) introduces multiple discriminant analysis and a logit model. Kim and Kim(2001) utilize artificial neural network techniques for ex-ante prediction of insolvent enterprises. Since then, many scholars have tried to predict financial distress or bankruptcy more precisely with diverse models such as Random Forest or SVM. One major distinction of our research from previous research is that we examine the predicted probability of default for each sample case, rather than only the classification accuracy of each model over the entire sample. Most predictive models in this paper show a classification accuracy of about 70% on the entire sample. Specifically, the LightGBM model shows the highest accuracy, 71.1%, and the Logit model the lowest, 69%. However, we find that these results are open to multiple interpretations. In the business context, more emphasis must be placed on minimizing type 2 error, which causes more harmful operating losses for the guaranty company. Thus, we also compare classification accuracy after splitting the predicted probability of default into ten equal intervals. Examined by interval, the Logit model has the highest accuracy, 100%, for predicted default probabilities of 0~10%, but a relatively low accuracy of 61.5% for predicted default probabilities of 90~100%.
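The interval-wise comparison described here, splitting the predicted default probability into ten equal intervals and computing accuracy within each, might look like the following minimal sketch. The 0.5 decision threshold, the function name and the sample data are assumptions for illustration; the abstract does not state them.

```python
def accuracy_by_interval(y_true, p_default, threshold=0.5):
    """Return {interval_index: (n_cases, accuracy)} over ten equal-width bins.

    Interval 0 covers predicted probabilities in [0, 0.1), interval 9 covers
    [0.9, 1.0]. A case is counted as correct when the thresholded prediction
    matches the true label (threshold is an assumed value).
    """
    counts = [0] * 10
    hits = [0] * 10
    for y, p in zip(y_true, p_default):
        b = min(int(p * 10), 9)  # p == 1.0 is placed in the last interval
        counts[b] += 1
        hits[b] += int((p >= threshold) == bool(y))
    return {b: (counts[b], hits[b] / counts[b]) for b in range(10) if counts[b]}


# Hypothetical labels (1 = guarantee accident) and predicted probabilities.
y_true = [0, 0, 1, 0, 1, 1, 1, 0]
p_default = [0.02, 0.08, 0.05, 0.45, 0.55, 0.93, 0.97, 0.95]

result = accuracy_by_interval(y_true, p_default)
for b, (n, acc) in result.items():
    print(f"{b * 10}~{b * 10 + 10}%: n={n}, accuracy={acc:.3f}")
```

Reporting the case count next to each accuracy matters for the argument in the abstract, since a model can look accurate in an extreme interval while placing very few cases there.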
On the other hand, Random Forest, XGBoost, LightGBM, and DNN indicate more desirable results since they indicate a higher level of accuracy for both 0~10% and 90~100% of the predicted probability of the default but have a lower level of accuracy around 50% of the predicted probability of the default. When it comes to the distribution of samples for each predicted probability of the default, both LightGBM and XGBoost models have a relatively large number of samples for both 0~10% and 90~100% of the predicted probability of the default. Although Random Forest model has an advantage with regard to the perspective of classification accuracy with small number of cases, LightGBM or XGBoost could become a more desirable model since they classify large number of cases into the two extreme intervals of the predicted probability of the default, even allowing for their relatively low classification accuracy. Considering the importance of type 2 error and total prediction accuracy, XGBoost and DNN show superior performance. Next, Random Forest and LightGBM show good results, but logistic regression shows the worst performance. However, each predictive model has a comparative advantage in terms of various evaluation standards. For instance, Random Forest model shows almost 100% accuracy for samples which are expected to have a high level of the probability of default. Collectively, we can construct more comprehensive ensemble models which contain multiple classification machine learning models and conduct majority voting for maximizing its overall performance.
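The closing suggestion, combining several classifiers by majority voting, can be sketched as below. The per-model predictions are hypothetical placeholders; in practice they would come from the fitted Logistic Regression, Random Forest, XGBoost, LightGBM and DNN models, and the tie rule here is an assumption.

```python
def majority_vote(predictions_per_model):
    """Combine 0/1 predictions from several models by simple majority.

    predictions_per_model: list of equal-length label lists, one per model.
    Ties (possible only with an even number of models) resolve to 0 here,
    which is an arbitrary choice made for this sketch.
    """
    voted = []
    for labels in zip(*predictions_per_model):
        voted.append(1 if sum(labels) * 2 > len(labels) else 0)
    return voted


# Hypothetical predictions from five models for three guarantee cases.
predictions = [
    [1, 0, 1],
    [1, 0, 0],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 0],
]
ensemble = majority_vote(predictions)
print(ensemble)
```

Given the emphasis on type 2 error above, a weighted or asymmetric voting rule might be preferable to a plain majority, but that choice is outside what the abstract specifies.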

A Study on Rationalization of National Forest Management in Korea (국유림경영(國有林經營)의 합리화(合理化)에 관(關)한 연구(硏究))

  • Choi, Kyu-Ryun
    • Journal of Korean Society of Forest Science, v.20 no.1, pp.1-44, 1973
  • Needless to say, the management of national forests is very important in every country in view of the national mission and management purposes. Korean national forest is particularly significant for promoting the national economy, given the continuously increasing demand for wood, conservation of the land and social welfare. But there is no denying that the leading aim of Korean forest policy has been the conservation of forest resources and the recovery of the land conservation function rather than the improvement of forest productive capacity. Therefore, the management of national forest should be treated as an industry within the Korean national economy, and an increase in forest productive capacity based on rationalized forest management is urgently needed. Increasing timber production and establishing forests that are good in both quality and quantity will naturally bring many conservation functions and other public benefits. Korean national forest was first established in 1908 as a result of the notification of ownership, and in 1911-1924 it was divided into two kinds: indisposable national forest, for land conservation, forest management, scientific research and public welfare, and other national forest to be disposed of. Indisposable forest is mostly under the jurisdiction of the national forest stations (Chungbu, Tongbu, Nambu), and the other national forests are in the custody of the respective cities and provinces or of other government authorities. As of the end of 1971, national forest land is 19.5% (1,297,708 ha) of the total forest land area, but its growing stock is 50.1% ($35,406,079m^3$) of the total forest growing stock, and its timber production is 23.6% ($205,959m^3$) of annual timber production in Korea. Accordingly, national forest occupies a major part of Korean forestry.
The author positively affirms that success or failure of the management of national forest controls rise or fall of forestry in Korea. All functions of forest are very important, but among others the function of timber production is most important especially in Korea, that unavoidably imports a large quantity of foreign wood every year (in 1971 import of foreign wood-$3,756,000m^3$, 160,995,000 dollars). So, Korea urgently needs the improvement of forest productive capacity in national forest. But it is difficult that wood production meets the rapid increase of demand for wood to the development of economy, because production term of forestry is long, so national forest management should be rationalized by the effective investment and development of forestry techniques in the long view. Although Korean national forest business has many difficulties in the budget, techniques and the lack of labour due to outflow of rural village labour by development of national economy, and the increase of labour wages and administrative expenses etc. the development of national forest depends on adoption of the suitable forest techniques and management adapted for social and economical development. In this view point the writer has investigated and analyzed the status of the management of national forest in Korea to examine the irrational problems and suggest an improvement plan. The national forestry statistics cited in this study is based on the basic statistics and the statistics of the forest business as of the end of 1971 published by Office of Forestry, Republic of Korea, and the other depended on the data presented by the national forest stations. The writer wants to propose as follows (seemed to be helpful in improvement of Korean national forest management). 
1) In the organization of national forest management, more national forest stations should be established to allow intensive management, and the staff of working plan officials should be strengthened because of the importance of the working plan.
2) By increasing the staff of protection officials, the forest area assigned to each protection official should be decreased to 1,000-2,000 ha.
3) Frequent personnel changes of the supervisor of a national forest station (the responsible person on the spot) obstruct the accomplishment of a consistent management plan.
4) In drafting working plans for national forest, basic investigations should be carried out carefully, with sufficient expenditure and staff, so that unrealistic working plans are not drafted.
5) The area of a working unit should be decreased to less than 2,000 ha on average for intensive management, and the principle of one working unit per forest station should be realized as soon as possible.
6) Reforestation of open land should be completed in a short time with a debt from the special fund (a long-term loan), land on which hardwood stands are growing should be converted to conifers to increase productivity per unit area, and at the same time technical utilization methods for hardwood should be developed.
7) Reforestation expenses should be saved through mechanization and the use of chemicals in reforestation and tree nursery operations, in preparation for the future lack of labour.
8) In forest protection, forest fire damage is enormous in comparison with foreign countries; accordingly, the prevention system and equipment should be improved, and the minimum necessary budget should be allocated for the establishment and maintenance of fire-lines.
9) Manufacture production should be enlarged to systematize the protection, processing and circulation of forest business, which would naturally bring much benefit to rural people.
10) The establishment and arrangement of forest road networks and erosion control work are indispensable for the future development of national forest itself and for local development. Therefore, these works should be promoted under the responsibility of general accounting instead of special accounting.
11) Mechanization of forest work should be realized in order to exploit hinterlands, meet the increased demand for timber and solve the lack of labour; consequently, the import of forest machines, home production, training of operators and careful administration should be promoted.
12) The labour situation will grow worse in future. Therefore, countermeasures to retain forest labourers, with attention to public welfare facilities and works, should be considered.
13) Although the condition of income and expenditure grows worse because of economic change, regular expenditure should be fixed. Part of the surplus as of the end of 1971 should therefore be set aside as a fund and used for enlarging reforestation and forest road networks (preceding investment in national forest).


Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems, v.24 no.2, pp.125-148, 2018
  • Recently, as the demand for big data analysis increases, cases of analyzing unstructured data and using the results are also increasing. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in text because the amount of data is very large and it is relatively easy to collect compared to other unstructured and structured data. Among the various text analysis applications, the following have been actively studied: document classification, which classifies documents into predetermined categories; topic modeling, which extracts major topics from a large number of documents; sentiment analysis or opinion mining, which identifies emotions or opinions contained in texts; and text summarization, which summarizes the main contents of one or several documents. In particular, text summarization is actively applied in business through news summary services, privacy policy summary services, etc. In addition, much research has been done in academia on the extraction approach, which selectively provides the main elements of the document, and the abstraction approach, which extracts elements of the document and composes new sentences by combining them. However, techniques for evaluating the quality of automatically summarized documents have not made as much progress as techniques for automatic text summarization itself. Most existing studies on summarization quality evaluation manually summarize documents, use these summaries as reference documents, and measure the similarity between the automatic summary and the reference document. Specifically, automatic summarization is performed on the full text through various techniques, and the result is compared with the reference document, an ideal summary, to measure the quality of the automatic summarization.
Reference documents are provided in two major ways. The most common is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in preparing the summary, writing the summary takes a lot of time and cost, and the evaluation result may differ depending on the subjectivity of the summarizer. Therefore, to overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. As a representative attempt, a method has recently been devised that reduces the size of the full text and measures the similarity between the reduced full text and the automatic summary. In this method, the more often the frequent terms of the full text appear in the summary, the better the quality of the summary is judged to be. However, since summarization essentially means condensing a lot of content while minimizing content omissions, it is unreasonable to say that a "good summary" based only on frequency is always a "good summary" in this essential sense. To overcome the limitations of this previous work on summarization evaluation, this study proposes an automatic quality evaluation method for text summarization based on the essential meaning of summarization. Specifically, succinctness is defined as an element indicating how little content is duplicated among the sentences of the summary, and completeness is defined as an element indicating how little of the original content is missing from the summary. In this paper, we propose a method for automatic quality evaluation of text summarization based on these concepts of succinctness and completeness.
To evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor's hotel reviews, the reviews were summarized for each hotel, and the results of experiments evaluating the quality of the summaries with the proposed methodology are presented. The study also provides a way to integrate completeness and succinctness, which are in a trade-off relationship, into an F-Score, and proposes a method to perform optimal summarization by changing the threshold of sentence similarity.
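The idea of scoring a summary by completeness and succinctness and combining the two into an F-Score can be sketched roughly as follows. This is not the authors' implementation: the Jaccard token overlap used as sentence similarity, the specific ratio definitions, the threshold value and the sample sentences are all assumptions chosen for illustration.

```python
def jaccard(a, b):
    """Token-set overlap between two sentences (assumed similarity measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def completeness(source, summary, threshold):
    """Share of source sentences covered by at least one summary sentence."""
    covered = sum(
        any(jaccard(s, t) >= threshold for t in summary) for s in source
    )
    return covered / len(source)


def succinctness(summary, threshold):
    """Share of summary sentences that do not duplicate an earlier one."""
    unique = sum(
        all(jaccard(summary[i], summary[j]) < threshold for j in range(i))
        for i in range(len(summary))
    )
    return unique / len(summary)


def f_score(c, s):
    """Harmonic mean of completeness and succinctness."""
    return 2 * c * s / (c + s) if c + s else 0.0


# Hypothetical review sentences and a two-sentence summary.
source = [
    "the room was clean",
    "the room was very clean",
    "staff were friendly",
    "breakfast was poor",
]
summary = ["the room was clean", "staff were friendly"]

c = completeness(source, summary, threshold=0.5)
s = succinctness(summary, threshold=0.5)
print(c, s, f_score(c, s))
```

Raising the similarity threshold makes coverage harder to achieve (lower completeness) but duplicates harder to trigger (higher succinctness), which is one way to see the trade-off the abstract mentions.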

Effects of Molecular Weight of Polyethylene Glycol on the Dimensional Stabilization of Wood (Polyethylene Glycol의 분자량(分子量)이 목재(木材)의 치수 안정화(安定化)에 미치는 영향(影響))

  • Cheon, Cheol;Oh, Joung Soo
    • Journal of Korean Society of Forest Science, v.71 no.1, pp.14-21, 1985
  • This study was carried out to develop dimensional stabilization techniques for wood using PEG, which is cheap and non-toxic and is not difficult to impregnate, in order to prevent the devaluation of wood and wood products caused by anisotropy, hygroscopicity, shrinkage and swelling, properties peculiar to wood, and to improve the utility of wood by emphasizing its naturally beautiful figure. The effects of PEG molecular weight (200, 400, 600, 1000, 1500, 2000, 4000, 6000) and species (Pinus densiflora S. et Z., Larix leptolepis Gordon., Cryptomeria japonica D. Don., Cornus controversa Hemsl., Quercus variabilis Blume., Prunus sargentii Rehder.) were examined. The results were as follows. 1) PEG loading showed its maximum value (137.22%, Pinus densiflora) with PEG 400, while the others showed a relatively slow decrease. The lower the specific gravity, the greater the polymer loading. 2) The bulking coefficient did not show a particular correlation with specific gravity and, for the most part, reached its maximum with PEG 600, except that the bulking coefficient of Quercus variabilis was distributed in the range of 12-18% for PEG 400-2000. In general, the bulking coefficient of hardwood was higher than that of softwood. 3) Although there were some exceptions according to species, volumetric swelling reduction was greatest with PEG 400. Cryptomeria japonica showed the greatest value, 95.0%, and the others exceeded 80% except for Prunus sargentii, whereas volumetric swelling reduction fell below 70% as the molecular weight increased beyond 1000. 4) The relative effectiveness of hardwood with high specific gravity was markedly higher than that of softwood. In general, the relative effectiveness of low molecular weight PEG was superior to that of high molecular weight PEG, except that Quercus variabilis showed values above 1.6 over the whole molecular weight range, and there was no significant difference once the molecular weight reached 4000 or more. 5) According to the analysis of the results above, dimensional stabilization was more effective for hardwood than for softwood. Although volumetric swelling reduction was greatest at a molecular weight of 400, in view of polymer loading, bulking coefficient, reduction of swelling and relative effectiveness it is desirable to use a mixture of PEGs with molecular weights in the range of 200-1500. For practical use, further study is recommended on the effect of the mixing ratio on the bulking coefficient, reduction of swelling and relative effectiveness.
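For orientation, the quantities reported above are conventionally computed as simple percentage ratios. The abstract does not give its formulas, so the definitions below are the commonly used ones and should be read as assumptions; the symbols are introduced here for illustration only. With $W_0$ and $V_0$ the oven-dry weight and volume of untreated wood, $W_t$ and $V_t$ the same after PEG treatment, and $S_c$ and $S_t$ the volumetric swelling of control and treated specimens:

$$
\text{Polymer loading} = \frac{W_t - W_0}{W_0} \times 100\,(\%), \qquad
\text{Bulking coefficient} = \frac{V_t - V_0}{V_0} \times 100\,(\%)
$$

$$
\text{Volumetric swelling reduction} = \frac{S_c - S_t}{S_c} \times 100\,(\%)
$$

Under these definitions, a swelling reduction of 95.0% means the treated specimen swells by only one twentieth as much as the untreated control. Relative effectiveness is left undefined here because its exact formulation in the study is not stated.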


A Study on the Application of Outlier Analysis for Fraud Detection: Focused on Transactions of Auction Exception Agricultural Products (부정 탐지를 위한 이상치 분석 활용방안 연구 : 농수산 상장예외품목 거래를 대상으로)

  • Kim, Dongsung;Kim, Kitae;Kim, Jongwoo;Park, Steve
    • Journal of Intelligence and Information Systems, v.20 no.3, pp.93-108, 2014
  • To support business decision making, interest in and efforts toward analyzing and using transaction data from different perspectives are increasing. Such efforts are not limited to customer management or marketing; they also extend to monitoring and detecting fraud transactions. Fraud transactions are evolving into various patterns by taking advantage of information technology. To keep up with this evolution, much work has gone into fraud detection methods and advanced application systems that improve the accuracy and ease of fraud detection. As a case of fraud detection, this study aims to provide effective fraud detection methods for auction exception agricultural products in the largest Korean agricultural wholesale market. The auction exception products policy exists to complement auction-based trade in the agricultural wholesale market. That is, most trades in agricultural products are performed by auction; however, specific products are designated as auction exception products when the total volume of the product is relatively small, the number of wholesalers is small, or wholesalers have difficulty purchasing the product. However, the auction exception products policy raises several problems regarding the fairness and transparency of transactions, which calls for fraud detection. In this study, to generate fraud detection rules, real agricultural product trade data from the market for 2008 to 2010 are analyzed, amounting to more than 1 million transactions and 1 billion US dollars in transaction volume. Agricultural transaction data have unique characteristics such as frequent changes in supply volume and turbulent time-dependent changes in price. Since this was the first attempt to identify fraud transactions in this domain, there was no training data set for supervised learning, so fraud detection rules are generated using an outlier detection approach.
We assume that outlier transactions are more likely to be fraud transactions than normal transactions. Outlier transactions are identified by comparing against the daily, weekly and quarterly average unit prices of product items. The quarterly average unit prices of product items for specific wholesalers are also used to identify outlier transactions. The reliability of the generated fraud detection rules was confirmed by domain experts. To determine whether a transaction is fraudulent, the normal distribution and the normalized Z-value concept are applied. That is, the unit price of a transaction is transformed into a Z-value to calculate its occurrence probability, approximating the distribution of unit prices by a normal distribution. A modified Z-value of the unit price is used rather than the original Z-value. The reason is that, for auction exception agricultural products, Z-values are influenced by the outlier fraud transactions themselves because the number of wholesalers is small. The modified Z-values are called Self-Eliminated Z-scores because they are calculated excluding the unit price of the specific transaction being checked. To show the usefulness of the proposed approach, a prototype fraud transaction detection system was developed using Delphi. The system consists of five main menus and related submenus. The first function of the system is to import transaction databases. The next important function is to set fraud detection parameters; by changing them, system users can control the number of potential fraud transactions. Execution functions provide the fraud detection results found under the chosen parameters. The potential fraud transactions can be viewed on screen or exported as files. The study is an initial attempt to identify fraud transactions in auction exception agricultural products.
Many research topics on this issue remain. First, the scope of the analyzed data was limited by data availability. More data on transactions, wholesalers, and producers need to be included to detect fraudulent transactions more accurately. Next, the scope of fraud transaction detection needs to be extended to fishery products. There are also many possibilities for applying different data mining techniques to fraud detection. For example, a time series approach is a potential technique to apply to the problem. Although outlier transactions are detected here based on the unit prices of transactions, it is also possible to derive fraud detection rules based on transaction volumes.
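
The Self-Eliminated Z-score described in this abstract can be illustrated with a minimal sketch. It assumes a plain list of unit prices for one product item and period; the function names, the use of the sample standard deviation, and the threshold of 3.0 are illustrative assumptions, not details taken from the paper or its Delphi prototype.

```python
from statistics import mean, stdev


def self_eliminated_z_scores(prices):
    """Return one z-score per price, each computed from the OTHER prices only.

    Leaving the checked transaction out keeps a single extreme price from
    inflating the mean and standard deviation it is compared against.
    """
    prices = list(prices)
    scores = []
    for i, price in enumerate(prices):
        others = prices[:i] + prices[i + 1:]
        if len(others) < 2:
            scores.append(None)  # too few remaining prices to estimate spread
            continue
        sigma = stdev(others)  # sample standard deviation (an assumption)
        if sigma == 0:
            scores.append(None)  # all other prices identical; z is undefined
            continue
        scores.append((price - mean(others)) / sigma)
    return scores


def flag_outliers(prices, threshold=3.0):
    """Indices whose self-eliminated |z| exceeds the (hypothetical) threshold."""
    scores = self_eliminated_z_scores(prices)
    return [i for i, z in enumerate(scores) if z is not None and abs(z) > threshold]


unit_prices = [100, 102, 98, 101, 99, 300]
suspicious = flag_outliers(unit_prices)
```

With so few prices, an ordinary Z-score for the last value stays close to 2 because the value itself widens the standard deviation, while the self-eliminated score is far larger. This is the small-sample effect the abstract gives as the reason for the modification.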

Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)

  • Bae, Jung-Hwan;Son, Ji-Eun;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.141-156
    • /
    • 2013
  • Social media is a representative form of Web 2.0 that shapes changes in users' information behavior by allowing users to produce their own content without any expert skills. In particular, as a new communication medium, it has a profound impact on social change by enabling users to communicate their opinions and thoughts to the masses and to acquaintances. Social media data plays a significant role in the emerging Big Data arena. A variety of research areas, such as social network analysis and opinion mining, have therefore paid attention to discovering meaningful information from the vast amounts of data buried in social media. Social media has recently become a main focus of the fields of Information Retrieval and Text Mining because it not only produces massive unstructured textual data in real time but also serves as an influential channel for opinion leading. However, most previous studies have adopted broad-brush and limited approaches, which have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system that captures trends by processing big stream datasets of Twitter in real time. The system offers term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to keep track of changes in topical trends, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets containing candidates' names and the election on Twitter in Korea (http://www.twitter.com/) for one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects societal trends effectively. The system also retrieves the list of terms that co-occur with given query terms. 
We compare the results of term co-occurrence retrieval using the names of influential candidates, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. General terms related to the presidential election, such as 'Presidential Election', 'Proclamation in Support', and 'Public opinion poll', appear frequently. The results also show specific terms that differentiate each candidate, such as 'Park Jung Hee' and 'Yuk Young Su' for the query 'Geun Hae Park', 'a single candidacy agreement' and 'Time of voting extension' for the query 'Jae In Moon', and 'a single candidacy agreement' and 'down contract' for the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows the topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic shows one of two types of patterns, a rising tendency or a falling tendency, depending on the change in its probability distribution. To determine the relationship between topic trends on Twitter and social issues in the real world, we compare topic trends with related news articles. We find that Twitter can track issues faster than other media such as newspapers. The user network on Twitter differs from those of other social media because of the distinctive way relationships are made on Twitter: users can form relationships by exchanging mentions. We visualize and analyze mention-based networks of 136,754 users, using the three candidates' names, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn', as query terms. The results show that Twitter users mention all candidates' names regardless of their political tendencies. This case study shows that Twitter could be an effective tool for detecting and predicting dynamic changes in social issues, and that mention-based user networks, which are unique to Twitter, could reveal different aspects of user behavior.
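
Term co-occurrence retrieval of the kind described in this abstract can be sketched as follows. This is a minimal illustration rather than the authors' system: it assumes whitespace tokenization, a single-token query, and counting each term once per tweet, and the function name `cooccurring_terms` is hypothetical. Multi-word names such as the candidates' would need phrase matching or a tokenizer that treats them as a single term.

```python
from collections import Counter


def cooccurring_terms(tweets, query, top_n=5):
    """Count the terms that appear in the same tweet as the query term."""
    query = query.lower()
    counts = Counter()
    for text in tweets:
        tokens = set(text.lower().split())  # each term counted once per tweet
        if query in tokens:
            counts.update(tokens - {query})
    return counts.most_common(top_n)


sample_tweets = [
    "moon poll rises",
    "moon poll debate",
    "park debate tonight",
]
top_terms = cooccurring_terms(sample_tweets, "moon", top_n=1)
```

Counting per tweet rather than per occurrence keeps a single repetitive tweet from dominating the ranking, which is one reasonable design choice among several.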

Multi-day Trip Planning System with Collaborative Recommendation (협업적 추천 기반의 여행 계획 시스템)

  • Aprilia, Priska;Oh, Kyeong-Jin;Hong, Myung-Duk;Ga, Myeong-Hyeon;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.159-185
    • /
    • 2016
  • Planning a multi-day trip is a complex and time-consuming task. It usually starts with selecting a list of points of interest (POIs) worth visiting and then arranging them into an itinerary, taking into consideration various constraints and preferences. When choosing POIs to visit, one might ask friends for suggestions, search for information on the Web, or seek advice from travel agents; however, these options have their limitations. First, the knowledge of friends is limited to the places they have visited. Second, the tourism information on the internet may be vast, but reading and filtering it can take a lot of time. Lastly, travel agents might be biased towards providers of certain travel products when suggesting itineraries. In recent years, many researchers have tried to deal with the huge amount of tourism information available on the internet. They have explored the wisdom of the crowd through the overwhelming number of images shared by people on social media sites. Furthermore, trip planning problems are usually formulated as 'Tourist Trip Design Problems' and are solved using various search algorithms with heuristics. Recommendation systems using various techniques have been set up to cope with the overwhelming tourism information available on the internet. Prediction models of recommendation systems are typically built using a large dataset. However, such a dataset is not always available. For other models, especially those that require input from people, human computation has emerged as a powerful and inexpensive approach. This study proposes CYTRIP (Crowdsource Your TRIP), a multi-day trip itinerary planning system that draws on the collective intelligence of contributors in recommending POIs. In order to enable the crowd to collaboratively recommend POIs to users, CYTRIP provides a shared workspace. 
In the shared workspace, the crowd can recommend as many POIs to as many requesters as they can, and they can also vote on the POIs recommended by other people when they find them interesting. In CYTRIP, anyone can make a contribution by recommending POIs to requesters based on requesters' specified preferences. CYTRIP takes input on the recommended POIs to build a multi-day trip itinerary taking into account the user's preferences, the various time constraints, and the locations. The input then becomes a multi-day trip planning problem that is formulated in Planning Domain Definition Language 3 (PDDL3). A sequence of actions formulated in a domain file is used to achieve the goals in the planning problem, which are the recommended POIs to be visited. The multi-day trip planning problem is a highly constrained problem. Sometimes, it is not feasible to visit all the recommended POIs with the limited resources available, such as the time the user can spend. In order to cope with an unachievable goal that can result in no solution for the other goals, CYTRIP selects a set of feasible POIs prior to the planning process. The planning problem is created for the selected POIs and fed into the planner. The solution returned by the planner is then parsed into a multi-day trip itinerary and displayed to the user on a map. The proposed system is implemented as a web-based application built using PHP on a CodeIgniter Web Framework. In order to evaluate the proposed system, an online experiment was conducted. From the online experiment, results show that with the help of the contributors, CYTRIP can plan and generate a multi-day trip itinerary that is tailored to the users' preferences and bound by their constraints, such as location or time constraints. The contributors also find that CYTRIP is a useful tool for collecting POIs from the crowd and planning a multi-day trip.
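
The abstract states that CYTRIP selects a feasible set of POIs before planning but does not say how. The sketch below shows one possible approach, not the method used in the paper: a greedy pass that prefers highly voted POIs and keeps those that still fit a time budget. The tuple layout `(name, votes, visit_minutes)`, the function name, and the decision to ignore travel time and opening hours are all assumptions made for illustration.

```python
def select_feasible_pois(pois, budget_minutes):
    """Greedily keep the most-voted POIs whose visit times fit the budget.

    pois: iterable of (name, votes, visit_minutes) tuples (hypothetical layout).
    Travel time between POIs is deliberately ignored in this sketch.
    """
    selected = []
    remaining = budget_minutes
    # Most-voted recommendations are considered first.
    for name, votes, visit_minutes in sorted(pois, key=lambda p: p[1], reverse=True):
        if visit_minutes <= remaining:
            selected.append(name)
            remaining -= visit_minutes
    return selected


recommended = [
    ("Palace", 5, 120),
    ("Museum", 3, 200),
    ("Market", 4, 100),
]
feasible = select_feasible_pois(recommended, budget_minutes=240)
```

Filtering first means the planner only receives goals that can all be met, which matches the motivation given above: a single unachievable goal would otherwise leave the whole planning problem without a solution.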

GENE EXPRESSION PATTERNS INDUCED BY $TAXOL^{(R)}$ AND CYCLOSPORIN A IN ORAL SQUAMOUS CELL CARCINOMA CELL LINE USING CDNA MICROARRAY (cDNA Microarray를 이용한 구강편평세포암종 세포주에서 $Taxol^{(R)}$과 Cyclosporin A로 유도된 유전자 발현양상)

  • Kim, Yong-Kwan;Lee, Jae-Hoon;Kim, Chul-Hwan
    • Maxillofacial Plastic and Reconstructive Surgery
    • /
    • v.28 no.3
    • /
    • pp.202-212
    • /
    • 2006
  • Paclitaxel ($Taxol^{(R)}$), which is extracted from the Pacific and English yew, is well known as a chemotherapeutic agent for ovarian carcinoma and advanced breast carcinoma. Cyclosporin A, a highly lipophilic cyclic peptide isolated from a fungus, has been used as a useful immunosuppressive drug after transplantation and is associated with cellular apoptosis. Since 1953, when James Watson, Rosalind Franklin and Francis Crick discovered the double helical structure of DNA, several techniques for identifying gene expression have been developed. In the postgenomic period, many researchers have used the DNA microarray, a high-throughput screening technique, to screen the expression of large numbers of genes simultaneously. In this study, we screened gene expression in oral squamous cell carcinoma cell lines treated with $Taxol^{(R)}$, cyclosporin, or cyclosporin combined with $Taxol^{(R)}$ using a cDNA microarray. The results were as follows: 1. The appropriate concentrations of Cyclosporin A and $Taxol^{(R)}$ for use in the oral squamous cell carcinoma cell line were under 1 ${\mu}g/ml$ and 3 ${\mu}g/ml$. 2. In the experimental groups treated with $Taxol^{(R)}$ and with $Taxol^{(R)}$ + Cyclosporin A, cell growth decreased markedly. 3. In the group treated with Cyclosporin A, the MTT assay value hardly decreased, which means that succinate dehydrogenase activity remained in the mitochondria, whereas in the group treated with the mixture of Cyclosporin A and $Taxol^{(R)}$, the MTT assay value decreased markedly. 4. In each of the groups treated with Cyclosporin A (3 ${\mu}g/ml$) or $Taxol^{(R)}$ (1 ${\mu}g/ml$), cell cycle arrest appeared in the $G_2/M$ phase, and in the group treated with $Taxol^{(R)}$ (3 ${\mu}g/ml$), arrest appeared in both the S phase and the $G_2/M$ phase. 5. 
In the oral squamous cell carcinoma cell line treated with $Taxol^{(R)}$, several genes were detected at increased levels: ANGPTL4, RALBP1 and TXNRD1, associated with apoptosis; SUI1, MAC30, RRAGA and CTGF, related to cell growth; HUS1 and DUSP5, related to cell cycle and proliferation; ATF4 and CEBPG, associated with transcription factors; BTG1 and VEGF, associated with angiogenesis; FDPS, FCER1G, GPA33 and EPHA4, associated with signal transduction and receptor activity; and AKR1C2 and UGTA10, related to carcinogenesis. The genes that showed increased expression in the oral squamous cell carcinoma cell line treated with Cyclosporin A were CYR61, SERPINB2, SSR3 and UPA3A, which are known to be associated with cell growth, carcinogenesis, receptor activity and transcription factors. The genes expressed in the HN22 cell line treated with cyclosporin combined with $Taxol^{(R)}$ were ALCAM and GTSE1, associated with cancer invasiveness and cell cycle regulation.