• Title/Summary/Keyword: 시스템평가

Search Result 20,740, Processing Time 0.048 seconds

A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns (인터넷 검색기록 분석을 통한 쇼핑의도 포함 키워드 자동 추출 기법)

  • Kim, Mingyu;Kim, Namgyu;Jung, Inhwan
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.123-136
    • /
    • 2014
  • Recently, online shopping has further developed as the use of the Internet and a variety of smart mobile devices becomes more prevalent. The increase in the scale of such shopping has led to the creation of many Internet shopping malls. Consequently, there is a tendency for increasingly fierce competition among online retailers, and as a result, many Internet shopping malls are making significant attempts to attract online users to their sites. One such attempt is keyword marketing, whereby a retail site pays a fee to expose its link to potential customers when they insert a specific keyword on an Internet portal site. The price related to each keyword is generally estimated by the keyword's frequency of appearance. However, it is widely accepted that the price of keywords cannot be based solely on their frequency because many keywords may appear frequently but have little relationship to shopping. This implies that it is unreasonable for an online shopping mall to spend a great deal on some keywords simply because people frequently use them. Therefore, from the perspective of shopping malls, a specialized process is required to extract meaningful keywords. Further, the demand for automating this extraction process is increasing because of the drive to improve online sales performance. In this study, we propose a methodology that can automatically extract only shopping-related keywords from the entire set of search keywords used on portal sites. We define a shopping-related keyword as a keyword that is used directly before shopping behaviors. In other words, only search keywords that direct the search results page to shopping-related pages are extracted from among the entire set of search keywords. A comparison is then made between the extracted keywords' rankings and the rankings of the entire set of search keywords. Two types of data are used in our study's experiment: web browsing history from July 1, 2012 to June 30, 2013, and site information. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The original sample dataset contains 150 million transaction logs. First, portal sites are selected, and search keywords in those sites are extracted. Search keywords can be easily extracted by simple parsing. The extracted keywords are ranked according to their frequency. The experiment uses approximately 3.9 million search results from Korea's largest search portal site. As a result, a total of 344,822 search keywords were extracted. Next, by using web browsing history and site information, the shopping-related keywords were taken from the entire set of search keywords. As a result, we obtained 4,709 shopping-related keywords. For performance evaluation, we compared the hit ratios of all the search keywords with the shopping-related keywords. To achieve this, we extracted 80,298 search keywords from several Internet shopping malls and then chose the top 1,000 keywords as a set of true shopping keywords. We measured precision, recall, and F-scores of the entire amount of keywords and the shopping-related keywords. The F-Score was formulated by calculating the harmonic mean of precision and recall. The precision, recall, and F-score of shopping-related keywords derived by the proposed methodology were revealed to be higher than those of the entire number of keywords. This study proposes a scheme that is able to obtain shopping-related keywords in a relatively simple manner. We could easily extract shopping-related keywords simply by examining transactions whose next visit is a shopping mall. The resultant shopping-related keyword set is expected to be a useful asset for many shopping malls that participate in keyword marketing. Moreover, the proposed methodology can be easily applied to the construction of special area-related keywords as well as shopping-related ones.

User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis (다계층 이원 네트워크를 활용한 사용자 관점의 이슈 클러스터링)

  • Kim, Jieun;Kim, Namgyu;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.93-107
    • /
    • 2014
  • In this paper, we report what we have observed with regard to user-perspective issue clustering based on multi-layered two-mode network analysis. This work is significant in the context of data collection by companies about customer needs. Most companies have failed to uncover such needs for products or services properly in terms of demographic data such as age, income levels, and purchase history. Because of excessive reliance on limited internal data, most recommendation systems do not provide decision makers with appropriate business information for current business circumstances. However, part of the problem is the increasing regulation of personal data gathering and privacy. This makes demographic or transaction data collection more difficult, and is a significant hurdle for traditional recommendation approaches because these systems demand a great deal of personal data or transaction logs. Our motivation for presenting this paper to academia is our strong belief, and evidence, that most customers' requirements for products can be effectively and efficiently analyzed from unstructured textual data such as Internet news text. In order to derive users' requirements from textual data obtained online, the proposed approach in this paper attempts to construct double two-mode networks, such as a user-news network and news-issue network, and to integrate these into one quasi-network as the input for issue clustering. One of the contributions of this research is the development of a methodology utilizing enormous amounts of unstructured textual data for user-oriented issue clustering by leveraging existing text mining and social network analysis. In order to build multi-layered two-mode networks of news logs, we need some tools such as text mining and topic analysis. We used not only SAS Enterprise Miner 12.1, which provides a text miner module and cluster module for textual data analysis, but also NetMiner 4 for network visualization and analysis. Our approach for user-perspective issue clustering is composed of six main phases: crawling, topic analysis, access pattern analysis, network merging, network conversion, and clustering. In the first phase, we collect visit logs for news sites by crawler. After gathering unstructured news article data, the topic analysis phase extracts issues from each news article in order to build an article-news network. For simplicity, 100 topics are extracted from 13,652 articles. In the third phase, a user-article network is constructed with access patterns derived from web transaction logs. The double two-mode networks are then merged into a quasi-network of user-issue. Finally, in the user-oriented issue-clustering phase, we classify issues through structural equivalence, and compare these with the clustering results from statistical tools and network analysis. An experiment with a large dataset was performed to build a multi-layer two-mode network. After that, we compared the results of issue clustering from SAS with that of network analysis. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The sample dataset contains 150 million transaction logs and 13,652 news articles of 5,000 panels over one year. User-article and article-issue networks are constructed and merged into a user-issue quasi-network using Netminer. Our issue-clustering results applied the Partitioning Around Medoids (PAM) algorithm and Multidimensional Scaling (MDS), and are consistent with the results from SAS clustering. In spite of extensive efforts to provide user information with recommendation systems, most projects are successful only when companies have sufficient data about users and transactions. Our proposed methodology, user-perspective issue clustering, can provide practical support to decision-making in companies because it enhances user-related data from unstructured textual data. To overcome the problem of insufficient data from traditional approaches, our methodology infers customers' real interests by utilizing web transaction logs. In addition, we suggest topic analysis and issue clustering as a practical means of issue identification.

The Prediction of Currency Crises through Artificial Neural Network (인공신경망을 이용한 경제 위기 예측)

  • Lee, Hyoung Yong;Park, Jung Min
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.19-43
    • /
    • 2016
  • This study examines the causes of the Asian exchange rate crisis and compares it to the European Monetary System crisis. In 1997, emerging countries in Asia experienced financial crises. Previously in 1992, currencies in the European Monetary System had undergone the same experience. This was followed by Mexico in 1994. The objective of this paper lies in the generation of useful insights from these crises. This research presents a comparison of South Korea, United Kingdom and Mexico, and then compares three different models for prediction. Previous studies of economic crisis focused largely on the manual construction of causal models using linear techniques. However, the weakness of such models stems from the prevalence of nonlinear factors in reality. This paper uses a structural equation model to analyze the causes, followed by a neural network model to circumvent the linear model's weaknesses. The models are examined in the context of predicting exchange rates In this paper, data were quarterly ones, and Consumer Price Index, Gross Domestic Product, Interest Rate, Stock Index, Current Account, Foreign Reserves were independent variables for the prediction. However, time periods of each country's data are different. Lisrel is an emerging method and as such requires a fresh approach to financial crisis prediction model design, along with the flexibility to accommodate unexpected change. This paper indicates the neural network model has the greater prediction performance in Korea, Mexico, and United Kingdom. However, in Korea, the multiple regression shows the better performance. In Mexico, the multiple regression is almost indifferent to the Lisrel. Although Lisrel doesn't show the significant performance, the refined model is expected to show the better result. The structural model in this paper should contain the psychological factor and other invisible areas in the future work. The reason of the low hit ratio is that the alternative model in this paper uses only the financial market data. Thus, we cannot consider the other important part. Korea's hit ratio is lower than that of United Kingdom. So, there must be the other construct that affects the financial market. So does Mexico. However, the United Kingdom's financial market is more influenced and explained by the financial factors than Korea and Mexico.

Determinants of Mobile Application Use: A Study Focused on the Correlation between Application Categories (모바일 앱 사용에 영향을 미치는 요인에 관한 연구: 앱 카테고리 간 상관관계를 중심으로)

  • Park, Sangkyu;Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.157-176
    • /
    • 2016
  • For a long time, mobile phone had a sole function of communication. Recently however, abrupt innovations in technology allowed extension of the sphere in mobile phone activities. Development of technology enabled realization of almost computer-like environment even on a very small device. Such advancement yielded several forms of new high-tech devices such as smartphone and tablet PC, which quickly proliferated. Simultaneously with the diffusion of the mobile devices, mobile applications for those devices also prospered and soon became deeply penetrated in consumers' daily lives. Numerous mobile applications have been released in app stores yielding trillions of cumulative downloads. However, a big majority of the applications are disregarded from consumers. Even after the applications are purchased, they do not survive long in consumers' mobile devices and are soon abandoned. Nevertheless, it is imperative for both app developers and app-store operators to understand consumer behaviors and to develop marketing strategies aiming to make sustainable business by first increasing sales of mobile applications and by also designing surviving strategy for applications. Therefore, this research analyzes consumers' mobile application usage behavior in a frame of substitution/supplementary of application categories and several explanatory variables. Considering that consumers of mobile devices use multiple apps simultaneously, this research adopts multivariate probit models to explain mobile application usage behavior and to derive correlation between categories of applications for observing substitution/supplementary of application use. The research adopts several explanatory variables including sociodemographic data, user experiences of purchased applications that reflect future purchasing behavior of paid applications as well as consumer attitudes toward marketing efforts, variables representing consumer attitudes toward rating of the app and those representing consumer attitudes toward app-store promotion efforts (i.e., top developer badge and editor's choice badge). Results of this study can be explained in hedonic and utilitarian framework. Consumers who use hedonic applications, such as those of game and entertainment-related, are of young age with low education level. However, consumers who are old and have received higher education level prefer utilitarian application category such as life, information etc. There are disputable arguments over whether the users of SNS are hedonic or utilitarian. In our results, consumers who are younger and those with higher education level prefer using SNS category applications, which is in a middle of utilitarian and hedonic results. Also, applications that are directly related to tangible assets, such as banking, stock and mobile shopping, are only negatively related to experience of purchasing of paid app, meaning that consumers who put weights on tangible assets do not prefer buying paid application. Regarding categories, most correlations among categories are significantly positive. This is because someone who spend more time on mobile devices tends to use more applications. Game and entertainment category shows significant and positive correlation; however, there exists significantly negative correlation between game and information, as well as game and e-commerce categories of applications. Meanwhile, categories of game and SNS as well as game and finance have shown no significant correlations. This result clearly shows that mobile application usage behavior is quite clearly distinguishable - that the purpose of using mobile devices are polarized into utilitarian and hedonic purpose. This research proves several arguments that can only be explained by second-hand real data, not by survey data, and offers behavioral explanations of mobile application usage in consumers' perspectives. This research also shows substitution/supplementary patterns of consumer application usage, which then explain consumers' mobile application usage behaviors. However, this research has limitations in some points. Classification of categories itself is disputable, for classification is diverged among several studies. Therefore, there is a possibility of change in results depending on the classification. Lastly, although the data are collected in an individual application level, we reduce its observation into an individual level. Further research will be done to resolve these limitations.

Forest Structure in Relation to Slope Aspect and Altitude in valley Forests at Hambaeksan Area (함백산지역 계곡부의 사면방향과 해발고에 따른 산림구조)

  • 박인협;최윤호;이석면;최영철;유석봉
    • Korean Journal of Environment and Ecology
    • /
    • v.15 no.4
    • /
    • pp.361-368
    • /
    • 2002
  • The valley forests located at the east-facing slope and the west facing slope in Hambaeksan area were studied to investigate forest structure in relation to aspect and altitude of the slope. There was little difference in density. mean DBH and basal area of the tree layer between east-facing slope and west-facing slope. The importance percentages of Tilia amurensis and Betula costata in west-facing slope were higher than those in east-facing slope. However, the importance percentages of Quercus mongilica and Fraxinus rhynchophylla in the west facing slope were lower than those in east-facing slope. Species diversity of the west-facing slope was 1.415 and that of the east-facing slope was 1.328. Elevation trends were also found for forest structure. As elevation Increased basal area and mean height of the tree layer decreased in both of east-facing slope and west-facing slope. There was a tendency that number of species, species diversity and evenness decreased with increasing elevation. The importance percentage of Quercus mongolica increased with increasing elevation while those of Betula costata and Maackia amurensis decreased. The result of cluster analysis for the tree and subtree layer indicated that the studied forests were classified into the mixed forest community of broad-leaved tree species at west-facing slope and the low and middle elevation belts of east-facing slope and Quercus mongolica community at the high elevation belt of east-facing slope. Quercus mongolica was significantly and positively correlated with Symplocos chinensis for. pilosa, Acer tschonoskii var. rubripes and deutzia glabrata. Betula costata was significantly and negatively correlated with Quercus mongolica and Acer pseudo-sieboldianum.

Strategies about Optimal Measurement Matrix of Environment Factors Inside Plastic Greenhouse (플라스틱온실 내부 환경 인자 다중센서 설치 위치 최적화 전략)

  • Lee, JungKyu;Kang, DongHyun;Oh, SangHoon;Lee, DongHoon
    • Journal of Bio-Environment Control
    • /
    • v.29 no.2
    • /
    • pp.161-170
    • /
    • 2020
  • There is systematic spatial variations in environmental properties due to sensitive reaction to external conditions at plastic greenhouse occupied 99.2% of domestic agricultural facilities. In order to construct 3 dimensional distribution of temperature, relative humidity, CO2 and illuminance, measurement matrix as 3 by 3 by 5 in direction of width, height and length, respectively, dividing indoor space of greenhouse was designed and tested at experimental site. Linear regression analysis was conducted to evaluate optimal estimation method in terms with horizontal and vertical variations. Even though sole measurement point for temperature and relative humidity could be feasible to assess indoor condition, multiple measurement matrix is inevitably required to improve spatial precision at certain time domain such as period of sunrise and sunset. In case with CO2, multiple measurement matrix could not successfully improve the spatial predictability during a whole experimental period. In case with illuminance, prediction performance was getting smaller after a time period of sunrise due to systematic interference such as indoor structure. Thus, multiple sensing methodology was proposed in direction of length at higher height than growing bed, which could compensate estimation error in spatial domain. Appropriate measurement matrix could be constructed considering the transition of stability in indoor environmental properties due to external variations. As a result, optimal measurement matrix should be carefully designed considering flexibility of construction relevant with the type of property, indoor structure, the purpose of crop and the period of growth. For an instance, partial cooling and heating system to save a consumption of energy supplement could be successfully accomplished by the deployment of multiple measurement matrix.

Comparison of Dose Distributions Calculated by Anisotropic Analytical Algorithm and Pencil Beam Convolution Algorithm at Tumors Located in Liver Dome Site (간원개에 위치한 종양에 대한 Anisotropic Analyticalal Algorithm과 Pencil Beam Convolution 알고리즘에 따른 전달선량 비교)

  • Park, Byung-Do;Jung, Sang-Hoon;Park, Sung-Ho;Kwak, Jeong-Won;Kim, Jong-Hoon;Yoon, Sang-Min;Ahn, Seung-Do
    • Progress in Medical Physics
    • /
    • v.23 no.2
    • /
    • pp.106-113
    • /
    • 2012
  • The purpose of this study is to evaluate the variation of radiation dose distribution for liver tumor located in liver dome and for the interest organs(normal liver, kidney, stomach) with the pencil beam convolution (PBC) algorithm versus anisotropic Analyticalal algorithm (AAA) of the Varian Eclipse treatment planning system, The target volumes from 20 liver cancer patients were used to create treatment plans. Treatment plans for 10 patients were performed in Stereotactic Body Radiation Therapy (SBRT) plan and others were performed in 3 Dimensional Conformal Radiation Therapy (3DCRT) plan. dose calculation was recalculated by AAA algorithm after dose calculation was performed by PBC algorithm for 20 patients. Plans were optimized to 100% of the PTV by the Prescription Isodose in Dose Calculation with the PBC algorithm. Plans were recalculated with the AAA, retaining identical beam arrangements, monitor units, field weighting and collimator condition. In this study, Total PTV was to be statistically significant (SRS: p=0.018, 3DCRT: p=0.006) between PBC and AAA algorithm. and in the case of PTV, ITV in liver dome, plans for 3DCRT were to be statistically significant respectively (p=0.013, p=0.024). normal liver and kidney were to be statistically significant (p=0.009, p=0.037). For the predictive index of dose variation, CVF ratio was to be statistically significant for PTV in the liver dome versus PTV (SRS r=0.684, 3DCRT r=0.732, p<0.01) and CVF ratio for Tumor size was to be statistically significant (SRS r=-0.193, p=0.017, 3DCRT r=0.237, p=0.023).

Evaluation of Setup Uncertainty on the CTV Dose and Setup Margin Using Monte Carlo Simulation (몬테칼로 전산모사를 이용한 셋업오차가 임상표적체적에 전달되는 선량과 셋업마진에 대하여 미치는 영향 평가)

  • Cho, Il-Sung;Kwark, Jung-Won;Cho, Byung-Chul;Kim, Jong-Hoon;Ahn, Seung-Do;Park, Sung-Ho
    • Progress in Medical Physics
    • /
    • v.23 no.2
    • /
    • pp.81-90
    • /
    • 2012
  • The effect of setup uncertainties on CTV dose and the correlation between setup uncertainties and setup margin were evaluated by Monte Carlo based numerical simulation. Patient specific information of IMRT treatment plan for rectal cancer designed on the VARIAN Eclipse planning system was utilized for the Monte Carlo simulation program including the planned dose distribution and tumor volume information of a rectal cancer patient. The simulation program was developed for the purpose of the study on Linux environment using open source packages, GNU C++ and ROOT data analysis framework. All misalignments of patient setup were assumed to follow the central limit theorem. Thus systematic and random errors were generated according to the gaussian statistics with a given standard deviation as simulation input parameter. After the setup error simulations, the change of dose in CTV volume was analyzed with the simulation result. In order to verify the conventional margin recipe, the correlation between setup error and setup margin was compared with the margin formula developed on three dimensional conformal radiation therapy. The simulation was performed total 2,000 times for each simulation input of systematic and random errors independently. The size of standard deviation for generating patient setup errors was changed from 1 mm to 10 mm with 1 mm step. In case for the systematic error the minimum dose on CTV $D_{min}^{stat{\cdot}}$ was decreased from 100.4 to 72.50% and the mean dose $\bar{D}_{syst{\cdot}}$ was decreased from 100.45% to 97.88%. However the standard deviation of dose distribution in CTV volume was increased from 0.02% to 3.33%. The effect of random error gave the same result of a reduction of mean and minimum dose to CTV volume. It was found that the minimum dose on CTV volume $D_{min}^{rand{\cdot}}$ was reduced from 100.45% to 94.80% and the mean dose to CTV $\bar{D}_{rand{\cdot}}$ was decreased from 100.46% to 97.87%. Like systematic error, the standard deviation of CTV dose ${\Delta}D_{rand}$ was increased from 0.01% to 0.63%. After calculating a size of margin for each systematic and random error the "population ratio" was introduced and applied to verify margin recipe. It was found that the conventional margin formula satisfy margin object on IMRT treatment for rectal cancer. It is considered that the developed Monte-carlo based simulation program might be useful to study for patient setup error and dose coverage in CTV volume due to variations of margin size and setup error.

Convolution-Superposition Based IMRT Plan Study for the PTV Containing the Air Region: A Prostate Cancer Case (Convolution-Superposition 알고리즘을 이용한 치료계획시스템에서 공기가 포함된 표적체적에 대한 IMRT 플랜: 전립선 케이스)

  • Kang, Sei-Kwon;Yoon, Jai-Woong;Park, Soah;Hwang, Taejin;Cheong, Kwang-Ho;Han, Taejin;Kim, Haeyoung;Lee, Me-Yeon;Kim, Kyoung Ju;Bae, Hoonsik
    • Progress in Medical Physics
    • /
    • v.24 no.4
    • /
    • pp.271-277
    • /
    • 2013
  • In prostate IMRT planning, the planning target volume (PTV), extended from a clinical target volume (CTV), often contains an overlap air volume from the rectum, which poses a problem inoptimization and prescription. This study was aimed to establish a planning method for such a case. There can be three options in which volume should be considered the target during optimization process; PTV including the air volume of air density ('airOpt'), PTV including the air volume of density value one, mimicking the tissue material ('density1Opt'), and PTV excluding the air volume ('noAirOpt'). Using 10 MV photon beams, seven field IMRT plans for each target were created with the same parameter condition. For these three cases, DVHs for the PTV, bladder and the rectum were compared. Also, the dose coverage for the CTV and the shifted CTV were evaluated in which the shifted CTV was a copied and translated virtual CTV toward the rectum inside the PTV, thus occupying the initial position of the overlap air volume, simulating the worst condition for the dose coverage in the target. Among the three options, only density1Opt plan gave clinically acceptable result in terms of target coverage and maximum dose. The airOpt plan gave exceedingly higher dose and excessive dose coverage for the target volume whereas noAirOpt plan gave underdose for the shifted CTV. Therefore, for prostate IMRT plan, having an air region in the PTV, density modification of the included air to the value of one, is suggested, prior to optimization and prescription for the PTV. This idea can be equally applied to any cases including the head and neck cancer with the PTV having the overlapped air region. Further study is being under process.

The Feasibility Study of MRI-based Radiotherapy Treatment Planning Using Look Up Table (Look Up Table을 이용한 자기공명영상 기반 방사선 치료계획의 타당성 분석 연구)

  • Kim, Shin-Wook;Shin, Hun-Joo;Lee, Young-Kyu;Seo, Jae-Hyuk;Lee, Gi-Woong;Park, Hyeong-Wook;Lee, Jae-Choon;Kim, Ae-Ran;Kim, Ji-Na;Kim, Myong-Ho;Kay, Chul-Seung;Jang, Hong-Seok;Kang, Young-Nam
    • Progress in Medical Physics
    • /
    • v.24 no.4
    • /
    • pp.237-242
    • /
    • 2013
  • In the intracranial regions, an accurate delineation of the target volume has been difficult with only the CT data due to poor soft tissue contrast of CT images. Therefore, the magnetic resonance images (MRI) for the delineation of the target volumes were widely used. To calculate dose distributions with MRI-based RTP, the electron density (ED) mapping concept from the diagnostic CT images and the pseudo CT concept from the MRI were introduced. In this study, the look up table (LUT) from the fifteen patients' diagnostic brain MRI images was created to verify the feasibility of MRI-based RTP. The dose distributions from the MRI-based calculations were compared to the original CT-based calculation. One MRI set has ED information from LUT (lMRI). Another set was generated with voxel values assigned with a homogeneous density of water (wMRI). A simple plan with a single anterior 6MV one portal was applied to the CT, lMRI, and wMRI. Depending on the patient's target geometry for the 3D conformal plan, 6MV photon beams and from two to five gantry portals were used. The differences of the dose distribution and DVH between the lMRI based and CT-based plan were smaller than the wMRI-based plan. The dose difference of wMRI vs. lMRI was measured as 91 cGy vs. 57 cGy at maximum dose, 74 cGt vs. 42 cGy at mean dose, and 94 cGy vs. 53 at minimum dose. The differences of maximum dose, minimum dose, and mean dose of the wMRI-based plan were lower than the lMRI-based plan, because the air cavity was not calculated in the wMRI-based plan. These results prove the feasibility of the lMRI-based planning for brain tumor radiation therapy.