• Title/Summary/Keyword: actual error

Search Result 1,381, Processing Time 0.038 seconds

Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

  • Park, Hee Jin;Jang, Kyoung Ye;Lee, Youn Ho;Kim, Woo Je;Kang, Pil Sung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.11
    • /
    • pp.509-520
    • /
    • 2015
  • College Scholastic Ability Test(CSAT) is a primary test to evaluate the study achievement of high-school students and used by most universities for admission decision in South Korea. Because its level of difficulty is a significant issue to both students and universities, the government makes a huge effort to have a consistent difficulty level every year. However, the actual levels of difficulty have significantly fluctuated, which causes many problems with university admission. In this paper, we build two types of data-driven prediction models to predict correct answer rate and to identify significant factors for CSAT English test through accumulated test data of CSAT, unlike traditional methods depending on experts' judgments. Initially, we derive candidate question-specific factors that can influence the correct answer rate, such as the position, EBS-relation, readability, from the annual CSAT practices and CSAT for 10 years. In addition, we drive context-specific factors by employing topic modeling which identify the underlying topics over the text. Then, the correct answer rate is predicted by multiple linear regression and level of difficulty is predicted by classification tree. The experimental results show that 90% of accuracy can be achieved by the level of difficulty (difficult/easy) classification model, whereas the error rate for correct answer rate is below 16%. Points and problem category are found to be critical to predict the correct answer rate. In addition, the correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of expected correct answer rate for both question-level and entire test-level, which will help CSAT examiners to control the level of difficulties.

Analysis of Respiratory Motion Artifacts in PET Imaging Using Respiratory Gated PET Combined with 4D-CT (4D-CT와 결합한 호흡게이트 PET을 이용한 PET영상의 호흡 인공산물 분석)

  • Cho, Byung-Chul;Park, Sung-Ho;Park, Hee-Chul;Bae, Hoon-Sik;Hwang, Hee-Sung;Shin, Hee-Soon
    • The Korean Journal of Nuclear Medicine
    • /
    • v.39 no.3
    • /
    • pp.174-181
    • /
    • 2005
  • Purpose: Reduction of respiratory motion artifacts in PET images was studied using respiratory-gated PET (RGPET) with moving phantom. Especially a method of generating simulated helical CT images from 4D-CT datasets was developed and applied to a respiratory specific RGPET images for more accurate attenuation correction. Materials and Methods: Using a motion phantom with periodicity of 6 seconds and linear motion amplitude of 26 mm, PET/CT (Discovery ST: GEMS) scans with and without respiratory gating were obtained for one syringe and two vials with each volume of 3, 10, and 30 ml respectively. RPM (Real-Time Position Management, Varian) was used for tracking motion during PET/CT scanning. Ten datasets of RGPET and 4D-CT corresponding to every 10% phase intervals were acquired. from the positions, sizes, and uptake values of each subject on the resultant phase specific PET and CT datasets, the correlations between motion artifacts in PET and CT images and the size of motion relative to the size of subject were analyzed. Results: The center positions of three vials in RGPET and 4D-CT agree well with the actual position within the estimated error. However, volumes of subjects in non-gated PET images increase proportional to relative motion size and were overestimated as much as 250% when the motion amplitude was increased two times larger than the size of the subject. On the contrary, the corresponding maximal uptake value was reduced to about 50%. Conclusion: RGPET is demonstrated to remove respiratory motion artifacts in PET imaging, and moreover, more precise image fusion and more accurate attenuation correction is possible by combining with 4D-CT.

A Bibliographical and Literary Research on the Xinxu(新序) of the Published edition in Joseon (조선간본(朝鮮刊本) 『유향신서(劉向新序)』의 서지·문헌 연구)

  • You, Sueng-hyun;Min, Kuan-dong
    • Cross-Cultural Studies
    • /
    • v.51
    • /
    • pp.257-257
    • /
    • 2018
  • Xinxu(新序) was published in Korea by 1492. Among the existing editions, the editions that can confirm the realities are the collections of Keimyung University, the Korean Studies Central Research Institute, Kyonggi University, Hujodang(後彫堂), and the National Assembly Library of Japan. The Keimyung University's precious book is the 'first published book', and the old book is the 'later published book' which covers pages 69-70 and 71-72 of the first published book. It is the 'later published book' that has the same side inscribed. The second books, the Central Research Institute of Korea Studies and the Kyonggi University Collection are the first published books, and the Hujodang and the National Assembly Library of Japan are on pages 9-10, 63-64, 87-88, 107-108. The corresponding side is the 'later published book'. Comparing the editions, it can be concluded that the existing editions of the previous editions have been withdrawn two times, and in the latter editions, the existing editions of four editions can also be confirmed to have been edited three times. In this paper, the literature based on the existing editions was studied and features of the Korean edition were presented. First, we examine the types of paragraphs. In principle, the text is composed of '11 lines and 18 characters', but on the actual version, the number of characters is shown in the table. In the Korean edition of the Joseon dynasty, a blank space appears in the original text. The erroneous letter in the Joseon book was identified the reason for the error was explained in detail.

Establishment and application of standard-RSF for trace inorganic matter mass analysis using GD-MS (GD-MS 분석 장비를 활용한 극미량 무기물 질량 분석을 위한 표준RSF 구축 및 응용)

  • Jang, MinKyung;Yang, JaeYeol;Lee, JongHyeon;Yoon, JaeSik
    • Analytical Science and Technology
    • /
    • v.31 no.6
    • /
    • pp.240-246
    • /
    • 2018
  • The present study analyzed standard samples of three types of aluminum matrix certified reference materials (CRM) using GD-MS. Calibration curves were constructed for 13 elements (Mg, Si, Ti, V, Cr, Mn, Fe, Ni, Cu, Zn, Ga, Sn, and Pb), with the slope representing the relative sensitivity factor (RSF). The x- and y-axes of the calibration curve represented ion beam ratio (IBR) and the authenticated value of the standard sample, respectively. In order to evaluate precision and linearity of the calibration curve, RSD and the coefficient of determination were calculated. Curve RSD for every element reflected high precision (within 10 %). For most elements, the coefficient of determination was ${\geq}0.99$, indicating excellent linearity. However, vanadium, nickel, and gallium curves exhibited relatively low linearity (0.90~0.95), likely due to their narrow concentration ranges. Standard RSF was calculated using the slope of the curve generated for three types of CRM. Despite vanadium, nickel, and gallium exhibiting low coefficients of determination, their standard RSF resembled that of the three types of CRM. Therefore, the RSF method may be used for element quantitation. Standard iron matrix samples were analyzed to verify the applicability of the aluminum matrix standard RSF, as well as to calculate the RSD-estimated error of the measured value relative to the actual standard value. Six elements (Al, Si, V, Cr, Mn, and Ni) exhibited an RSD of approximately 30 %, while the RSD of Cu was 77 %. In general, Cu isotopes are subject to interference: $^{63}Cu$ to $^{54}Fe^{2+}-^{36}Ar$ and $^{65}Cu$ to $^{56}Fe-Al^{3+}$ interference. Thus, the influence of these impurities may have contributed to the high RSD value observed for Cu. To reliably identify copper, the resolution should be set at ${\geq}8000$. However, high resolutions are inappropriate for analyzing trace elements, as it lowers ion permeability. In conclusion, quantitation of even relatively low amounts of six elements (Al, Si, V, Cr, Mn, and Ni) is possible using this method.

The Study on the Anssolim Technnique of Columns of Main-hall Architectures in Korean Palaces (궁궐 정전건축 기둥 안쏠림기법 고찰)

  • Kim, Derk Moon
    • Korean Journal of Heritage: History & Science
    • /
    • v.43 no.2
    • /
    • pp.40-59
    • /
    • 2010
  • Anssolim is the unique technique which standing columns lean in a inward direction of buildings in traditional architecture, which has not been thoroughly investigated to this day. With a dearth of previous studies, the anssolim technique can only be examined through detailed three-dimensional surveys. The main halls of Korean palaces can be seen as buildings that were built with the regulations of the day in mind, making them excellent research subjects when studying the anssolim technique. The findings can be summarized as follows. 1. In the main halls that were studied, anssolim was applied most to main space (eokan) columns, then lessened for peripheral columns. 2. The largest second-floor cheoma columns were placed inward in the eokan, then became smaller as with the peripheral columns. In the case of the eokan, the columns were arranged according to the size of the anssolim. 3. The second-floor cheoma column anssolim in the middle-floor main hall were generally a third or a quarter of the size of those on the first floor. As on the first floor, the largest anssolim were applied to the eokan columns, then became gradually smaller towards the periphery columns. 4. In the palace main halls, the largest anssolim were used for the eokan columns, and became smaller with the peripheral columns. This unique structure can be seen to be a Korean technique that deviates from the Chinese "Yingzaofashi(營造法式)" techniques. Although this study is limited in that it only studies the main hall of Korean palaces, it is significant in that it shed new light on the technological implications of the anssolim technique, and can be used as important data for research into the history of technology. Although this type of data is difficult to extrapolate, it has been made as accurate as possible by minimizing the margin of error in the data for the palaces that were actually studied.

Rice Yield Estimation Using Sentinel-2 Satellite Imagery, Rainfall and Soil Data (Sentinel-2 위성영상과 강우 및 토양자료를 활용한 벼 수량 추정)

  • KIM, Kyoung-Seop;CHOUNG, Yun-Jae;JUN, Byong-Woon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.25 no.1
    • /
    • pp.133-149
    • /
    • 2022
  • Existing domestic studies on estimating rice yield were mainly implemented at the level of cities and counties in the entire nation using MODIS satellite images with low spatial resolution. Unlike previous studies, this study tried to estimate rice yield at the level of eup-myon-dong in Gimje-si, Jeollabuk-do using Sentinel-2 satellite images with medium spatial resolution, rainfall and soil data, and then to evaluate its accuracy. Five vegetation indices such as NDVI, LAI, EVI2, MCARI1 and MCARI2 derived from Sentinel-2 images of August 1, 2018 for Gimje-si, Jeollabuk-do, rainfall and paddy soil-type data were aggregated by the level of eup-myon-dong and then rice yield was estimated with gamma generalized linear model, an expanded variant of multi-variate regression analysis to solve the non-normality problem of dependent variable. In the rice yield model finally developed, EVI2, rainfall days in September, and saline soils ratio were used as significant independent variables. The coefficient of determination representing the model fit was 0.68 and the RMSE for showing the model accuracy was 62.29kg/10a. This model estimated the total rice production in Gimje-si in 2018 to be 96,914.6M/T, which was very close to 94,470.3M/T the actual amount specified in the Statistical Yearbook with an error of 0.46%. Also, the rice production per unit area of Gimje-si was amounted to 552kg/10a, which was almost consistent with 550kg/10a of the statistical data. This result is similar to that of the previous studies and it demonstrated that the rice yield can be estimated using Sentinel-2 satellite images at the level of cities and counties or smaller districts in Korea.

Verification of Multi-point Displacement Response Measurement Algorithm Using Image Processing Technique (영상처리기법을 이용한 다중 변위응답 측정 알고리즘의 검증)

  • Kim, Sung-Wan;Kim, Nam-Sik
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.3A
    • /
    • pp.297-307
    • /
    • 2010
  • Recently, maintenance engineering and technology for civil and building structures have begun to draw big attention and actually the number of structures that need to be evaluate on structural safety due to deterioration and performance degradation of structures are rapidly increasing. When stiffness is decreased because of deterioration of structures and member cracks, dynamic characteristics of structures would be changed. And it is important that the damaged areas and extent of the damage are correctly evaluated by analyzing dynamic characteristics from the actual behavior of a structure. In general, typical measurement instruments used for structure monitoring are dynamic instruments. Existing dynamic instruments are not easy to obtain reliable data when the cable connecting measurement sensors and device is long, and have uneconomical for 1 to 1 connection process between each sensor and instrument. Therefore, a method without attaching sensors to measure vibration at a long range is required. The representative applicable non-contact methods to measure the vibration of structures are laser doppler effect, a method using GPS, and image processing technique. The method using laser doppler effect shows relatively high accuracy but uneconomical while the method using GPS requires expensive equipment, and has its signal's own error and limited speed of sampling rate. But the method using image signal is simple and economical, and is proper to get vibration of inaccessible structures and dynamic characteristics. Image signals of camera instead of sensors had been recently used by many researchers. But the existing method, which records a point of a target attached on a structure and then measures vibration using image processing technique, could have relatively the limited objects of measurement. Therefore, this study conducted shaking table test and field load test to verify the validity of the method that can measure multi-point displacement responses of structures using image processing technique.

Model Evaluation for Predicting the Full Bloom Date of Apples Based on Air Temperature Variations in South Korea's Major Production Regions (기온 변화에 따른 우리나라 사과 주산지 만개일 예측을 위한 모델 평가)

  • Jae Hoon Jeong;Jeom Hwa Han;Jung Gun Cho;Dong Yong Lee;Seul Ki Lee;Si Hyeong Jang;Suhyun Ryu
    • Journal of Bio-Environment Control
    • /
    • v.32 no.4
    • /
    • pp.501-512
    • /
    • 2023
  • This study aimed to assess and determine the optimal model for predicting the full bloom date of 'Fuji' apples across South Korea. We evaluated the performance of four distinct models: the Development Rate Model (DVR)1, DVR2, the Chill Days (CD) model, and a sequentially integrated approach that combined the Dynamic model (DM) and the Growing Degree Hours (GDH) model. The full bloom dates and air temperatures were collected over a three-year period from six orchards located in the major apple production regions of South Korea: Pocheon, Hwaseong, Geochang, Cheongsong, Gunwi, and Chungju. Among these models, the one that combined DM for calculating chilling accumulation and the GDH model for estimating heat accumulation in sequence demonstrated the most accurate predictive performance, in contrast to the CD model that exhibited the lowest predictive precision. Furthermore, the DVR1 model exhibited an underestimation error at orchard located in Hwaseong. It projected a faster progression of the full bloom dates than the actual observations. This area is characterized by minimal diurnal temperature ranges, where the daily minimum temperature is high and the daily maximum temperature is relatively low. Therefore, to achieve a comprehensive prediction of the blooming date of 'Fuji' apples across South Korea, it is recommended to integrate a DM model for calculating the necessary chilling accumulation to break dormancy with a GDH model for estimating the requisite heat accumulation for flowering after dormancy release. This results in a combined DM+GDH model recognized as the most effective approach. However, further data collection and evaluation from different regions are needed to further refine its accuracy and applicability.

DEVELOPMENT OF STATEWIDE TRUCK TRAFFIC FORECASTING METHOD BY USING LIMITED O-D SURVEY DATA (한정된 O-D조사자료를 이용한 주 전체의 트럭교통예측방법 개발)

  • 박만배
    • Proceedings of the KOR-KST Conference
    • /
    • 1995.02a
    • /
    • pp.101-113
    • /
    • 1995
  • The objective of this research is to test the feasibility of developing a statewide truck traffic forecasting methodology for Wisconsin by using Origin-Destination surveys, traffic counts, classification counts, and other data that are routinely collected by the Wisconsin Department of Transportation (WisDOT). Development of a feasible model will permit estimation of future truck traffic for every major link in the network. This will provide the basis for improved estimation of future pavement deterioration. Pavement damage rises exponentially as axle weight increases, and trucks are responsible for most of the traffic-induced damage to pavement. Consequently, forecasts of truck traffic are critical to pavement management systems. The pavement Management Decision Supporting System (PMDSS) prepared by WisDOT in May 1990 combines pavement inventory and performance data with a knowledge base consisting of rules for evaluation, problem identification and rehabilitation recommendation. Without a r.easonable truck traffic forecasting methodology, PMDSS is not able to project pavement performance trends in order to make assessment and recommendations in the future years. However, none of WisDOT's existing forecasting methodologies has been designed specifically for predicting truck movements on a statewide highway network. For this research, the Origin-Destination survey data avaiiable from WisDOT, including two stateline areas, one county, and five cities, are analyzed and the zone-to'||'&'||'not;zone truck trip tables are developed. The resulting Origin-Destination Trip Length Frequency (00 TLF) distributions by trip type are applied to the Gravity Model (GM) for comparison with comparable TLFs from the GM. The gravity model is calibrated to obtain friction factor curves for the three trip types, Internal-Internal (I-I), Internal-External (I-E), and External-External (E-E). ~oth "macro-scale" calibration and "micro-scale" calibration are performed. The comparison of the statewide GM TLF with the 00 TLF for the macro-scale calibration does not provide suitable results because the available 00 survey data do not represent an unbiased sample of statewide truck trips. For the "micro-scale" calibration, "partial" GM trip tables that correspond to the 00 survey trip tables are extracted from the full statewide GM trip table. These "partial" GM trip tables are then merged and a partial GM TLF is created. The GM friction factor curves are adjusted until the partial GM TLF matches the 00 TLF. Three friction factor curves, one for each trip type, resulting from the micro-scale calibration produce a reasonable GM truck trip model. A key methodological issue for GM. calibration involves the use of multiple friction factor curves versus a single friction factor curve for each trip type in order to estimate truck trips with reasonable accuracy. A single friction factor curve for each of the three trip types was found to reproduce the 00 TLFs from the calibration data base. Given the very limited trip generation data available for this research, additional refinement of the gravity model using multiple mction factor curves for each trip type was not warranted. In the traditional urban transportation planning studies, the zonal trip productions and attractions and region-wide OD TLFs are available. However, for this research, the information available for the development .of the GM model is limited to Ground Counts (GC) and a limited set ofOD TLFs. The GM is calibrated using the limited OD data, but the OD data are not adequate to obtain good estimates of truck trip productions and attractions .. Consequently, zonal productions and attractions are estimated using zonal population as a first approximation. Then, Selected Link based (SELINK) analyses are used to adjust the productions and attractions and possibly recalibrate the GM. The SELINK adjustment process involves identifying the origins and destinations of all truck trips that are assigned to a specified "selected link" as the result of a standard traffic assignment. A link adjustment factor is computed as the ratio of the actual volume for the link (ground count) to the total assigned volume. This link adjustment factor is then applied to all of the origin and destination zones of the trips using that "selected link". Selected link based analyses are conducted by using both 16 selected links and 32 selected links. The result of SELINK analysis by u~ing 32 selected links provides the least %RMSE in the screenline volume analysis. In addition, the stability of the GM truck estimating model is preserved by using 32 selected links with three SELINK adjustments, that is, the GM remains calibrated despite substantial changes in the input productions and attractions. The coverage of zones provided by 32 selected links is satisfactory. Increasing the number of repetitions beyond four is not reasonable because the stability of GM model in reproducing the OD TLF reaches its limits. The total volume of truck traffic captured by 32 selected links is 107% of total trip productions. But more importantly, ~ELINK adjustment factors for all of the zones can be computed. Evaluation of the travel demand model resulting from the SELINK adjustments is conducted by using screenline volume analysis, functional class and route specific volume analysis, area specific volume analysis, production and attraction analysis, and Vehicle Miles of Travel (VMT) analysis. Screenline volume analysis by using four screenlines with 28 check points are used for evaluation of the adequacy of the overall model. The total trucks crossing the screenlines are compared to the ground count totals. L V/GC ratios of 0.958 by using 32 selected links and 1.001 by using 16 selected links are obtained. The %RM:SE for the four screenlines is inversely proportional to the average ground count totals by screenline .. The magnitude of %RM:SE for the four screenlines resulting from the fourth and last GM run by using 32 and 16 selected links is 22% and 31 % respectively. These results are similar to the overall %RMSE achieved for the 32 and 16 selected links themselves of 19% and 33% respectively. This implies that the SELINICanalysis results are reasonable for all sections of the state.Functional class and route specific volume analysis is possible by using the available 154 classification count check points. The truck traffic crossing the Interstate highways (ISH) with 37 check points, the US highways (USH) with 50 check points, and the State highways (STH) with 67 check points is compared to the actual ground count totals. The magnitude of the overall link volume to ground count ratio by route does not provide any specific pattern of over or underestimate. However, the %R11SE for the ISH shows the least value while that for the STH shows the largest value. This pattern is consistent with the screenline analysis and the overall relationship between %RMSE and ground count volume groups. Area specific volume analysis provides another broad statewide measure of the performance of the overall model. The truck traffic in the North area with 26 check points, the West area with 36 check points, the East area with 29 check points, and the South area with 64 check points are compared to the actual ground count totals. The four areas show similar results. No specific patterns in the L V/GC ratio by area are found. In addition, the %RMSE is computed for each of the four areas. The %RMSEs for the North, West, East, and South areas are 92%, 49%, 27%, and 35% respectively, whereas, the average ground counts are 481, 1383, 1532, and 3154 respectively. As for the screenline and volume range analyses, the %RMSE is inversely related to average link volume. 'The SELINK adjustments of productions and attractions resulted in a very substantial reduction in the total in-state zonal productions and attractions. The initial in-state zonal trip generation model can now be revised with a new trip production's trip rate (total adjusted productions/total population) and a new trip attraction's trip rate. Revised zonal production and attraction adjustment factors can then be developed that only reflect the impact of the SELINK adjustments that cause mcreases or , decreases from the revised zonal estimate of productions and attractions. Analysis of the revised production adjustment factors is conducted by plotting the factors on the state map. The east area of the state including the counties of Brown, Outagamie, Shawano, Wmnebago, Fond du Lac, Marathon shows comparatively large values of the revised adjustment factors. Overall, both small and large values of the revised adjustment factors are scattered around Wisconsin. This suggests that more independent variables beyond just 226; population are needed for the development of the heavy truck trip generation model. More independent variables including zonal employment data (office employees and manufacturing employees) by industry type, zonal private trucks 226; owned and zonal income data which are not available currently should be considered. A plot of frequency distribution of the in-state zones as a function of the revised production and attraction adjustment factors shows the overall " adjustment resulting from the SELINK analysis process. Overall, the revised SELINK adjustments show that the productions for many zones are reduced by, a factor of 0.5 to 0.8 while the productions for ~ relatively few zones are increased by factors from 1.1 to 4 with most of the factors in the 3.0 range. No obvious explanation for the frequency distribution could be found. The revised SELINK adjustments overall appear to be reasonable. The heavy truck VMT analysis is conducted by comparing the 1990 heavy truck VMT that is forecasted by the GM truck forecasting model, 2.975 billions, with the WisDOT computed data. This gives an estimate that is 18.3% less than the WisDOT computation of 3.642 billions of VMT. The WisDOT estimates are based on the sampling the link volumes for USH, 8TH, and CTH. This implies potential error in sampling the average link volume. The WisDOT estimate of heavy truck VMT cannot be tabulated by the three trip types, I-I, I-E ('||'&'||'pound;-I), and E-E. In contrast, the GM forecasting model shows that the proportion ofE-E VMT out of total VMT is 21.24%. In addition, tabulation of heavy truck VMT by route functional class shows that the proportion of truck traffic traversing the freeways and expressways is 76.5%. Only 14.1% of total freeway truck traffic is I-I trips, while 80% of total collector truck traffic is I-I trips. This implies that freeways are traversed mainly by I-E and E-E truck traffic while collectors are used mainly by I-I truck traffic. Other tabulations such as average heavy truck speed by trip type, average travel distance by trip type and the VMT distribution by trip type, route functional class and travel speed are useful information for highway planners to understand the characteristics of statewide heavy truck trip patternS. Heavy truck volumes for the target year 2010 are forecasted by using the GM truck forecasting model. Four scenarios are used. Fo~ better forecasting, ground count- based segment adjustment factors are developed and applied. ISH 90 '||'&'||' 94 and USH 41 are used as example routes. The forecasting results by using the ground count-based segment adjustment factors are satisfactory for long range planning purposes, but additional ground counts would be useful for USH 41. Sensitivity analysis provides estimates of the impacts of the alternative growth rates including information about changes in the trip types using key routes. The network'||'&'||'not;based GMcan easily model scenarios with different rates of growth in rural versus . . urban areas, small versus large cities, and in-state zones versus external stations. cities, and in-state zones versus external stations.

  • PDF

A Comparative Analysis of Social Commerce and Open Market Using User Reviews in Korean Mobile Commerce (사용자 리뷰를 통한 소셜커머스와 오픈마켓의 이용경험 비교분석)

  • Chae, Seung Hoon;Lim, Jay Ick;Kang, Juyoung
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.4
    • /
    • pp.53-77
    • /
    • 2015
  • Mobile commerce provides a convenient shopping experience in which users can buy products without the constraints of time and space. Mobile commerce has already set off a mega trend in Korea. The market size is estimated at approximately 15 trillion won (KRW) for 2015, thus far. In the Korean market, social commerce and open market are key components. Social commerce has an overwhelming open market in terms of the number of users in the Korean mobile commerce market. From the point of view of the industry, quick market entry, and content curation are considered to be the major success factors, reflecting the rapid growth of social commerce in the market. However, academics' empirical research and analysis to prove the success rate of social commerce is still insufficient. Henceforward, it is to be expected that social commerce and the open market in the Korean mobile commerce will compete intensively. So it is important to conduct an empirical analysis to prove the differences in user experience between social commerce and open market. This paper is an exploratory study that shows a comparative analysis of social commerce and the open market regarding user experience, which is based on the mobile users' reviews. Firstly, this study includes a collection of approximately 10,000 user reviews of social commerce and open market listed Google play. A collection of mobile user reviews were classified into topics, such as perceived usefulness and perceived ease of use through LDA topic modeling. Then, a sentimental analysis and co-occurrence analysis on the topics of perceived usefulness and perceived ease of use was conducted. The study's results demonstrated that social commerce users have a more positive experience in terms of service usefulness and convenience versus open market in the mobile commerce market. Social commerce has provided positive user experiences to mobile users in terms of service areas, like 'delivery,' 'coupon,' and 'discount,' while open market has been faced with user complaints in terms of technical problems and inconveniences like 'login error,' 'view details,' and 'stoppage.' This result has shown that social commerce has a good performance in terms of user service experience, since the aggressive marketing campaign conducted and there have been investments in building logistics infrastructure. However, the open market still has mobile optimization problems, since the open market in mobile commerce still has not resolved user complaints and inconveniences from technical problems. This study presents an exploratory research method used to analyze user experience by utilizing an empirical approach to user reviews. In contrast to previous studies, which conducted surveys to analyze user experience, this study was conducted by using empirical analysis that incorporates user reviews for reflecting users' vivid and actual experiences. Specifically, by using an LDA topic model and TAM this study presents its methodology, which shows an analysis of user reviews that are effective due to the method of dividing user reviews into service areas and technical areas from a new perspective. The methodology of this study has not only proven the differences in user experience between social commerce and open market, but also has provided a deep understanding of user experience in Korean mobile commerce. In addition, the results of this study have important implications on social commerce and open market by proving that user insights can be utilized in establishing competitive and groundbreaking strategies in the market. The limitations and research direction for follow-up studies are as follows. In a follow-up study, it will be required to design a more elaborate technique of the text analysis. This study could not clearly refine the user reviews, even though the ones online have inherent typos and mistakes. This study has proven that the user reviews are an invaluable source to analyze user experience. The methodology of this study can be expected to further expand comparative research of services using user reviews. Even at this moment, users around the world are posting their reviews about service experiences after using the mobile game, commerce, and messenger applications.