• Title/Summary/Keyword: Pre Processing

Search Result 1,998, Processing Time 0.029 seconds

Latent topics-based product reputation mining (잠재 토픽 기반의 제품 평판 마이닝)

  • Park, Sang-Min;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.39-70
    • /
    • 2017
  • Data-drive analytics techniques have been recently applied to public surveys. Instead of simply gathering survey results or expert opinions to research the preference for a recently launched product, enterprises need a way to collect and analyze various types of online data and then accurately figure out customer preferences. In the main concept of existing data-based survey methods, the sentiment lexicon for a particular domain is first constructed by domain experts who usually judge the positive, neutral, or negative meanings of the frequently used words from the collected text documents. In order to research the preference for a particular product, the existing approach collects (1) review posts, which are related to the product, from several product review web sites; (2) extracts sentences (or phrases) in the collection after the pre-processing step such as stemming and removal of stop words is performed; (3) classifies the polarity (either positive or negative sense) of each sentence (or phrase) based on the sentiment lexicon; and (4) estimates the positive and negative ratios of the product by dividing the total numbers of the positive and negative sentences (or phrases) by the total number of the sentences (or phrases) in the collection. Furthermore, the existing approach automatically finds important sentences (or phrases) including the positive and negative meaning to/against the product. As a motivated example, given a product like Sonata made by Hyundai Motors, customers often want to see the summary note including what positive points are in the 'car design' aspect as well as what negative points are in thesame aspect. They also want to gain more useful information regarding other aspects such as 'car quality', 'car performance', and 'car service.' Such an information will enable customers to make good choice when they attempt to purchase brand-new vehicles. In addition, automobile makers will be able to figure out the preference and positive/negative points for new models on market. In the near future, the weak points of the models will be improved by the sentiment analysis. For this, the existing approach computes the sentiment score of each sentence (or phrase) and then selects top-k sentences (or phrases) with the highest positive and negative scores. However, the existing approach has several shortcomings and is limited to apply to real applications. The main disadvantages of the existing approach is as follows: (1) The main aspects (e.g., car design, quality, performance, and service) to a product (e.g., Hyundai Sonata) are not considered. Through the sentiment analysis without considering aspects, as a result, the summary note including the positive and negative ratios of the product and top-k sentences (or phrases) with the highest sentiment scores in the entire corpus is just reported to customers and car makers. This approach is not enough and main aspects of the target product need to be considered in the sentiment analysis. (2) In general, since the same word has different meanings across different domains, the sentiment lexicon which is proper to each domain needs to be constructed. The efficient way to construct the sentiment lexicon per domain is required because the sentiment lexicon construction is labor intensive and time consuming. To address the above problems, in this article, we propose a novel product reputation mining algorithm that (1) extracts topics hidden in review documents written by customers; (2) mines main aspects based on the extracted topics; (3) measures the positive and negative ratios of the product using the aspects; and (4) presents the digest in which a few important sentences with the positive and negative meanings are listed in each aspect. Unlike the existing approach, using hidden topics makes experts construct the sentimental lexicon easily and quickly. Furthermore, reinforcing topic semantics, we can improve the accuracy of the product reputation mining algorithms more largely than that of the existing approach. In the experiments, we collected large review documents to the domestic vehicles such as K5, SM5, and Avante; measured the positive and negative ratios of the three cars; showed top-k positive and negative summaries per aspect; and conducted statistical analysis. Our experimental results clearly show the effectiveness of the proposed method, compared with the existing method.

A Study on Analysis of consumer perception of YouTube advertising using text mining (텍스트 마이닝을 활용한 Youtube 광고에 대한 소비자 인식 분석)

  • Eum, Seong-Won
    • Management & Information Systems Review
    • /
    • v.39 no.2
    • /
    • pp.181-193
    • /
    • 2020
  • This study is a study that analyzes consumer perception by utilizing text mining, which is a recent issue. we analyzed the consumer's perception of Samsung Galaxy by analyzing consumer reviews of Samsung Galaxy YouTube ads. for analysis, 1,819 consumer reviews of YouTube ads were extracted. through this data pre-processing, keywords for advertisements were classified and extracted into nouns, adjectives, and adverbs. after that, frequency analysis and emotional analysis were performed. Finally, clustering was performed through CONCOR. the summary of this study is as follows. the first most frequently mentioned words were Galaxy Note (n = 217), Good (n = 135), Pen (n = 40), and Function (n = 29). it can be judged through the advertisement that consumers "Galaxy Note", "Good", "Pen", and "Features" have good functional aspects for Samsung mobile phone products and positively recognize the Note Pen. in addition, the recognition of "Samsung Pay", "Innovation", "Design", and "iPhone" shows that Samsung's mobile phone is highly regarded for its innovative design and functional aspects of Samsung Pay. second, it is the result of sentiment analysis on YouTube advertising. As a result of emotional analysis, the ratio of emotional intensity was positive (75.95%) and higher than negative (24.05%). this means that consumers are positively aware of Samsung Galaxy mobile phones. As a result of the emotional keyword analysis, positive keywords were "good", "good", "innovative", "highest", "fast", "pretty", etc., negative keywords were "frightening", "I want to cry", "discomfort", "sorry", "no", etc. were extracted. the implication of this study is that most of the studies by quantitative analysis methods were considered when looking at the consumer perception study of existing advertisements. In this study, we deviated from quantitative research methods for advertising and attempted to analyze consumer perception through qualitative research. this is expected to have a great influence on future research, and I am sure that it will be a starting point for consumer awareness research through qualitative research.

The Effect of Home economic education teaching plans for students in academic and those in vocational high schools' 'Preparation for Successful aging' in the 'Family life in old age' unit -A comparative study between practical problem-teaching lesson plans and instructor-led teaching and learning plans- (인문계와 가사.실업 전문계 고등학생의 '성공적인 노후생활 준비교육'을 위한 가정과 수업의 적용과 효과 -실천적 문제 중심 수업과 강의식 수업을 중심으로-)

  • Lee, Jong-Hui;Cho, Byung-Eun
    • Journal of Korean Home Economics Education Association
    • /
    • v.23 no.4
    • /
    • pp.105-124
    • /
    • 2011
  • To achieve this objective, practical problem-teaching lesson plans and instructor-led teaching and learning plans were developed and integrated into the Technology Home Economics, and Human Development curricula at both academic and vocational high schools. The impact of these plans was examined, as were connections between the teaching methods and types of schools. As part of this study, a survey was conducted on 1,263 students in 46 classes at 6 randomly selected high schools: 4 academic and 2 vocational. A total of 9 teachers conducted classes for both experimental and comparative groups between October 2009 and November 2010. Pre- and post-tests were used to study the impact of the lessons on the experimental and comparative groups. In terms of data analysis and statistics processing, this study implemented mean and standard deviations, t-test, and analysis of covariance using the SPSS 12.0 program. The results of this study are as follows. The practical problem-teaching lessons produced more positive results in the students than the instructor-led lessons, in terms of their image of the elderly, their level of knowledge about them, their understanding of their need for welfare services, and preparation for Successful aging. When comparing the results by type of school, the experimental groups at academic high schools appeared to have a more positive image and better understanding of the elderly and their need for welfare services, and were better prepared for Successful aging than during their previous lessons. They also showed an increase in independence from their children in aging. As for the comparative groups, students at academic high schools showed an increase in preparation for Successful aging compared to the previous lessons. Finally, as for future research on preparation for aging in high schools, more schools should include this subject in their regular curriculum for Technology Home Economics, Human Development and Home Economics in order to generalize the results, and they need to evaluate the content. Additionally, this study suggests that new high school curricula should include lessons on preparation for aging so that students can deal successfully with our aging society.

  • PDF

A Double-Blind Comparison of Paroxetine and Amitriptyline in the Treatment of Depression Accompanied by Alcoholism : Behavioral Side Effects during the First 2 Weeks of Treatment (주정중독에 동반된 우울증의 치료에서 Paroxetine과 Amitriptyline의 이중맹 비교 : 치료초기 2주 동안의 행동학적 부작용)

  • Yoon, Jin-Sang;Yoon, Bo-Hyun;Choi, Tae-Seok;Kim, Yong-Bum;Lee, Hyung-Yung
    • Korean Journal of Biological Psychiatry
    • /
    • v.3 no.2
    • /
    • pp.277-287
    • /
    • 1996
  • Objective : It has been proposed that cognition and related aspects of mental functioning are decreased in depression as well as in alcoholism. The objective of the study was to compare behavioral side effects of paroxetine and amitriptyline in depressed patients accompanied by alcoholism. The focused comparisons were drug effects concerning psychomotor performance, cognitive function, sleep and daytime sleepiness during the first 2 weeks of treatment. Methods : After an alcohol detoxification period(3 weeks) and a washout period(1 week), a total of 20 male inpatients with alcohol use disorder (DSM-IV), who also had a major depressive episode(DSM-IV), were treated double-blind with paroxetine 20mg/day(n=10) or amitriptyline 25mg/day(n=10) for 2 weeks. All patients were required to have a scare of at least 18 respectively on bath the Hamilton Rating Scale far Depression(HAM-D) and Beck Depression Inventory(BDI) at pre-drug baseline. Patients randomized to paroxetine received active medication in the morning and placebo in the evening whereas those randomized to amitriptyline received active medication in the evening and placebo in the morning. All patients performed the various tasks in a test battery at baseline and at days 3, 7 and 14. The test battery included : critical flicker fusion threshold for sensory information processing capacity : choice reaction time for gross psychomotor performance : tracking accuracy and latency of response to peripheral stimulus as a measure of line sensorimotor co-ordination and divided attention : digit symbol substitution as a measure of sustained attention and concentration. To rate perceived sleep and daytime sleepiness, 10cm line Visual analogue scales were employed at baseline and at days 3, 7 and 14. The subjective rating scales were adapted far this study from Leeds sleep Evaluation Questionnaire and Epworth Sleepiness Scale. In addition a comprehensive side effect assessment, using the UKU side effect rating scale, was carried out at baseline and at days 7 and 14. The efficacy of treatment was evaluated using HAM-D, BDI and clinical global impression far severity and improvement at days 7 and 14. Results : The pattern of results indicated thai paroxetine improved performance an mast of the lest variables and also improved sleep with no effect on daytime sleepiness aver the study period. In contrast, amitriptyline produced disruption of performance on same tests and improved sleep with increased daytime sleepiness in particular at day 3. On the UKU side effect rating scale, mare side effects were registered an amitriptyline. The therapeutic efficacy was observed in favor of paroxetine early in day 7. Conclusion : These results demonstrated thai paroxetine in much better than amitriptyline for the treatment of depressed patients accompained by alcoholism at least in terms of behavioral safety and tolerability, furthermore the results may assist in explaining the therapeutic outcome of paroxetine. For example, and earlier onset of antidepressant action of paroxetine may be caused by early improved cognitive function or by contributing to good compliance with treatment.

  • PDF

Development of Market Growth Pattern Map Based on Growth Model and Self-organizing Map Algorithm: Focusing on ICT products (자기조직화 지도를 활용한 성장모형 기반의 시장 성장패턴 지도 구축: ICT제품을 중심으로)

  • Park, Do-Hyung;Chung, Jaekwon;Chung, Yeo Jin;Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.1-23
    • /
    • 2014
  • Market forecasting aims to estimate the sales volume of a product or service that is sold to consumers for a specific selling period. From the perspective of the enterprise, accurate market forecasting assists in determining the timing of new product introduction, product design, and establishing production plans and marketing strategies that enable a more efficient decision-making process. Moreover, accurate market forecasting enables governments to efficiently establish a national budget organization. This study aims to generate a market growth curve for ICT (information and communication technology) goods using past time series data; categorize products showing similar growth patterns; understand markets in the industry; and forecast the future outlook of such products. This study suggests the useful and meaningful process (or methodology) to identify the market growth pattern with quantitative growth model and data mining algorithm. The study employs the following methodology. At the first stage, past time series data are collected based on the target products or services of categorized industry. The data, such as the volume of sales and domestic consumption for a specific product or service, are collected from the relevant government ministry, the National Statistical Office, and other relevant government organizations. For collected data that may not be analyzed due to the lack of past data and the alteration of code names, data pre-processing work should be performed. At the second stage of this process, an optimal model for market forecasting should be selected. This model can be varied on the basis of the characteristics of each categorized industry. As this study is focused on the ICT industry, which has more frequent new technology appearances resulting in changes of the market structure, Logistic model, Gompertz model, and Bass model are selected. A hybrid model that combines different models can also be considered. The hybrid model considered for use in this study analyzes the size of the market potential through the Logistic and Gompertz models, and then the figures are used for the Bass model. The third stage of this process is to evaluate which model most accurately explains the data. In order to do this, the parameter should be estimated on the basis of the collected past time series data to generate the models' predictive value and calculate the root-mean squared error (RMSE). The model that shows the lowest average RMSE value for every product type is considered as the best model. At the fourth stage of this process, based on the estimated parameter value generated by the best model, a market growth pattern map is constructed with self-organizing map algorithm. A self-organizing map is learning with market pattern parameters for all products or services as input data, and the products or services are organized into an $N{\times}N$ map. The number of clusters increase from 2 to M, depending on the characteristics of the nodes on the map. The clusters are divided into zones, and the clusters with the ability to provide the most meaningful explanation are selected. Based on the final selection of clusters, the boundaries between the nodes are selected and, ultimately, the market growth pattern map is completed. The last step is to determine the final characteristics of the clusters as well as the market growth curve. The average of the market growth pattern parameters in the clusters is taken to be a representative figure. Using this figure, a growth curve is drawn for each cluster, and their characteristics are analyzed. Also, taking into consideration the product types in each cluster, their characteristics can be qualitatively generated. We expect that the process and system that this paper suggests can be used as a tool for forecasting demand in the ICT and other industries.

Effect of Therapeutic and Educational strategies using music on improvement of auditory information processing and short-term memory skills for children with underachievement (학습부진아의 청각정보처리와 단기기억력 향상을 위한 음악의 치료적·교육적 접근)

  • Chong, Hyun Ju
    • Journal of Music and Human Behavior
    • /
    • v.1 no.1
    • /
    • pp.1-10
    • /
    • 2004
  • Being engaged in the musical tasks needs cognitive skills to perceive musical sound, organize them into meaningful unit, store them in the memory and retrieve them when needed. These skills are also required for academic tasks indicating that there is positive correlation between skills for musical and academic tasks. Based on these findings, the study purported to examine whether the developed sessions can enhance cognitive skills which is composed of auditory information skills, which is composed of perceiving sounds, organizing them into groups based on the existing information or organization pattern, and short-term memory skills. Eighteen elementary students in 4, 5, and 6th grades have participated in the study. The study has administered Music Cognitive Skills Test(MCST) before and after implementing music therapy sessions. The MCST consisted of five parts, first one measuring the rhythm imitating skills, second, measuring the melodic imitation skills, third, measuring discriminative skills in identifying higher pitch, fourth, measuring discriminative skills in identifying identical chords, and lastly, measuring the tone retention skills. The results indicated that there was statistical difference between the pre and post test in rhythm and melody imitation skills. Because reproduction of perceived rhythm patterns requires memory skills, imitating patterns are considered cognitive skills. Also melody is defined adding spatial dimension to the rhythm which is temporal concept. Being able to understand melodic pattern and to reproduce the pattern also requires cognitive skills. The subjects have shown significant improvement in these two areas. In other areas, there were definite increase of scores, however, no significant differences. The study also explores interpretation of these results and also observed consistencies among the participants in completing the musical tasks.

  • PDF

Airborne Hyperspectral Imagery availability to estimate inland water quality parameter (수질 매개변수 추정에 있어서 항공 초분광영상의 가용성 고찰)

  • Kim, Tae-Woo;Shin, Han-Sup;Suh, Yong-Cheol
    • Korean Journal of Remote Sensing
    • /
    • v.30 no.1
    • /
    • pp.61-73
    • /
    • 2014
  • This study reviewed an application of water quality estimation using an Airborne Hyperspectral Imagery (A-HSI) and tested a part of Han River water quality (especially suspended solid) estimation with available in-situ data. The estimation of water quality was processed two methods. One is using observation data as downwelling radiance to water surface and as scattering and reflectance into water body. Other is linear regression analysis with water quality in-situ measurement and upwelling data as at-sensor radiance (or reflectance). Both methods drive meaningful results of RS estimation. However it has more effects on the auxiliary dataset as water quality in-situ measurement and water body scattering measurement. The test processed a part of Han River located Paldang-dam downstream. We applied linear regression analysis with AISA eagle hyperspectral sensor data and water quality measurement in-situ data. The result of linear regression for a meaningful band combination shows $-24.847+0.013L_{560}$ as 560 nm in radiance (L) with 0.985 R-square. To comparison with Multispectral Imagery (MSI) case, we make simulated Landsat TM by spectral resampling. The regression using MSI shows -55.932 + 33.881 (TM1/TM3) as radiance with 0.968 R-square. Suspended Solid (SS) concentration was about 3.75 mg/l at in-situ data and estimated SS concentration by A-HIS was about 3.65 mg/l, and about 5.85mg/l with MSI with same location. It shows overestimation trends case of estimating using MSI. In order to upgrade value for practical use and to estimate more precisely, it needs that minimizing sun glint effect into whole image, constructing elaborate flight plan considering solar altitude angle, and making good pre-processing and calibration system. We found some limitations and restrictions such as precise atmospheric correction, sample count of water quality measurement, retrieve spectral bands into A-HSI, adequate linear regression model selection, and quantitative calibration/validation method through the literature review and test adopted general methods.

Effects of Jeju Citrus unshiu Peel Extracts Before and After Bioconversion with Cytolase on Anti-Inflammatory Activity in RAW264.7 Cells (면역세포에서 Bioconversion 전후 제주 감귤 과피 추출물의 항염증 효과)

  • Seo, Jieun;Lim, Heejin;Chang, Yun-Hee;Park, Hye-Ryeon;Han, Bok-Kyung;Jeong, Jung-Ky;Choi, Kyoung-Sook;Park, Su-Beom;Choi, Hyuk-Joon;Hwang, Jinah
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.44 no.3
    • /
    • pp.331-337
    • /
    • 2015
  • Citrus and its peels, which are by-products from juice and/or jam processing, have long been used in Asian folk medicine. Citrus peels show an abundant variety of flavanones, and these flavanones have glycone and aglycone forms. Aglycones are more potent than glycones with a variety of physiological functions since aglycone absorption is more efficient than glycones. Bioconversion with cytolase converted narirutin and naringin into naringenin and hesperidin into hesperetin. Therefore, this study aimed to investigate the anti-oxidant and anti-inflammatory effects of bioconversion of Citrus unshiu (CU) peel extracts with cytolase (CU-C) in RAW264.7 cells. HPLC chromatograms showed that CU and CU-C had 23.42% and 29.39% total flavonoids, respectively. There was substantial bioconversion of narirutin to naringenin and of hesperidin to hesperetin. All citrus peel extracts showed DPPH scavenging activities in a dose-dependent manner, and CU-C was more potent than intact CU. RAW264.7 cells were pre-treated with $0{\sim}500{\mu}g/mL$ of citrus peel extracts for 4 h and then stimulated by $1{\mu}g/mL$ of lipopolysaccharide (LPS) for 8 h. All citrus peel extracts showed decreased mRNA levels and protein expression of LPS-induced inducible nitric oxide synthase (iNOS) and cyclooxygenase-2 (COX-2) in a dose-dependent manner. Especially, CU-C markedly inhibited mRNA and protein expression of iNOS and COX-2 compared to intact citrus peel extracts. All citrus peel extracts showed decreased NO production by iNOS activity. This result suggests that bioconversion of citrus peel extracts with cytolase may provide potent functional food materials for prevention of chronic diseases attributable to oxidation and inflammation by boosting the anti-inflammatory effects of citrus peels.

Role of p-38 MAP Kinase in apoptosis of hypoxia-induced osteoblasts (저산소 상태로 인한 조골세포 고사사기전에서 p-38 MAP kinase의 역할에 관한 연구)

  • Yoon, Jeong-Hyeon;Jeong, Ae-Jin;Kang, Kyung-Hwa;Kim, Sang-Cheol
    • The korean journal of orthodontics
    • /
    • v.33 no.3 s.98
    • /
    • pp.169-183
    • /
    • 2003
  • Tooth movement by orthodontic force effects great tissue changes within the periodontium, especially by shifting the blood flow in the pressure side and resulting in a hypoxic state of low oxygen tension. The aim of this study is to elucidate the possible mechanism of apoptosis in response to hypoxia in MC3T3El osteoblasts, the main cells in bone remodeling during orthodontic tooth movement. MC3T3El osteoblasts under hypoxic conditions ($2\%$ orygen) resulted in apoptosis in a time-dependent manner as estimated by DNA fragmentation assay and nuclear morphology stained with fluorescent dye, Hoechst 33258. Pretreatment with Z-VAD-FMK, a pancaspase inhibitor, or Z-DEVD-CHO, a specific caspase-3 inhibitor, completely suppressed the DNA ladder in response to hypoxia. An increase in caspase-3-like protease (DEVDase) activity was observed during apoptosis, but no caspase-1 activity (YVADase) was detected. To confirm what caspases are involved in apoptosis, Western blot analysis was performed using anti-caspase-3 or -6 antibodies. The 10-kDa protein, corresponding to the active products of caspase-3, and the 10-kDa protein of the active protein of caspase-6 were generated in hypoxia-challenged cells in which the processing of the full length form of caspase-3 and -6 was evident. While a time course similar to this caspase-3 and -6 activation was evident, hypoxic stress caused the cleavage of lamin A, which was typical of caspase-6 activity. In addition, the stress elicited the release of cytochrome c into the cytosol during apoptosis. Furthermore, we observed that pre-treatment with SB203580, a selective p38 mitogen activated protein kinase inhibitor, attenuated the hypoxia-induced apoptosis. The addition of SB203S80 suppressed caspase-3 and -6-like protease activity by hypoxia up to $50\%$. In contrast, PD98059 had no effect on the hypoxia-induced apoptosis. To confirm the involvement of MAP kinase, JNK/SAPK, ERK, or p38 kinase assay was performed. Although p38 MAPK was activated in response to hypoxic treatment, the other MAPK -JNK/SAPK or ERK- was either only modestly activated or not at all. These results suggest that p38 MAPK is involved in hypoxia-induced apoptosis in MC3T3El osteoblasts.

Design of a Crowd-Sourced Fingerprint Mapping and Localization System (군중-제공 신호지도 작성 및 위치 추적 시스템의 설계)

  • Choi, Eun-Mi;Kim, In-Cheol
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.9
    • /
    • pp.595-602
    • /
    • 2013
  • WiFi fingerprinting is well known as an effective localization technique used for indoor environments. However, this technique requires a large amount of pre-built fingerprint maps over the entire space. Moreover, due to environmental changes, these maps have to be newly built or updated periodically by experts. As a way to avoid this problem, crowd-sourced fingerprint mapping attracts many interests from researchers. This approach supports many volunteer users to share their WiFi fingerprints collected at a specific environment. Therefore, crowd-sourced fingerprinting can automatically update fingerprint maps up-to-date. In most previous systems, however, individual users were asked to enter their positions manually to build their local fingerprint maps. Moreover, the systems do not have any principled mechanism to keep fingerprint maps clean by detecting and filtering out erroneous fingerprints collected from multiple users. In this paper, we present the design of a crowd-sourced fingerprint mapping and localization(CMAL) system. The proposed system can not only automatically build and/or update WiFi fingerprint maps from fingerprint collections provided by multiple smartphone users, but also simultaneously track their positions using the up-to-date maps. The CMAL system consists of multiple clients to work on individual smartphones to collect fingerprints and a central server to maintain a database of fingerprint maps. Each client contains a particle filter-based WiFi SLAM engine, tracking the smartphone user's position and building each local fingerprint map. The server of our system adopts a Gaussian interpolation-based error filtering algorithm to maintain the integrity of fingerprint maps. Through various experiments, we show the high performance of our system.