• Title/Summary/Keyword: Over Sampling


A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data, which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, text mining has been employed to discover new market and/or technology opportunities and to support rational decision making by business participants. Market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been continuous demand in various fields for market information at the level of specific products. However, such information has generally been provided at the industry level or in broad categories based on classification standards, making it difficult to obtain specific and appropriate information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than previously offered. We applied the Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows: First, the data related to product information is collected, refined, and restructured into a form suitable for applying the Word2Vec model. Next, the preprocessed data is embedded into vector space by Word2Vec, and the product groups are then derived by extracting similar product names based on cosine similarity calculation. Finally, the sales data on the extracted products is summed to estimate the market size of the product groups. As experimental data, product-name text data from Statistics Korea's microdata (345,103 cases) were mapped into a multidimensional vector space by Word2Vec training. 
We performed parameter optimization for training and then applied a vector dimension of 300 and a window size of 15 as the optimized parameters for further experiments. We employed the index words of the Korean Standard Industry Classification (KSIC) as a product name dataset to cluster product groups more efficiently. Product names similar to KSIC indexes were extracted based on cosine similarity. The market size of the extracted products, treated as one product category, was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For performance verification, the results were compared with the actual market sizes of some items. The Pearson's correlation coefficient was 0.513. Our approach has several advantages over previous studies. First, text mining and machine learning techniques were applied for the first time to market size estimation, overcoming the limitations of traditional methods that rely on sampling or require multiple assumptions. In addition, the level of market category can be easily and efficiently adjusted according to the purpose of information use by changing the cosine similarity threshold. Furthermore, it has high potential for practical application since it can resolve unmet needs for detailed market size information in the public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support programs conducted by governmental institutions, as well as in business strategy consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic-based word embedding module can be advanced by giving a proper order to the preprocessed dataset or by combining another algorithm such as Jaccard similarity with Word2Vec. 
Also, the methods of product group clustering can be changed to other types of unsupervised machine learning algorithm. Our group is currently working on subsequent studies and we expect that it can further improve the performance of the conceptually proposed basic model in this study.
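
The grouping-and-summation step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 3-dimensional vectors, product names, sales figures, and the function name `estimate_market_size` are all hypothetical stand-ins (the study reports 300-dimensional Word2Vec embeddings trained on real product names). The sketch only shows how a cosine similarity threshold selects the products that form a group and how their sales are summed.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def estimate_market_size(index_vec, product_vecs, sales, threshold):
    """Group products whose embedding is close to an index word, then sum sales.

    Hypothetical helper: index_vec stands in for the embedding of a KSIC
    index word, product_vecs maps product names to embeddings, and sales
    maps product names to sales amounts.
    """
    members = [
        name
        for name, vec in product_vecs.items()
        if cosine(index_vec, vec) >= threshold
    ]
    return members, sum(sales[name] for name in members)


# Toy data (assumed for illustration only).
product_vecs = {
    "led lamp": [0.9, 0.1, 0.0],
    "led bulb": [0.8, 0.2, 0.1],
    "steel pipe": [0.0, 0.1, 0.9],
}
sales = {"led lamp": 120.0, "led bulb": 80.0, "steel pipe": 300.0}
index_vec = [1.0, 0.0, 0.0]

members, size = estimate_market_size(index_vec, product_vecs, sales, threshold=0.8)
print(members, size)
```

Raising the threshold narrows the product group and lowering it widens the group, which is the adjustment of market category level that the abstract mentions.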

Media Habits of Sensation Seekers (감지추구자적매체습관(感知追求者的媒体习惯))

  • Blakeney, Alisha;Findley, Casey;Self, Donald R.;Ingram, Rhea;Garrett, Tony
    • Journal of Global Scholars of Marketing Science
    • /
    • v.20 no.2
    • /
    • pp.179-187
    • /
    • 2010
  • Understanding consumers' preferences and use of media types is imperative for marketing and advertising managers, especially in today's fragmented market. A clear understanding assists managers in making more effective selections of appropriate media outlets, yet individuals' choices of type and use of media are based on a variety of characteristics. This paper examines one personality trait, sensation seeking, which has not appeared in the literature examining "new" media preferences and use. Sensation seeking is a personality trait defined as "the need for varied, novel, and complex sensations and experiences and the willingness to take physical and social risks for the sake of such experiences" (Zuckerman 1979). Six hypotheses were developed from a review of the literature. Particular attention was given to the Uses and Gratification theory (Katz 1959), which explains various reasons why people choose media types and their motivations for using the different types of media. Current theory suggests that High Sensation Seekers (HSS), due to their needs for novelty, arousal and unconventional content and imagery, would exhibit higher frequency of use of new media. Specifically, we hypothesize that HSS will use the internet more than broadcast (H1a) or print media (H1b) and more than low (LSS) (H2a) or medium sensation seekers (MSS) (H2b). In addition, HSS have been found to be more social and to have higher numbers of friends, and are therefore expected to use social networking websites such as Facebook/MySpace (H3) and chat rooms (H4) more than LSS (a) and MSS (b). Sensation seeking can manifest in a range of behaviors, including disinhibition. It is expected that alternative social networks such as Facebook/MySpace (H5) and chat rooms (H6) will be used more often by those who have higher levels of disinhibition than by those with low (a) or medium (b) levels. Data were collected using an online survey of participants in extreme sports. 
In order to reach this group, an improved version of a snowball sampling technique, chain-referral method, was used to select respondents for this study. This method was chosen as it is regarded as being effective to reach otherwise hidden population groups (Heckathorn, 1997). A final usable sample of 1108 respondents, which was mainly young (56.36% under 34), male (86.1%) and middle class (58.7% with household incomes over USD 50,000) was consistent with previous studies on sensation seeking. Sensation seeking was captured using an existing measure, the Brief Sensation Seeking Scale (Hoyle et al., 2002). Media usage was captured by measuring the self reported usage of various media types. Results did not support H1a and b. HSS did not show higher levels of usage of alternative media such as the internet showing in fact lower mean levels of usage than all the other types of media. The highest media type used by HSS was print media, suggesting that there is a revolt against the mainstream. Results support H2a and b that HSS are more frequent users of the internet than LSS or MSS. Further analysis revealed that there are significant differences in the use of print media between HSS and LSS, suggesting that HSS may seek out more specialized print publications in their respective extreme sport activity. Hypothesis 3a and b showed that HSS use Facebook/MySpace more frequently than either LSS or MSS. There were no significant differences in the use of chat rooms between LSS and HSS, so as a consequence no support for H4a, although significant for MSS H4b. Respondents with varying levels of disinhibition were expected to have different levels of use of Facebook/MySpace and chat-rooms. There was support for the higher levels of use of Facebook/MySpace for those with high levels of disinhibition than low or medium levels, supporting H5a and b. 
Similarly, there was support for H6b: those with high levels of disinhibition use chat rooms significantly more than those with medium levels, but not more than those with low levels (H6a). The findings are counterintuitive and give some interesting insights for managers. First, although HSS use online media more frequently than LSS or MSS, this group's use of online media is less than its use of either print or broadcast media. The advertising executive should not place too much emphasis on online media for this important market segment. Second, social media, such as Facebook/MySpace and chat rooms, should be examined by managers as potential ways to reach this group. Finally, the higher levels of social media use by those who are disinhibited have some implications for public policy. These individuals are more inclined to engage in more socially risky behavior, which may have some dire implications, e.g. from internet predators or future employers. A limitation of the study is that only those who engage in extreme sports are included. This is by nature an HSS activity. A broader population is therefore needed to test whether these results hold.

The Effect of Structured Information on the Sleep Amount of Patients Undergoing Open Heart Surgery (계획된 간호 정보가 수면량에 미치는 영향에 관한 연구 -개심술 환자를 중심으로-)

  • 이소우
    • Journal of Korean Academy of Nursing
    • /
    • v.12 no.2
    • /
    • pp.1-26
    • /
    • 1982
  • The main purpose of this study was to test the effect of structured information on the sleep amount of patients undergoing open heart surgery. This study specifically addressed the following two basic research questions: (1) Would the structured information help reduce sleep disturbance related to anxiety and physical stress before and after the operation? and (2) What would be the effects of the structured information on the level of preoperative state anxiety, the hormonal change, and the degree of behavioral change in patients undergoing open heart surgery? A quasi-experimental study was designed to answer these questions with one experimental group and one control group. Subjects in both groups were matched as closely as possible to avoid the effect of differences inherent in the group characteristics. Baseline data were also collected on both groups for 7 days prior to the experiment, and subjects in both groups were found to have comparable sleep patterns, trait anxiety, hormonal levels and behavioral levels. Structured information, as the experimental input, was given to the subjects in the experimental group only. Data were collected and compared between the experimental group and the control group on the sleep amount of the consecutive pre- and postoperative days, on preoperative state anxiety level, and on hormonal and behavioral changes. To test the effectiveness of the structured information, two main hypotheses and three sub-hypotheses were formulated as follows. Main hypothesis 1: The experimental group, which received structured information, will have a greater sleep amount than the control group, which did not, on the night before the open heart surgery. 
Main hypothesis 2: The experimental group with structured information will have a greater sleep amount than the control group without structured information during the week following the open heart surgery. Sub-hypothesis 1: The experimental group with structured information will have a lower level of state anxiety than the control group without structured information on the night before the open heart surgery. Sub-hypothesis 2: The experimental group with structured information will have a lower hormonal level than the control group without structured information on the 5th day after the open heart surgery. Sub-hypothesis 3: The experimental group with structured information will have a lower behavioral change level than the control group without structured information during the week after the open heart surgery. The research was conducted in a national university hospital in Seoul, Korea. The 53 subjects who participated in the study were systematically divided into an experimental group and a control group, with assignment decided by a random sampling method. Among the 53 subjects, 26 were placed in the experimental group and 27 in the control group. Instruments: (1) Structured information: Structured information, the independent variable, was constructed by the researcher on the basis of Roy's adaptation model, consisting of physiologic needs, self-concept, role function and interdependence needs as related to sleep and to operational procedures. (2) Sleep amount measure: Sleep amount, the main dependent variable, was measured by trained nurses through observation on the basis of established criteria, such as closed or open eyes, regular or irregular respiration, body movement, posture, responses to light and questions, facial expressions and self-report after sleep. (3) State anxiety measure: State anxiety, a sub-dependent variable, was measured by Spielberger's STAI anxiety scale. (4) Hormonal change measure: Hormone, a sub-dependent variable, was measured by the cortisol level in plasma. 
(5) Behavior change measure: Behavior, a sub-dependent variable, was measured by the Behavior and Mood Rating Scale by Wyatt. The data were collected over a period of four months, from June to October 1981, after a pretest period of two months. For the analysis of the data and the tests of the hypotheses, the t-test for mean differences and analysis of covariance were used. The results of the tests of the instruments were as follows: (1) The STAI measurement for trait and state anxiety, analyzed by Cronbach's alpha coefficient for item analysis and reliability, showed reliability levels of r = .90 and r = .91 respectively. (2) The Behavior and Mood Rating Scale measurement was analyzed by means of the Principal Component Analysis technique. The seven factors retained were anger, anxiety, hyperactivity, depression, bizarre behavior, suspicious behavior and emotional withdrawal. The cumulative percentage of the seven factors was 71.3%. The results of the tests of the hypotheses were as follows: (1) Main hypothesis 1 was not supported. The experimental group had 282 minutes of sleep as compared to 255 minutes of sleep for the control group. Thus the sleep amount was higher in the experimental group than in the control group; however, the difference was not statistically significant at the .05 level. (2) Main hypothesis 2 was not supported. The mean sleep amounts of the experimental group and the control group were 297 minutes and 278 minutes respectively. Therefore, the experimental group had a greater sleep amount than the control group; however, the difference was not statistically significant at the .05 level. (3) Sub-hypothesis 1 was not supported. The mean state anxiety scores of the experimental group and the control group were 42.3 and 43.9 respectively. Thus, the experimental group had a slightly lower state anxiety level than the control group; however, the difference was not statistically significant at the .05 level. (4) Sub-hypothesis 2 was not supported. 
The mean hormonal levels of the experimental group and the control group were 338 ㎍ and 440 ㎍ respectively. Thus, the experimental group showed a lower hormonal level than the control group; however, the difference was not statistically significant at the .05 level. (5) Sub-hypothesis 3 was supported. The mean behavioral level scores of the experimental group and the control group were 29.60 and 32.00 respectively. Thus, the experimental group showed a lower behavioral change level than the control group. The difference was statistically significant at the .05 level. In summary, the structured information did not influence the sleep amount, state anxiety or hormonal level of the subjects undergoing open heart surgery at a statistically significant level; however, it showed definite trends in their relationships, in addition to its significant effect on the behavioral change level. It can further be speculated that the great degree of individual differences in variables such as sleep amount, state anxiety and fluctuation in hormonal level may be partly responsible for the statistical insensitivity to the experimentation.
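
The Cronbach's alpha reliability check mentioned in this abstract can be sketched as follows. This is a generic illustration of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), and not a reproduction of the study's analysis; the response matrix and the function name `cronbach_alpha` are made up for the example.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for both the items and the
    per-respondent total scores.
    """
    k = len(rows[0])
    item_variances = [variance([row[i] for row in rows]) for i in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 4 respondents x 2 items.
responses = [[1, 2], [2, 1], [3, 3], [4, 4]]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Values close to 1 indicate that the items vary together, which is the sense in which the reported coefficients of .90 and .91 indicate high internal consistency.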


Community Ecological Study on the Quercus acuta Forests in Bogildo-Island (보길도(甫吉島) 붉가시나무림(林)의 군락생태학적(群落生態學的) 연구(硏究))

  • Kim, Chong-Young;Lee, Jeong-Seok;Oh, Kwang-In;Jang, Seok-Ki;Park, Jin-Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.89 no.5
    • /
    • pp.618-629
    • /
    • 2000
  • This study was carried out to investigate the ecological niche of Quercus acuta communities on Bogildo Island from July to October 1998. The island is occupied by subtropical evergreen broad-leaved forests. The study of the community ecology of Q. acuta, the most dominant species of these subtropical forests, is very important for successful forest management. Sixteen quadrats dominated by Q. acuta were selected as sampling areas to examine vegetation characteristics (plant identification, D.B.H.) and environmental elements (microtopography, altitude, slope degree, aspect, illumination and soil physicochemical properties). On the basis of data from field surveys, importance values were calculated for the dominance of Q. acuta, and volume growth was analyzed by tree ring widths. The results obtained were as follows: 1. The vascular plants recorded in the investigation were identified as 54 families, 91 genera, 113 species, 9 varieties and 1 forma. Of these, 45 kinds were evergreen; 6 kinds (Camellia japonica, Ligustrum japonicum, Eurya japonica, Smilax china, Trachelospermum asiaticum var. intermedium, Carex lanceolata) were observed in all plots, and 5 species (Cinnamomum japonicum, Ardisia japonica, Cymbidium goeringii, Dryopteris bissetiana, Viburnum erosum) were observed in most plots (over 80%). 2. The dominant species per stratum were Quercus acuta, Castanopsis cuspidata sp., Quercus salicina, Pinus thunbergii and Prunus sargentii in the tree layer; Camellia japonica, Ligustrum japonicum, Quercus acuta, Eurya japonica and Castanopsis cuspidata sp. in the subtree layer; Camellia japonica, Ligustrum japonicum, Smilax china, Cinnamomum japonicum and Viburnum erosum in the shrub layer; and Trachelospermum asiaticum var. intermedium, Ardisia japonica, Carex lanceolata, Camellia japonica (seedlings) and Quercus acuta (seedlings) in the herb layer, all in descending order. 3. 
Quercus acuta could be regarded as a shade-intolerant tree, considering that it was distributed on southern, western, northern and eastern slopes in descending order. 4. The mean relative illumination in the forest was 0.89%, which is relatively low. 5. The sustainability of the Quercus acuta community could not be confirmed, judging from the reverse-J curve in the even-aged forest shown by the D.B.H. distribution analysis. 6. The annual ring width analysis (mean: 2.44 mm) showed three stages: a gentle increase (years 1~12; 2.04 mm), a relatively steep increase (years 13~22; 2.95 mm) and a decrease or stagnation (after year 23; 2.41 mm).


The Determination of Trust in Franchisor-Franchisee Relationships in China (중국 프랜차이즈 시스템에서의 본부와 가맹점간 신뢰의 영향요인)

  • Shin, Geon-Cheol;Ma, Yaokun
    • Journal of Global Scholars of Marketing Science
    • /
    • v.18 no.2
    • /
    • pp.65-88
    • /
    • 2008
  • Since the implementation of economic reforms in 1978, the Chinese economy has grown rapidly, at an average annual growth rate of 9% over the past two decades. Franchising has been widely recognized as an important source of entrepreneurial activity. Trust is important in that it facilitates relational exchanges by permitting partners to transcend short-run inequities or risks and concentrate on long-term profits or gains. In the relationship between franchisors and franchisees, trust has been described as an important source of competitive advantage. However, little research has been done on the factors affecting trust in Chinese franchisor-franchisee relationships. The purpose of this study is to investigate what factors affect trust in the franchise system in China, and to provide guidelines and insights to franchisors entering the Chinese market. In this study, following Morgan and Hunt (1994), trust is defined as existing when one party has confidence in an exchange partner's reliability and integrity. We offer a conceptual model for the empirical study. The model shows that the factors affecting trust include the franchisor's support, communication, satisfaction with previous outcomes and conflict. We also suggest that the franchisor's support and communication are likely to enhance the franchisee's satisfaction with previous outcomes, and that the franchisor's support, communication and the franchisee's satisfaction with previous outcomes tend to decrease conflict. Before the formal study, a pretest involving exploratory interviews with owners from three franchisees was conducted to make sure the questionnaire was relevant and clear to the respondents. The data were collected using trained interviewers to carry out personal interviews with the aid of an unidentified, multi-page, structured questionnaire. The respondents comprised owners, managers, and owner-managers of franchisee-owned food service franchises located in Beijing, China. 
Although a total of 256 potential franchises were initially contacted, the final usable sample consisted of 125 respondents. As expected, the sampling method was successful in soliciting respondents with varied personal and firm characteristics. Self-administered questionnaires were used for all measures, and established scales were used to measure the latent constructs in this study. The measures tapped the franchisees' perceptions of the relationship with the referent franchisor. Five-point Likert-type scales ranging from "strongly disagree" (=1) to "strongly agree" (=7) were used throughout the constructs (trust, eight items; support, five items; communication, four items; satisfaction, six items; conflict, three items). The reliability measurements traditionally employed, such as Cronbach's alpha, were used. All the reliabilities were greater than .80. The proposed measurement model was estimated using the SPSS 12.0 and AMOS 5.0 analysis packages. We conducted a series of exploratory factor analyses and confirmatory factor analyses to assess convergent validity, discriminant validity, and reliability. The results indicate reasonable overall fit between the model and the observed data. The overall fit indices of the measurement model were $\chi^2$ = 159.699, p = 0.004, d.f. = 116, GFI = .879, NFI = .898, CFI = .969, IFI = .970, TLI = .959, RMR = .058. The results demonstrated that the data fitted the model reasonably well. We also examined construct reliability and average variance extracted (AVE). The construct reliability of each construct was greater than .80 and the AVE of each construct was greater than .50. According to the Structural Equation Modeling (SEM) analysis, the results of the path model indicated an adequate fit: $\chi^2$ = 142.126, p = 0.044, d.f. = 115, GFI = .892, NFI = .909, CFI = .981, IFI = .981, TLI = .974, RMR = .057. 
As hypothesized, the results showed that it is strategically important to establish trust in a franchise system, and that the franchisor's support, communication and satisfaction with previous outcomes tend to reinforce the franchisee's trust. The results also showed that trust seems to decrease as the experience of conflict episodes increases. We also noticed that the franchisor's support and communication tend to enhance the franchisee's satisfaction with previous outcomes, and that communication tends to decrease conflict. If trust between the franchisor and franchisee can be established in a franchise system, franchising offers many benefits and reduces many costs. To manage a relationship of mutual trust with their franchisees, franchisors should provide support effectively to their franchisees. Effective assistance services have a direct effect on franchisees' satisfaction with previous outcomes and trust in the franchisor. In particular, the franchise sales process, orientation, and training in the start-up period are key elements for the success of the franchise system. Franchisor's support is an accumulation of separate satisfaction evaluations of the different kinds of service provided by the franchisor. Providing support can definitely improve the trustworthy image of the franchisor. In the franchise system, conflicts of interest and exertions of different power sources are very common. The experience of conflict episodes seems to be negatively related to trust. Therefore, it is important to reduce the negative side of relationship conflicts. Communication plays a broader role in reducing conflict and establishing mutual trust in the franchisor-franchisee relationship. Effective communication between franchisors and franchisees can also improve franchisees' satisfaction with the franchise system. As Chinese markets diversify, both franchisors and franchisees must maintain relevant, timely, and reliable communication. 
It is also very important to improve the quality of communication. Satisfaction with previous outcomes seems to be positively related to trust. Franchisors and franchisees that are highly satisfied with the previous outcomes that flow from their relationship will perceive their partner as advancing their goal achievement. Therefore, it is necessary for both franchisors and their franchisees to make an effort to look after the welfare of their partner. Little literature has focused on what factors affect trust between franchisors and their franchisees in China. This study developed hypotheses regarding the factors affecting trust in the transaction relationship. The results of the data analysis strongly supported the hypotheses. There are certain limitations in this study. First, some other factors omitted from this study could be significantly important. Second, the context of this study, the food service industry, limits its potential generalizability to all franchise systems. More studies in different categories of franchise system are needed to broaden its generalizability. Third, the model was tested empirically on a sample in Beijing; more empirical tests of the proposed model in other areas of China are needed. Finally, the analysis in this study was based solely on the perceptions of franchisees, and the opinions of franchisors were not included.
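
The construct reliability and AVE thresholds cited in this abstract (greater than .80 and greater than .50) can be made concrete with a short sketch. It assumes the commonly used Fornell-Larcker formulas computed from standardized factor loadings; the abstract does not state which formulas the authors used, and the loadings and function names below are hypothetical.

```python
def composite_reliability(loadings):
    """Composite (construct) reliability from standardized loadings.

    CR = (sum of loadings)^2 / ((sum of loadings)^2 + sum of error variances),
    where each error variance is taken as 1 - loading^2.
    """
    total = sum(loadings)
    error = sum(1 - l * l for l in loadings)
    return total * total / (total * total + error)


def average_variance_extracted(loadings):
    """AVE as the mean of the squared standardized loadings."""
    return sum(l * l for l in loadings) / len(loadings)


# Hypothetical standardized loadings for one construct (not from the study).
example_loadings = [0.82, 0.79, 0.85, 0.77]
cr = composite_reliability(example_loadings)
ave = average_variance_extracted(example_loadings)
print(round(cr, 3), round(ave, 3))
```

With loadings of this size both values clear the thresholds, which is the kind of evidence the abstract summarizes when it reports convergent validity and reliability.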


Postoperative Radiotherapy in the Rectal Cancers Patterns of Care Study for the Years of 1998~1999 (직장암의 방사선치료에 대한 Patterns of Care Study: 1998~1999년도 수술 후 방사선치료 환자들의 특성 및 치료내용에 대한 분석결과)

  • Kim, Jong-Hoon;Oh, Do-Hoon;Kang, Ki-Moon;Kim, Woo-Cheol;Kim, Won-Dong;Kim, Jung, Soo;Kim, June-Sang;Kim, Jin-Hee;Kil, Hak-Jae;Suh, Chang-Ok;Sohn, Seung-Chang;Ahn, Yong-Chan;Yang, Dae-Sik
    • Radiation Oncology Journal
    • /
    • v.23 no.1
    • /
    • pp.22-31
    • /
    • 2005
  • Purpose: To conduct a nationwide survey on the principles of radiotherapy for rectal cancer, and to produce a database for the Korean Patterns of Care Study. Materials and Methods: We developed a web-based Patterns of Care Study (PCS) system, and a national survey was conducted using random sampling based on power allocation methods. Eligible patients were those who had postoperative radiotherapy for rectal cancer without gross residual tumor after surgical resection and without a previous history of other cancer or radiotherapy to the pelvis. Patient data were entered into the web-based PCS system by the investigators at 19 institutions. Results: Information on 309 patients with rectal cancer who received radiotherapy between 1998 and 1999 was collected. The male to female ratio was 59:41, and the most common tumor location was the lower rectum (46%). Preoperative CEA was checked in 79% of cases and its value was higher than 6 ng/ml in 32%. Pathologic stage was I in 1.5%, II in 32%, III in 53%, and IV in 1.6%. Low anterior resection was the most common type of surgery and complete resection was performed in 95% of cases. The distal resection margin was less than 2 cm in 30%, and the number of lymph nodes dissected was less than 12 in 31%. Chemotherapy was performed in 91% and the most common regimen was 5-FU and leucovorin (59%). The most common type of field arrangement used for the initial pelvic field was the four field box (Posterior-Right-Left) technique (65.0%), and no AP-PA parallel opposing field was used. Patient position was prone in 81.2%, and a boost field was used in 61.8%. To displace bowel outward, pressure modulating devices or bladder filling was used in 40.1%. Radiation dose was prescribed to the isocenter in 45.3% and to an isodose line in 123 cases (39.8%). A percent delivered dose of over 90% was achieved in 92.9%. 
Conclusion: We found that the patterns of care for radiotherapy in Korean rectal cancer patients were similar to those of the US national survey. The type of surgery and the chemotherapy regimen varied among institutions, and the variations in radiation dose and field arrangement were within an acceptable range.

A Study on Relationship between Degree of Stress and Dyspepsia, Sleeping, Satisfaction of Adult Women in Rural Area (성인 여성들의 스트레스와 소화불량 및 수면장애와의 관련성)

  • Kim, Yeong-Hee;Cho, Soo-Yeul;Kang, Pock-Soo;Lee, Kyeong-Soo;Kim, Seok-Beom;Kim, Sang-Kyu;Kang, Young-Ah;Hwang, Young-Lork
    • Journal of agricultural medicine and community health
    • /
    • v.25 no.1
    • /
    • pp.51-63
    • /
    • 2000
  • Ten Dongs were selected according to the systematic cluster sampling in Koryong Gun, and the survey was conducted on 571 women in the age between 30-69 years. The first survey was performed for 6 days between August 27 to September 1, 1999 with the investigation rate of 60.3%, and the second survey was performed in November with the investigation rate of 91.8%. The contents of survey included demographic characteristics, health behaviors, dyspepsia symptom score, sleeping induction time and the degree of sleep satisfaction, and degree of stress in the subjects. The dyspepsia symptom score was in the average 13.4 points out of a total 44 points and was the highest in the 50-59 year-old age group with 13.9 points. The sleep induction time was in the average of 35 minutes and was the highest in the 50-59 year-old age group with 40.9 minutes; the degree of sleep satisfaction was in the average of 7.9 points and was the lowest in the 50-59 year-old age group with 7.5 points. The stress score was in the average of 18.3 points and was highest in those subjects in their 40's and 50's with 18.7 points. When the correlation among the stress score, the degree of sleep satisfaction and dyspepsia symptom score was analyzed, the results showed that he stress score and the degree of sleep satisfaction showed a significant negative correlation and that the stress score and dyspepsia symptom score showed a significant positive correlation. Also, a significant negative correlation was found between the degree of sleep satisfaction and dyspepsia symptom score. According to each age group, a significant correlation was revealed among the stress score, dyspepsia symptom score and the degree of sleep satisfaction in those subjects over 40 years of age compared to those subjects who were younger than 40 years of age. 
As for educational level, the correlation among the stress score, the degree of sleep satisfaction, and the dyspepsia symptom score was higher in subjects with less than a middle school education than in subjects with a high school education or more. When the factors affecting the dyspepsia symptom score were analyzed with multiple regression, the level of stress and chronic diseases were selected as significant variables. When the factors affecting the degree of sleep satisfaction were analyzed, the sleep induction time, the presence of chronic diseases, and stress were selected as significant variables. Women in their 50s living in rural areas showed the highest level of stress, the lowest degree of sleep satisfaction, and the highest level of dyspepsia, indicating that they need stress management. Also, since stress was shown to be a significant variable affecting dyspepsia and the degree of sleep satisfaction, it is concluded that health promotion is possible through stress management. More studies are needed on coping resources that strengthen coping against stress, and mental health promotion measures need to be developed through studies of stress and related factors among community residents.
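
The correlation analysis described in this abstract can be illustrated with a minimal sketch of the Pearson correlation coefficient. The abstract does not say which coefficient or software was used, so Pearson's r is an assumption here, and the scores below are hypothetical values invented for illustration, not survey data. Significance testing is not shown.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores for six respondents (NOT taken from the survey).
stress = [12, 15, 18, 20, 22, 25]
sleep_satisfaction = [9, 9, 8, 7, 7, 6]
dyspepsia = [8, 10, 13, 14, 17, 19]

r_stress_sleep = pearson_r(stress, sleep_satisfaction)     # negative sign
r_stress_dyspepsia = pearson_r(stress, dyspepsia)          # positive sign
r_sleep_dyspepsia = pearson_r(sleep_satisfaction, dyspepsia)  # negative sign
```

With these made-up values the signs match the pattern the abstract reports: stress moves against sleep satisfaction and with the dyspepsia score.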

  • PDF

An Analytical Approach Using Topic Mining for Improving the Service Quality of Hotels (호텔 산업의 서비스 품질 향상을 위한 토픽 마이닝 기반 분석 방법)

  • Moon, Hyun Sil;Sung, David;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.21-41
    • /
    • 2019
  • Thanks to the rapid development of information technologies, the data available on the Internet have grown rapidly. In this era of big data, many studies have attempted to offer insights and demonstrate the effects of data analysis. In the tourism and hospitality industry, many firms and studies have paid attention to online reviews on social media because of their large influence over customers. As tourism is an information-intensive industry, the effect of these information networks on social media platforms is more remarkable than that of any other type of media. However, there are some limitations to the improvements in service quality that can be made based on opinions on social media platforms. Users on social media platforms express their opinions as text, images, and so on. Raw data sets from these reviews are unstructured. Moreover, these data sets are too big for people to extract new information and hidden knowledge manually. To use them for business intelligence and analytics applications, proper big data techniques such as Natural Language Processing and data mining are needed. This study suggests an analytical approach to directly yield insights from these reviews to improve the service quality of hotels. Our proposed approach consists of topic mining to extract topics contained in the reviews and decision tree modeling to explain the relationship between topics and ratings. Topic mining refers to a method for finding a group of words from a collection of documents that represents a document. Among several topic mining methods, we adopted the Latent Dirichlet Allocation (LDA) algorithm, which is considered the most universal algorithm. However, LDA is not enough to find insights that can improve service quality because it cannot find the relationship between topics and ratings. To overcome this limitation, we also use the Classification and Regression Tree (CART) method, which is a kind of decision tree technique. 
Through the CART method, we can find which topics are related to positive or negative ratings of a hotel and visualize the results. Therefore, this study aims to present an analytical approach for improving hotel service quality from unstructured review data sets. Through experiments on four hotels in Hong Kong, we can find the strengths and weaknesses of services for each hotel and suggest improvements to aid customer satisfaction. From positive reviews in particular, we find what these hotels should maintain in their service quality. For example, the positive reviews of one hotel show that, compared with the other hotels, it has a good location and room condition. In contrast, we also find from negative reviews what the hotels should change in their services. For example, one hotel should improve its room condition with respect to soundproofing. These results mean that our approach is useful in finding insights for the service quality of hotels. That is, from an enormous volume of review data, our approach can provide practical suggestions for hotel managers to improve their service quality. In the past, studies for improving service quality relied on surveys or interviews of customers. However, these methods are often costly and time consuming, and the results may be skewed by biased sampling or untrustworthy answers. The proposed approach directly obtains honest feedback from customers' online reviews and draws insights through a type of big data analysis, so it will be a more useful tool to overcome the limitations of surveys or interviews. Moreover, our approach can easily obtain the service quality information of other hotels or services in the tourism industry because it needs only open online reviews and ratings as input data. Furthermore, the performance of our approach will be better if other structured and unstructured data sources are added.
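
The second stage of the approach, relating topics to ratings with a decision tree, can be sketched as a single CART-style split chosen by Gini impurity. This is a minimal illustration under stated assumptions, not the authors' implementation: the per-review topic weights are assumed to come from an earlier LDA step, the topic names and all numbers are hypothetical, and a full CART tree would apply the same split search recursively.

```python
def gini(labels):
    """Gini impurity of a list of binary labels (1 = positive, 0 = negative)."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n          # share of positive ratings
    return 2 * p * (1 - p)       # equals 1 - p^2 - (1 - p)^2

def best_split(rows, labels):
    """Return (topic_index, threshold, weighted_gini) of the best single split.

    rows   : list of per-review topic weights (one list per review)
    labels : 1 for a positive rating, 0 for a negative rating
    """
    best = None
    n = len(rows)
    for t in range(len(rows[0])):
        values = sorted(set(r[t] for r in rows))
        # Candidate thresholds are midpoints between adjacent observed values.
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[t] <= thr]
            right = [y for r, y in zip(rows, labels) if r[t] > thr]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (t, thr, score)
    return best

# Hypothetical topic labels and weights (NOT from the study).
topics = ["location", "soundproofing", "room_condition"]
rows = [
    [0.6, 0.1, 0.3],
    [0.3, 0.1, 0.6],
    [0.5, 0.2, 0.3],
    [0.4, 0.5, 0.1],
    [0.1, 0.6, 0.3],
    [0.0, 0.7, 0.3],
]
labels = [1, 1, 1, 0, 0, 0]

topic_index, threshold, impurity = best_split(rows, labels)
print(topics[topic_index], round(threshold, 2), impurity)
```

In this toy data the split falls on the "soundproofing" topic: reviews weighted heavily toward it are all negative, which mirrors the kind of topic-to-rating relationship the abstract describes.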

DEVELOPMENT OF STATEWIDE TRUCK TRAFFIC FORECASTING METHOD BY USING LIMITED O-D SURVEY DATA (한정된 O-D조사자료를 이용한 주 전체의 트럭교통예측방법 개발)

  • 박만배
    • Proceedings of the KOR-KST Conference
    • /
    • 1995.02a
    • /
    • pp.101-113
    • /
    • 1995
  • The objective of this research is to test the feasibility of developing a statewide truck traffic forecasting methodology for Wisconsin by using Origin-Destination surveys, traffic counts, classification counts, and other data that are routinely collected by the Wisconsin Department of Transportation (WisDOT). Development of a feasible model will permit estimation of future truck traffic for every major link in the network. This will provide the basis for improved estimation of future pavement deterioration. Pavement damage rises exponentially as axle weight increases, and trucks are responsible for most of the traffic-induced damage to pavement. Consequently, forecasts of truck traffic are critical to pavement management systems. The Pavement Management Decision Supporting System (PMDSS) prepared by WisDOT in May 1990 combines pavement inventory and performance data with a knowledge base consisting of rules for evaluation, problem identification and rehabilitation recommendation. Without a reasonable truck traffic forecasting methodology, PMDSS is not able to project pavement performance trends in order to make assessments and recommendations in future years. However, none of WisDOT's existing forecasting methodologies has been designed specifically for predicting truck movements on a statewide highway network. For this research, the Origin-Destination survey data available from WisDOT, including two stateline areas, one county, and five cities, are analyzed and the zone-to-zone truck trip tables are developed. The resulting Origin-Destination Trip Length Frequency (OD TLF) distributions by trip type are applied to the Gravity Model (GM) for comparison with comparable TLFs from the GM. The gravity model is calibrated to obtain friction factor curves for the three trip types, Internal-Internal (I-I), Internal-External (I-E), and External-External (E-E). Both "macro-scale" calibration and "micro-scale" calibration are performed. 
The comparison of the statewide GM TLF with the OD TLF for the macro-scale calibration does not provide suitable results because the available OD survey data do not represent an unbiased sample of statewide truck trips. For the "micro-scale" calibration, "partial" GM trip tables that correspond to the OD survey trip tables are extracted from the full statewide GM trip table. These "partial" GM trip tables are then merged and a partial GM TLF is created. The GM friction factor curves are adjusted until the partial GM TLF matches the OD TLF. Three friction factor curves, one for each trip type, resulting from the micro-scale calibration produce a reasonable GM truck trip model. A key methodological issue for GM calibration involves the use of multiple friction factor curves versus a single friction factor curve for each trip type in order to estimate truck trips with reasonable accuracy. A single friction factor curve for each of the three trip types was found to reproduce the OD TLFs from the calibration data base. Given the very limited trip generation data available for this research, additional refinement of the gravity model using multiple friction factor curves for each trip type was not warranted. In traditional urban transportation planning studies, the zonal trip productions and attractions and region-wide OD TLFs are available. However, for this research, the information available for the development of the GM model is limited to Ground Counts (GC) and a limited set of OD TLFs. The GM is calibrated using the limited OD data, but the OD data are not adequate to obtain good estimates of truck trip productions and attractions. Consequently, zonal productions and attractions are estimated using zonal population as a first approximation. Then, Selected Link based (SELINK) analyses are used to adjust the productions and attractions and possibly recalibrate the GM. 
The SELINK adjustment process involves identifying the origins and destinations of all truck trips that are assigned to a specified "selected link" as the result of a standard traffic assignment. A link adjustment factor is computed as the ratio of the actual volume for the link (ground count) to the total assigned volume. This link adjustment factor is then applied to all of the origin and destination zones of the trips using that "selected link". Selected link based analyses are conducted by using both 16 selected links and 32 selected links. The result of SELINK analysis by using 32 selected links provides the least %RMSE in the screenline volume analysis. In addition, the stability of the GM truck estimating model is preserved by using 32 selected links with three SELINK adjustments, that is, the GM remains calibrated despite substantial changes in the input productions and attractions. The coverage of zones provided by 32 selected links is satisfactory. Increasing the number of repetitions beyond four is not reasonable because the stability of the GM model in reproducing the OD TLF reaches its limits. The total volume of truck traffic captured by 32 selected links is 107% of total trip productions. But more importantly, SELINK adjustment factors for all of the zones can be computed. Evaluation of the travel demand model resulting from the SELINK adjustments is conducted by using screenline volume analysis, functional class and route specific volume analysis, area specific volume analysis, production and attraction analysis, and Vehicle Miles of Travel (VMT) analysis. Screenline volume analysis using four screenlines with 28 check points is used for evaluation of the adequacy of the overall model. The total trucks crossing the screenlines are compared to the ground count totals. LV/GC ratios of 0.958 by using 32 selected links and 1.001 by using 16 selected links are obtained. 
The %RMSE for the four screenlines is inversely proportional to the average ground count totals by screenline. The magnitude of %RMSE for the four screenlines resulting from the fourth and last GM run by using 32 and 16 selected links is 22% and 31% respectively. These results are similar to the overall %RMSE achieved for the 32 and 16 selected links themselves of 19% and 33% respectively. This implies that the SELINK analysis results are reasonable for all sections of the state. Functional class and route specific volume analysis is possible by using the available 154 classification count check points. The truck traffic crossing the Interstate highways (ISH) with 37 check points, the US highways (USH) with 50 check points, and the State highways (STH) with 67 check points is compared to the actual ground count totals. The magnitude of the overall link volume to ground count ratio by route does not show any specific pattern of over- or underestimation. However, the %RMSE for the ISH shows the least value while that for the STH shows the largest value. This pattern is consistent with the screenline analysis and the overall relationship between %RMSE and ground count volume groups. Area specific volume analysis provides another broad statewide measure of the performance of the overall model. The truck traffic in the North area with 26 check points, the West area with 36 check points, the East area with 29 check points, and the South area with 64 check points is compared to the actual ground count totals. The four areas show similar results. No specific patterns in the LV/GC ratio by area are found. In addition, the %RMSE is computed for each of the four areas. The %RMSEs for the North, West, East, and South areas are 92%, 49%, 27%, and 35% respectively, whereas the average ground counts are 481, 1383, 1532, and 3154 respectively. As for the screenline and volume range analyses, the %RMSE is inversely related to average link volume. 
The SELINK adjustments of productions and attractions resulted in a very substantial reduction in the total in-state zonal productions and attractions. The initial in-state zonal trip generation model can now be revised with a new trip production's trip rate (total adjusted productions/total population) and a new trip attraction's trip rate. Revised zonal production and attraction adjustment factors can then be developed that only reflect the impact of the SELINK adjustments that cause increases or decreases from the revised zonal estimate of productions and attractions. Analysis of the revised production adjustment factors is conducted by plotting the factors on the state map. The east area of the state including the counties of Brown, Outagamie, Shawano, Winnebago, Fond du Lac, Marathon shows comparatively large values of the revised adjustment factors. Overall, both small and large values of the revised adjustment factors are scattered around Wisconsin. This suggests that more independent variables beyond just population are needed for the development of the heavy truck trip generation model. More independent variables including zonal employment data (office employees and manufacturing employees) by industry type, zonal private trucks owned and zonal income data which are not available currently should be considered. A plot of frequency distribution of the in-state zones as a function of the revised production and attraction adjustment factors shows the overall adjustment resulting from the SELINK analysis process. Overall, the revised SELINK adjustments show that the productions for many zones are reduced by a factor of 0.5 to 0.8 while the productions for relatively few zones are increased by factors from 1.1 to 4 with most of the factors in the 3.0 range. No obvious explanation for the frequency distribution could be found. The revised SELINK adjustments overall appear to be reasonable. 
The heavy truck VMT analysis is conducted by comparing the 1990 heavy truck VMT that is forecasted by the GM truck forecasting model, 2.975 billion, with the WisDOT computed data. This gives an estimate that is 18.3% less than the WisDOT computation of 3.642 billion VMT. The WisDOT estimates are based on sampling the link volumes for USH, STH, and CTH. This implies potential error in sampling the average link volume. The WisDOT estimate of heavy truck VMT cannot be tabulated by the three trip types, I-I, I-E (E-I), and E-E. In contrast, the GM forecasting model shows that the proportion of E-E VMT out of total VMT is 21.24%. In addition, tabulation of heavy truck VMT by route functional class shows that the proportion of truck traffic traversing the freeways and expressways is 76.5%. Only 14.1% of total freeway truck traffic is I-I trips, while 80% of total collector truck traffic is I-I trips. This implies that freeways are traversed mainly by I-E and E-E truck traffic while collectors are used mainly by I-I truck traffic. Other tabulations such as average heavy truck speed by trip type, average travel distance by trip type and the VMT distribution by trip type, route functional class and travel speed are useful information for highway planners to understand the characteristics of statewide heavy truck trip patterns. Heavy truck volumes for the target year 2010 are forecasted by using the GM truck forecasting model. Four scenarios are used. For better forecasting, ground count-based segment adjustment factors are developed and applied. ISH 90 & 94 and USH 41 are used as example routes. The forecasting results by using the ground count-based segment adjustment factors are satisfactory for long range planning purposes, but additional ground counts would be useful for USH 41. Sensitivity analysis provides estimates of the impacts of the alternative growth rates including information about changes in the trip types using key routes. 
The network-based GM can easily model scenarios with different rates of growth in rural versus urban areas, small versus large cities, and in-state zones versus external stations.
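
The SELINK step described in this abstract (factor = ground count / total assigned volume, applied to the origin and destination zones of trips using the selected link) can be sketched as follows. This is a simplified illustration for a single selected link with hypothetical zone names and volumes; how factors from several selected links are combined for one zone is not stated in the abstract and is not modelled here. The `percent_rmse` helper uses one common form of %RMSE (RMSE divided by the mean ground count), which is also an assumption since the abstract gives no formula.

```python
from math import sqrt

def link_adjustment_factor(ground_count, assigned_volume):
    """Ratio of the observed link volume to the volume assigned by the model."""
    if assigned_volume <= 0:
        raise ValueError("assigned volume must be positive")
    return ground_count / assigned_volume

def apply_selink(productions, attractions, link_trips, ground_count):
    """Scale productions/attractions of zones whose trips use the selected link.

    link_trips maps (origin_zone, destination_zone) to the truck trips
    that the traffic assignment routed over the selected link.
    """
    factor = link_adjustment_factor(ground_count, sum(link_trips.values()))
    origins = {o for o, _ in link_trips}
    destinations = {d for _, d in link_trips}
    new_p = {z: v * factor if z in origins else v for z, v in productions.items()}
    new_a = {z: v * factor if z in destinations else v for z, v in attractions.items()}
    return factor, new_p, new_a

def percent_rmse(model_volumes, ground_counts):
    """RMSE between model and counts, as a percentage of the mean count."""
    n = len(ground_counts)
    rmse = sqrt(sum((m - c) ** 2 for m, c in zip(model_volumes, ground_counts)) / n)
    return 100 * rmse / (sum(ground_counts) / n)

# Hypothetical zones and volumes (NOT from the study).
productions = {"A": 100, "B": 200, "C": 50}
attractions = {"A": 80, "B": 120, "C": 40}
link_trips = {("A", "B"): 30, ("A", "C"): 10}   # 40 trips assigned to the link

factor, new_p, new_a = apply_selink(productions, attractions, link_trips, ground_count=50)
print(factor, new_p, new_a)
```

Here the model assigns 40 trips to a link that actually carries 50, so the factor of 1.25 raises the productions of origin zone A and the attractions of destination zones B and C, while zones with no trips on the link are left unchanged.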

  • PDF

Drug Abuse and Delinquent Behavior among Korean Youths (한국 청소년의 약물남용과 비행행위)

  • 김성이
    • Korea journal of population studies
    • /
    • v.11 no.2
    • /
    • pp.54-66
    • /
    • 1988
  • I. Introduction Since the 1970s, drug abuse among young people has increasingly become a social problem in Korea. In the 1980s, drug abuse, especially glue sniffing, became the cause of many unfortunate incidents resulting in harm to others as well as to the abusers themselves. Considering the seriousness of this problem, the Republic of Korea National Red Cross initiated a nationwide research programme to understand the present situation and to raise the level of public awareness. The goal of this research was to begin a nationwide campaign against drug abuse. The research team was composed of the Advisory Committee members and the staff of the Youth Department of the Republic of Korea National Red Cross. The data were collected in February 1988 with the collaboration of the staff and volunteers in the local Chapters. The respondents were allocated nationwide by the quota sampling method. The questionnaires were distributed to the respondents in three groups: 2,700 to junior and senior high school students, 605 to working youths, and 916 to delinquent youths. A total of 4,221 questionnaires were collected. II. Characteristics of the Respondents The respondents in each group were selected evenly from rural and urban areas. The general characteristics of the respondents can be described as follows: among students, the proportions of male and female respondents, and of senior high school and junior high school students, were almost evenly distributed. Among working youths, the proportion of females (80.5%) was higher than in the student and delinquent groups. Delinquent youths were defined as those currently in the custody of centers for juvenile delinquents. Of this number, 38.8% and 68.2% were junior and senior high school drop-outs respectively. The majority of them (92.6%) were male. 
As for the family background of the respondents, the proportion of those residing in poverty-stricken areas and the proportion of those from broken families were higher among working youths and delinquent youths than among students. III. Present Patterns of Drug Abuse The following summarizes the present patterns of drug abuse, as tabulated from the results of the survey. 1. Smoking The percentage of youths who smoke was 36% in the student group, 32% in the working youths group, and 94.4% in the delinquent youths group. 2. Alcohol 50.3% of students, 71.6% of working youths, and 93.3% of delinquent youths have experienced drinking alcoholic beverages. 3. Tonic: non-alcoholic, caffeinated beverages popular in Korea and Japan The percentage of those who have used tonic at least once was over 90% in all three groups. 4. Sedatives About 70% of each group have used sedatives, with the proportion among working youths higher than in the other groups. 5. Stimulants Those who have used stimulants comprised around 15% in each group. 6. Tranquilizers Somewhat less than 5% of students and working youths, and 28% of delinquent youths, have used tranquilizers. 7. Hypnotics The users of hypnotics amounted to 0.4% of students, 2.6% of working youths and 7.1% of delinquent youths. 8. Marihuana Those who have used marihuana comprised 0.7% of students, 0.8% of working youths, and 13% of delinquent youths. 9. Glue-sniffing The percentage of glue-sniffing was 3.7% in the student group and 5% in the working youths group, but the proportion was unusually high, at 40.7%, in the delinquent youths group. From the results of the survey the present situation of drug abuse in Korea can be summarized as follows: 1. A high percentage of Korean youths have experienced smoking cigarettes and drinking alcoholic beverages. 2. Tonics (non-alcoholic, caffeinated beverages), antipyretic analgesics and stimulants are quite regularly used. 3. 
Tranquilizers, hypnotics, marihuana and glue-sniffing are more widely used among delinquent youths than among the other youths. This suggests a correlation between drug abuse and juvenile delinquency. IV. Time-series Analysis of the First Experience of Drug Abuse and Deviant Behaviour The respondents were asked when they were first exposed to drugs and when they committed deviant acts. By calculating the average age of each experience, the following pattern was found (See Figure 1). Youths are first exposed to drugs through abuse of tonic (non-alcoholic, caffeinated beverages). At the age of 13, they smoke cigarettes; the use of antipyretic analgesics begins at 14 years old, while at the age of 15 they use tranquilizers, and at 16 hypnotics. The period of drug abuse, which starts with drinking caffeinated beverages and smoking cigarettes and ends in the use of hypnotics, takes about three years. During this period, other delinquent behaviours begin to surface; that is, at the age of 13, when smoking cigarettes begins, the delinquent behaviour pattern starts with truancy. Next, they start taking money from others by using physical force. Prior to the age of 15, they are suspended from school, become hostile to adults, begin running away from home, and start using stimulants and alcohol. Soon they become involved even in glue-sniffing and in the use of marihuana. At the age of 15, they begin to see adult videos and carry weapons. Sexual promiscuity and usage of tranquilizers follow the viewing of adult videos. Consequently, by the time they reach the age of 16, they visit drinking establishments and are picked up by police for committing delinquent acts. And finally, they come to use hypnotic-type drugs. From the above descriptions, drug abuse can be assumed to have a close correlation with delinquent behaviour. V. 
Social Factors Related to Drug Abuse Among Korean youths, glue-sniffing is found to be related to aggressive delinquency, in such cases as runaways, being picked up by the police, and taking money by force. Smoking cigarettes and drinking alcohol are found to be related to seeing adult videos and visiting drinking establishments. Hypnotics and marihuana were found to be representative of drugs which are related to degenerational delinquency, irrespective of social delinquency. The social factors connected with this drug abuse are as follows: 1. Individual factors Male students were more heavily involved in the usage of drugs than females. Youths who do not attend church were more likely to be involved in drugs than those who attend. 2. Family factors The youths who were displeased with their mothers' smoking, those who thought their parents did not love each other, and those whose parents had used drugs without prescription were more likely to be drug users. 3. School factors Those youths who found school life boring, were unsuccessful in their studies, spent most of their time with friends, felt their teachers smoked too much, or had a positive perception of their teachers' smoking were likely to be drug users. To sum up, drug abusers depend on the influence of their parents, teachers and peers. VI. Reasons for Drug Abuse Korean students have mainly used drugs to release stress (42.8%), to stay awake (19.7%), and because of the easy accessibility of drugs (16.6%). Other reasons are ignorance of the side effects of the drugs (3.6%), natural curiosity (4.2%), and to increase strength (3.0%). From the above facts, the major reasons for drug abuse among Korean youths are to release stress and to stay awake in order to prepare for exams. Furthermore, since drugs are readily available, we can conclude that drug abuse is caused by the school system (such as entrance exams) in Korea. VII. 
Conclusion Drug use among Korean youths is relatively less common than among Western youths. In some cases, such as glue-sniffing and the use of stimulants, a pattern of drug abuse is found. Moreover, early drug abuse is evident, and it has a close connection with deviant behaviour, resulting in juvenile delinquency. Drug abuse cannot be attributed to any one social factor. Specifically, drug abuse depends on parents, peers, teachers and other members of the community, and is also influenced by social institutions such as the entrance exam system. Every person and organization concerned with youth must participate collectively in restraining drug abuse. Finally, it is suggested that social agencies working for youth welfare should make every effort to tackle this serious problem confronting Korean youths today.

  • PDF