• Title/Summary/Keyword: lowest frequency words

Search Result 11, Processing Time 0.025 seconds

Word Sense Disambiguation based on Concept Learning with a focus on the Lowest Frequency Words (저빈도어를 고려한 개념학습 기반 의미 중의성 해소)

  • Kim Dong-Sung;Choe Jae-Woong
    • Language and Information
    • /
    • v.10 no.1
    • /
    • pp.21-46
    • /
    • 2006
  • This study proposes a Word Sense Disambiguation (WSD) algorithm, based on concept learning with special emphasis on statistically meaningful lowest frequency words. Previous works on WSD typically make use of frequency of collocation and its probability. Such probability based WSD approaches tend to ignore the lowest frequency words which could be meaningful in the context. In this paper, we show an algorithm to extract and make use of the meaningful lowest frequency words in WSD. Learning method is adopted from the Find-Specific algorithm of Mitchell (1997), according to which the search proceeds from the specific predefined hypothetical spaces to the general ones. In our model, this algorithm is used to find contexts with the most specific classifiers and then moves to the more general ones. We build up small seed data and apply those data to the relatively large test data. Following the algorithm in Yarowsky (1995), the classified test data are exhaustively included in the seed data, thus expanding the seed data. However, this might result in lots of noise in the seed data. Thus we introduce the 'maximum a posterior hypothesis' based on the Bayes' assumption to validate the noise status of the new seed data. We use the Naive Bayes Classifier and prove that the application of Find-Specific algorithm enhances the correctness of WSD.

  • PDF

An Analysis on the Korean Language for Optimum Transmission of Hangul Code (한글 부호의 최적화 전송을 위한 한국어 낱자 분석)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.1
    • /
    • pp.33-38
    • /
    • 2015
  • The goal of this paper is to propose the Hangul Jamo to make a optimum transmission code of Hangul consonant and vowel(Jamo). The Hangul Jamo was analyzed by three kinds of Hangul Jamo. First one is the basic Hangul Jamo which is consisted by 24ea Jamo. Second one is a two combination keyboard which has 28ea Jamo. Third one is 54 Jamo set which is added the double Jamo to the second one. Use frequency of Hangul-Jamo is analyzed based on the Hangul in "Modern Korean Use Frequency Rate Survey Result" issued by The National Institute of the Korean Language". Total 58,437 Korean words are contained in the report and the words are composed with Hangul 1,540ea. The results of this study are as follows, In the Jamo are that in the first Jamo set case, the highest use frequency consonant is "ㅇ" and the lowest one is "ㅋ", and in the vowel case, the highest use frequency one is "ㅏ" and the lowest one is "ㅑ". In the second case, the highest use frequency consonant is same as first case and the highest vowel is "ㅏ" and the lowest one is "ㅒ". In the third case, the highest consonant is "ㅏ" and the lowest one is "ㅞ". the highest vowel is "ㄱ" and the lowest one is "ㄽ".

Non-equal DC link Voltages in a Cascaded H-Bridge with a Selective Harmonic Mitigation-PWM Technique Based on the Fundamental Switching Frequency

  • Moeini, Amirhossein;Iman-Eini, Hossein;Najjar, Mohammad
    • Journal of Power Electronics
    • /
    • v.17 no.1
    • /
    • pp.106-114
    • /
    • 2017
  • In this paper, the Selective Harmonic Mitigation-PWM (SHM-PWM) method is used in single-phase and three-phase Cascaded H-Bridge (CHB) inverters in order to fulfill different power quality standards such as EN 50160, CIGRE WG 36-05, IEC 61000-3-6 and IEC 61000-2-12. Non-equal DC link voltages are used to increase the degrees of freedom for the proposed SHM-PWM technique. In addition, it will be shown that the obtained solutions become continuous and without sudden changes. As a result, the look-up tables can be significantly reduced. The proposed three-phase modulation method can mitigate up to the 50th harmonic from the output voltage, while each switch has just one switching in a fundamental period. In other words, the switching frequency of the power switches are limited to 50 Hz, which is the lowest switching frequency that can be achieved in the multilevel converters, when the optimal selective harmonic mitigation method is employed. In single-phase mode, the proposed method can successfully mitigate harmonics up to the 50th, where the switching frequency is 150 Hz. Finally, the validity of the proposed method is verified by simulations and experiments on a 9-level CHB inverter.

The prosodic characters of particles in Korean -- focusing on the read speech -- (한국어 조사의 운율적 특성 - 낭독체 문장을 중심으로-)

  • Jun Eun;Lee Sook-hyang
    • MALSORI
    • /
    • no.37
    • /
    • pp.73-85
    • /
    • 1999
  • The prosodic characteristics of Korean particles in read speech were examined in this paper based on K-ToBI labeling system in order to see whether they are prosodically weak form like functions words in English. Acoustic measurements and statistical analysis were done focusing on the distribution of particles over a variety of prosodic positions, prosodic positional effects on the phonetic realization of particles, and acoustic strength of particles compared to those of their surrounding syllables. The panicles were distributed rather equally over all 4 prosodic positions with the highest frequency at IP-medial/AP-final position and the lowest at IP-medial/AP-medial position except that topic marker 'Un/nUn' showed preference for IP-final/AP-final position. There was a significant prosodic positional effect on the duration and F0 of the particles. Duration was the longest at IP-final/AP-final position and interestingly, at IP-medial/AP-medial position while F0 was the highest at IP-final/AP-medial Position as expected. The comparison of the acoustic properties of the particles with those of neighbor syllables showed that duration was generally significantly longer and energy also showed larger values, if not significant, in particles suggesting that the particles in Korean are not prosodically weaker like function words in English.

  • PDF

Analysis of Korean Language to Optimize the Hangul Character Coding for Information Processing and Communication (한글의 정보처리 및 통신용 부호 최적화를 위한 한국어 분석)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.3
    • /
    • pp.375-380
    • /
    • 2015
  • This paper is studied the Korean language to optimize the Hangul character coding for information processing in information terminal device and transmission in network. The paper analyzed Hangul character in Korean language and use frequency of each character. The paper also compared the analysis result to Hangul characters which are coded in standard in Korean character and Unicode. This study referred "Modern Korean Use Frequency Rate Survey Result" issued by The National Institute of the Korean Language. There are total 58,437 Korean words in the report. As a result of this paper, the Korean word 58,437ea are consisted of Hangul character total 1,540ea. The highest use frequency character is "다" and its use frequency to total use frequency rate is 15%. The lowest use character is "휫"and its use frequency to total use frequency rate is 0.00003%. The number of analyzed Hangul character 1,540 is less 7.2 times and 1.5 times than Korean and Unicode standard respectively.

N1-N2 Audiograms of the Guinea Pig Cochlea (기네아피그 달팽이관의 N1-N2 오디오그램)

  • 장순석
    • Journal of Biomedical Engineering Research
    • /
    • v.16 no.1
    • /
    • pp.77-84
    • /
    • 1995
  • Nl and N2 gross neural action potentials were measured from the round window of the guinea pig cochlea at the onset of the acoustic stimuli. Nl -N2 audiograms were made by means of regulating stimulant intensities in order to produce constant Nl -N2 potentials as criteria for different input tone pip frequencies. The lowest threshold was measured with an input tone pip 15 dB SPL in intensity and 12 KHz in frequency when the animal was in normal physiological condition. The procedure of experimental measurements is explained in detail. This experimental approach is very useful for the investigation of the Cochlear function. Both nonlinear and active functions of the Cochlea can be monitored by Nl -N2 audiograms. Key words : Guinea Pig, Cochlea, Wl and N2 Gross Neural Action Potential, Nl -N2 audiogram.

  • PDF

A Research on Difference Between Consumer Perception of Slow Fashion and Consumption Behavior of Fast Fashion: Application of Topic Modelling with Big Data

  • YANG, Oh-Suk;WOO, Young-Mok;YANG, Yae-Rim
    • The Journal of Economics, Marketing and Management
    • /
    • v.9 no.1
    • /
    • pp.1-14
    • /
    • 2021
  • Purpose: The article deals with the proposition that consumers' fashion consumption behavior will still follow the consumption behavior of fast fashion, despite recognizing the importance of slow fashion. Research design, data and methodology: The research model to verify this proposition is topic modelling with big data including unstructured textual data. we combined 5,506 news articles posted on Naver news search platform during the 2003-2019 period about fast fashion and slow fashion, high-frequency words have been derived, and topics have been found using LDA model. Based on these, we examined consumers' perception and consumption behavior on slow fashion through the analysis of Topic Network. Results: (1) Looking at the status of annual article collection, consumers' interest in slow fashion mainly began in 2005 and showed a steady increase up to 2019. (2) Term Frequency analysis showed that the keywords for slow fashion are the lowest, with consumers' consumption patterns continuing around 'brand.' (3) Each topic's weight in articles showed that 'social value' - which includes slow fashion - ranked sixth among the 9 topics, low linkage with other topics. (4) Lastly, 'brand' and 'fashion trend' were key topics, and the topic 'social value' accounted for a low proportion. Conclusion: Slow fashion was not a considerable factor of consumption behavior. Consumption patterns in fashion sector are still dominated by general consumption patterns centered on brands and fast fashion.

Patent Technology Trends of Oral Health: Application of Text Mining

  • Hee-Kyeong Bak;Yong-Hwan Kim;Han-Na Kim
    • Journal of dental hygiene science
    • /
    • v.24 no.1
    • /
    • pp.9-21
    • /
    • 2024
  • Background: The purpose of this study was to utilize text network analysis and topic modeling to identify interconnected relationships among keywords present in patent information related to oral health, and subsequently extract latent topics and visualize them. By examining key keywords and specific subjects, this study sought to comprehend the technological trends in oral health-related innovations. Furthermore, it aims to serve as foundational material, suggesting directions for technological advancement in dentistry and dental hygiene. Methods: The data utilized in this study consisted of information registered over a 20-year period until July 31st, 2023, obtained from the patent information retrieval service, KIPRIS. A total of 6,865 patent titles related to keywords, such as "dentistry," "teeth," and "oral health," were collected through the searches. The research tools included a custom-designed program coded specifically for the research objectives based on Python 3.10. This program was used for keyword frequency analysis, semantic network analysis, and implementation of Latent Dirichlet Allocation for topic modeling. Results: Upon analyzing the centrality of connections among the top 50 frequently occurring words, "method," "tooth," and "manufacturing" displayed the highest centrality, while "active ingredient" had the lowest. Regarding topic modeling outcomes, the "implant" topic constituted the largest share at 22.0%, while topics concerning "devices and materials for oral health" and "toothbrushes and oral care" exhibited the lowest proportions at 5.5% each. Conclusion: Technologies concerning methods and implants are continually being researched in patents related to oral health, while there is comparatively less technological development in devices and materials for oral health. This study is expected to be a valuable resource for uncovering potential themes from a large volume of patent titles and suggesting research directions.

A Comparative Study on the Service Characteristics for Transferring Process of High-Speed Rail and Domestic Airline Systems by Using Structural Equation Modeling (공분산구조분석을 이용한 고속철도와 국내항공의 이동단계별 서비스특성 비교연구)

  • Kim, Tae Ho;Jeong, Kwang Seop;Park, Je Jin
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.2D
    • /
    • pp.183-190
    • /
    • 2009
  • In order to improve continuous success and stabilization of high-speed rail in the future, Using Frequency of high-speed rail should be enhanced by improving satisfaction of high-speed rail passenger. High-speed rail is needed to hold the priority in competition means by comparing the traits of service of domestic air lines. This study utilizes structural equation modeling to develop model for estimating factors influencing to service through conducting survey questionnaire. It also uses reliability analysis, correlation analysis, factor analysis to examine the rationalization of items and to establish hypothesis of this research. The results show that KTX contains 'inner service' item that is considered to be ameliorated and that domestic airline present low performance of 'outer service' item. In other words, moving section which partly is under a limited condition is needed to be improved. In addition, access to airport and transfer to other transportations have to be improved as they show the lowest satisfaction.

Influence of Self-Differentiation and Acculturation on Marriage Satisfaction Among Immigrant Women by Residential Area (거주지역에 따른 결혼이민자 여성의 자아분화 및 문화적응이 결혼만족도에 미치는 영향)

  • Lee, Young-Boon;Lee, Yu-Kyung
    • Journal of Families and Better Life
    • /
    • v.28 no.1
    • /
    • pp.145-157
    • /
    • 2010
  • This is a study that explores the influence of self-differentiation and acculturation among married immigrant women on their feelings of marriage satisfaction by residential area. The aim was to verify the level of self-differentiation and acculturation that married immigrant women secure from multicultural marriage. To achieve this objective, the study widely distributed a questionnaire that targeted women who were participating in education and other services at health and family support centers, multi-cultural family support centers, general social welfare centers, immigrant women shelters, and Korean language classrooms which are located in Seoul, Gyeonggi, Chungcheong, Jeolla, and Gyeongsang. Data analysis involved frequency analysis, descriptive statistics, one-way-Anova, and multiple regression analysis. Based on the results of descriptive statistics, two factors, namely, (1)interpersonal-relation differentiation, a sub-scale of self-differentiation, and (2)marginalization, a sub-scale of acculturation, had the lowest average. In verifying its various hypotheses, the study achieved the following results. Firstly, among demographic characteristics, there was difference of the mean in the marriage period, average monthly income, the frequency of meetings with the married woman´s parents-in-law and her own parents, and the average cost of supporting the woman's parents-in-law and her own parents. Secondly, among demographic characteristics, the variable of influencing marriage satisfaction showed negative influence in the case of women dwelling in farming and fishing villages. This shows that women residing in cities whether small, medium or large have higher marriage satisfaction. Also, as a result of verifying whether self-differentiation has influence on marriage satisfaction, the element of interpersonal-relations differentiation had a negative influence on marriage satisfaction. Thirdly, as for influence of acculturation upon marriage satisfaction, the study showed that just integration, which is a sub-scale of acculturation had positive effect on marriage satisfaction. In other words, the study showed that the interpersonal-relation differentiation among the sub-scales of self-differentiation among married immigrant women had a negative influence, and that integration among the sub-scales in acculturation had a positive influence on marriage satisfaction. Based on these results, in order to increase interpersonal-relation differentiation, as well as marriage satisfaction among immigrant women, the study suggests the integration of the women's families with the nuclear and extended families in the communities where the women reside.