• Title/Summary/Keyword: Data Word length

Search Result 46, Processing Time 0.025 seconds

The final stop consonant perception in typically developing children aged 4 to 6 years and adults (4-6세 정상발달아동 및 성인의 종성파열음 지각력 비교)

  • Byeon, Kyeongeun;Ha, Seunghee
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.57-65
    • /
    • 2015
  • This study aimed to identify the development pattern of final stop consonant perception using the gating task. Sixty-four subjects participated in the study: 16 children aged 4 years, 16 children aged 5 years, 17 children aged 6 years, and 15 adults. One-syllable words with consonant-vowel-consonant(CVC) structure, mokㄱ-motㄱ and papㄱ-patㄱ were used as stimuli in order to remove the redundancy of acoustic cues in stimulus words, 40ms-length (-40ms) and 60ms-length (-60ms) from the entire duration of the final consonant were deleted. Three conditions (the whole word segment, -40ms, -60ms) were used for this speech perception experiment. 48 tokens (4 stimuli ${\times}3$ conditions ${\times}4$ trials) in total were provided for participants. The results indicated that 5 and 6 year olds showed final consonant perception similar to adults in stimuli, papㄱ-patㄱ and only the 6-year-old children showed perception similar to adults in stimuli, 'mokㄱ-motㄱ. The results suggested that younger typically developing children require more acoustic information to accurately perceive final consonants than older children and adults. Final consonant perception ability may become adult-like around 6 years old. The study provides fundamental data on the development pattern of speech perception in normal developing children, which can be used to compare to those of children with communication disorders.

Characterization of New Two Parametric Generalized Useful Information Measure

  • Bhat, Ashiq Hussain;Baig, M. A. K.
    • Journal of Information Science Theory and Practice
    • /
    • v.4 no.4
    • /
    • pp.64-74
    • /
    • 2016
  • In this paper we define a two parametric new generalized useful average code-word length $L_{\alpha}^{\beta}$(P;U) and its relationship with two parametric new generalized useful information measure $H_{\alpha}^{\beta}$(P;U) has been discussed. The lower and upper bound of $L_{\alpha}^{\beta}$(P;U), in terms of $H_{\alpha}^{\beta}$(P;U) are derived for a discrete noiseless channel. The measures defined in this communication are not only new but some well known measures are the particular cases of our proposed measures that already exist in the literature of useful information theory. The noiseless coding theorems for discrete channel proved in this paper are verified by considering Huffman and Shannon-Fano coding schemes on taking empirical data. Also we study the monotonic behavior of $H_{\alpha}^{\beta}$(P;U) with respect to parameters ${\alpha}$ and ${\beta}$. The important properties of $H_{{\alpha}}^{{\beta}}$(P;U) have also been studied.

A Research on the Interlanguage of Chinese Speaking Korean Language Learners: Focusing on MLU and Characteristics Found in Vocabulary Usage (중국인 한국어 학습자의 중간언어 연구 - 평균발화길이(MLU)와 어휘적 특성을 중심으로)

  • Kim, Seon-Jung;Kim, Mok-Ah
    • Cross-Cultural Studies
    • /
    • v.22
    • /
    • pp.303-327
    • /
    • 2011
  • This study aims to uncover the learner's language proficiency shown in the writing data of Chinese elementary/intermediate level learners. Language proficiency of the learners acquired by error analysis provides only partial information, and thus this study analyses the interlanguage of Korean learners in terms of 'Mean Length of Utterance, MLU' to discover the overall aspect of learner's language proficiency more symmetrically. The analysis of vocabulary area is to be enforced after generally studying the learner's language development aspect in accordance with MLU-m(orpheme) and MLU-(w)ord found in compositions by Chinese speaking Korean language learners. In terms of MLU, it has been slightly increased as the level of proficiency between elementary level and intermediate level learners; however, the morpheme seemed to be difficult to use, since the difference between Chinese learners and Korean university students has been notably shown. Vocabulary diversity, using aspect for each word class, and using aspect of the predicate are studied for vocabulary area; more various and numerous vocabulary tend to be used as the level of proficiency increases. In terms of predicate use, Chinese learners use less numerous vocabulary types.

The Effects of M-CRM Characteristics, Market Orientation on Customer Loyalty and the Moderating Role of Relationship Length in Insurance Companies (보험기업의 M-CRM 특성과 시장지향성이 고객충성도에 미치는 영향: 관계기간의 조절효과)

  • Jung, Chul-Ho;Jung, Duk-Hwa
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.6
    • /
    • pp.726-738
    • /
    • 2016
  • This paper aims to examine structural relationship between the influence factors of customer loyalty, hypothesizing that m-CRM characteristics, market orientations, relationship quality and relationship length plays a crucial role in achieving customer loyalty in insurance companies. Total of 255 valid sample data were used to test study hypotheses. By using Structure Equation Modeling(SEM) method, the results show that m-CRM characteristics and customer orientation significantly influence to relationship quality except competitor orientation and all relationship quality are very significantly influence to customer loyalty being consisted of customer retention and word of mouth effect. In addition, the modulation effect of relationship length is confirmed about relationship between relationship quality and customer loyalty. A real situation we conducted our research may enable academics and practitioners to understand the antecedents and outcomes of m-CRM implementation in terms of market orientation.

Deep Learning-based Stock Price Prediction Using Limit Order Books and News Headlines (호가창과 뉴스 헤드라인을 이용한 딥러닝 기반 주가 변동 예측 기법)

  • Ryoo, Euirim;Lee, Ki Yong;Chung, Yon Dohn
    • The Journal of Society for e-Business Studies
    • /
    • v.27 no.1
    • /
    • pp.63-79
    • /
    • 2022
  • Recently, various studies have been conducted on stock price prediction using machine learning and deep learning techniques. Among these studies, the latest studies have attempted to predict stock prices using limit order books, which contain buy and sell order information of stocks. However, most of the studies using limit order books consider only the trend of limit order books over the most recent period of a specified length, and few studies consider both the medium and short term trends of limit order books. Therefore, in this paper, we propose a deep learning-based prediction model that predicts stock price more accurately by considering both the medium and short term trends of limit order books. Moreover, the proposed model considers news headlines during the same period to reflect the qualitative status of the company in the stock price prediction. The proposed model extracts the features of changes in limit order books with CNNs and the features of news headlines using Word2vec, and combines these information to predict whether a particular company's stock will rise or fall the next day. We conducted experiments to predict the daily stock price fluctuations of five stocks (Amazon, Apple, Facebook, Google, Tesla) with the proposed model using the real NASDAQ limit order book data and news headline data, and the proposed model improved the accuracy by up to 17.66%p and the average by 14.47%p on average. In addition, we conducted a simulated investment with the proposed model and earned a minimum of $492.46 and a maximum of $2,840.93 depending on the stock for 21 business days.

Analyzing Contextual Polarity of Unstructured Data for Measuring Subjective Well-Being (주관적 웰빙 상태 측정을 위한 비정형 데이터의 상황기반 긍부정성 분석 방법)

  • Choi, Sukjae;Song, Yeongeun;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.83-105
    • /
    • 2016
  • Measuring an individual's subjective wellbeing in an accurate, unobtrusive, and cost-effective manner is a core success factor of the wellbeing support system, which is a type of medical IT service. However, measurements with a self-report questionnaire and wearable sensors are cost-intensive and obtrusive when the wellbeing support system should be running in real-time, despite being very accurate. Recently, reasoning the state of subjective wellbeing with conventional sentiment analysis and unstructured data has been proposed as an alternative to resolve the drawbacks of the self-report questionnaire and wearable sensors. However, this approach does not consider contextual polarity, which results in lower measurement accuracy. Moreover, there is no sentimental word net or ontology for the subjective wellbeing area. Hence, this paper proposes a method to extract keywords and their contextual polarity representing the subjective wellbeing state from the unstructured text in online websites in order to improve the reasoning accuracy of the sentiment analysis. The proposed method is as follows. First, a set of general sentimental words is proposed. SentiWordNet was adopted; this is the most widely used dictionary and contains about 100,000 words such as nouns, verbs, adjectives, and adverbs with polarities from -1.0 (extremely negative) to 1.0 (extremely positive). Second, corpora on subjective wellbeing (SWB corpora) were obtained by crawling online text. A survey was conducted to prepare a learning dataset that includes an individual's opinion and the level of self-report wellness, such as stress and depression. The participants were asked to respond with their feelings about online news on two topics. Next, three data sources were extracted from the SWB corpora: demographic information, psychographic information, and the structural characteristics of the text (e.g., the number of words used in the text, simple statistics on the special characters used). These were considered to adjust the level of a specific SWB. Finally, a set of reasoning rules was generated for each wellbeing factor to estimate the SWB of an individual based on the text written by the individual. The experimental results suggested that using contextual polarity for each SWB factor (e.g., stress, depression) significantly improved the estimation accuracy compared to conventional sentiment analysis methods incorporating SentiWordNet. Even though literature is available on Korean sentiment analysis, such studies only used only a limited set of sentimental words. Due to the small number of words, many sentences are overlooked and ignored when estimating the level of sentiment. However, the proposed method can identify multiple sentiment-neutral words as sentiment words in the context of a specific SWB factor. The results also suggest that a specific type of senti-word dictionary containing contextual polarity needs to be constructed along with a dictionary based on common sense such as SenticNet. These efforts will enrich and enlarge the application area of sentic computing. The study is helpful to practitioners and managers of wellness services in that a couple of characteristics of unstructured text have been identified for improving SWB. Consistent with the literature, the results showed that the gender and age affect the SWB state when the individual is exposed to an identical queue from the online text. In addition, the length of the textual response and usage pattern of special characters were found to indicate the individual's SWB. These imply that better SWB measurement should involve collecting the textual structure and the individual's demographic conditions. In the future, the proposed method should be improved by automated identification of the contextual polarity in order to enlarge the vocabulary in a cost-effective manner.

FFT/IFFT IP Generator for OFDM Modems (OFDM 모뎀용 FFT/IFFT IP 자동 생성기)

  • Lee Jin-Woo;Shin Kyung-Wook;Kim Jong-Whan;Baek Young-Seok;Eo Ik-Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.3A
    • /
    • pp.368-376
    • /
    • 2006
  • This paper describes a Fcore_GenSim(Parameterized FFT Core Generation & Simulation Program), which can be used as an essential If(Intellectual Property) in various OFDM modem designs. The Fcore_Gensim is composed of two parts, a parameterized core generator(PFFT_CoreGen) that generates Verilog-HDL models of FFT cores, and a fixed-point FFT simulator(FXP_FFTSim) which can be used to estimate the SQNR performance of the generated cores. The parameters that can be specified for core generation are FFT length in the range of 64 ~2048-point and word-lengths of input/output/internal/twiddle data in the range of 8-b "24-b with 2-b step. Total 43,659 FFT cores can be generated by Fcore_Gensim. In addition, CBFP(Convergent Block Floating Point) scaling can be optionally specified. To achieve an optimized hardware and SQNR performance of the generated core, a hybrid structure of R2SDF and R2SDC stages and a hybrid algorithm of radix-2, radix-2/4, radix-2/4/8 are adopted according to FFT length and CBFP scaling.

Analysis of an Optimal Iterative Turbo Equalizer for Underwater Acoustic Communication (수중 음향통신에 적합한 최적의 반복기반 터보 등화기 분석)

  • Park, Tae Doo;Lee, Seong Ro;Kim, Beom Mu;Jung, Ji Won
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38C no.3
    • /
    • pp.303-310
    • /
    • 2013
  • Underwater acoustic communication has multipath error because of reflection by sea-level and sea-bottom. The multipath of underwater channel causes signal distortion and error floor. In order to improve the performance, it is necessary to employ an iterative coding scheme. Among the iterative coding scheme, turbo codes and LDPC codes are dominant channel coding schemes in recent. This paper concluded that turbo coding scheme is optimal for underwater communications system in aspect to performance, coded word length, and equalizer combining. Also, we confirmed the performance in the environment of oceanic experimentation using turbo equalizer based on distance 5Km, data rate 1Kbps.

Optimizing Multiple Pronunciation Dictionary Based on a Confusability Measure for Non-native Speech Recognition (타언어권 화자 음성 인식을 위한 혼잡도에 기반한 다중발음사전의 최적화 기법)

  • Kim, Min-A;Oh, Yoo-Rhee;Kim, Hong-Kook;Lee, Yeon-Woo;Cho, Sung-Eui;Lee, Seong-Ro
    • MALSORI
    • /
    • no.65
    • /
    • pp.93-103
    • /
    • 2008
  • In this paper, we propose a method for optimizing a multiple pronunciation dictionary used for modeling pronunciation variations of non-native speech. The proposed method removes some confusable pronunciation variants in the dictionary, resulting in a reduced dictionary size and less decoding time for automatic speech recognition (ASR). To this end, a confusability measure is first defined based on the Levenshtein distance between two different pronunciation variants. Then, the number of phonemes for each pronunciation variant is incorporated into the confusability measure to compensate for ASR errors due to words of a shorter length. We investigate the effect of the proposed method on ASR performance, where Korean is selected as the target language and Korean utterances spoken by Chinese native speakers are considered as non-native speech. It is shown from the experiments that an ASR system using the multiple pronunciation dictionary optimized by the proposed method can provide a relative average word error rate reduction of 6.25%, with 11.67% less ASR decoding time, as compared with that using a multiple pronunciation dictionary without the optimization.

  • PDF

The Incredible Shrinking Noun Phrase: Ongoing Change in Japanese Word Formation

  • Kevin Heffernan;Yusuke Imanishi
    • Asia Pacific Journal of Corpus Research
    • /
    • v.4 no.1
    • /
    • pp.1-23
    • /
    • 2023
  • The Japanese language, as a typical agglutinating language, permits large noun phrases (NP) containing ten or more morphemes. In this paper, we argue that the nature of the NP in Japanese is changing. Our data are drawn from the Balanced Corpus of Contemporary Written Japanese. We conduct a series of apparent-time studies of ongoing changes in complex NPs. We first examine the length of compound nouns, followed by the usage of bound suffixes. We then examine ongoing changes in complex NPs that contain genitive case markers. Finally, we examine noun incorporation. All of our studies show a trend towards shorter, less complex NPs. Furthermore, our results suggest that the usage rate of phrases that modify the noun inside the NP (compound nouns, bound nouns, NPs containing genitive case, noun incorporation) appears to be decreasing over time. On the other hand, the usage rate of modifying material outside of the NP (positional phrases, relative clauses) appears to be increasing over time. We conclude by suggesting that our results reflect a diachronic change of decreasing synthetic morphology and increasing analytic morphology. We end by pointing out the implications of this work on our understanding syntheticity and analyticity.