• Title/Summary/Keyword: Frequency split

Search Result 247, Processing Time 0.028 seconds

Stock Price Prediction by Utilizing Category Neutral Terms: Text Mining Approach (카테고리 중립 단어 활용을 통한 주가 예측 방안: 텍스트 마이닝 활용)

  • Lee, Minsik;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.123-138
    • /
    • 2017
  • Since the stock market is driven by the expectation of traders, studies have been conducted to predict stock price movements through analysis of various sources of text data. In order to predict stock price movements, research has been conducted not only on the relationship between text data and fluctuations in stock prices, but also on the trading stocks based on news articles and social media responses. Studies that predict the movements of stock prices have also applied classification algorithms with constructing term-document matrix in the same way as other text mining approaches. Because the document contains a lot of words, it is better to select words that contribute more for building a term-document matrix. Based on the frequency of words, words that show too little frequency or importance are removed. It also selects words according to their contribution by measuring the degree to which a word contributes to correctly classifying a document. The basic idea of constructing a term-document matrix was to collect all the documents to be analyzed and to select and use the words that have an influence on the classification. In this study, we analyze the documents for each individual item and select the words that are irrelevant for all categories as neutral words. We extract the words around the selected neutral word and use it to generate the term-document matrix. The neutral word itself starts with the idea that the stock movement is less related to the existence of the neutral words, and that the surrounding words of the neutral word are more likely to affect the stock price movements. And apply it to the algorithm that classifies the stock price fluctuations with the generated term-document matrix. In this study, we firstly removed stop words and selected neutral words for each stock. And we used a method to exclude words that are included in news articles for other stocks among the selected words. Through the online news portal, we collected four months of news articles on the top 10 market cap stocks. We split the news articles into 3 month news data as training data and apply the remaining one month news articles to the model to predict the stock price movements of the next day. We used SVM, Boosting and Random Forest for building models and predicting the movements of stock prices. The stock market opened for four months (2016/02/01 ~ 2016/05/31) for a total of 80 days, using the initial 60 days as a training set and the remaining 20 days as a test set. The proposed word - based algorithm in this study showed better classification performance than the word selection method based on sparsity. This study predicted stock price volatility by collecting and analyzing news articles of the top 10 stocks in market cap. We used the term - document matrix based classification model to estimate the stock price fluctuations and compared the performance of the existing sparse - based word extraction method and the suggested method of removing words from the term - document matrix. The suggested method differs from the word extraction method in that it uses not only the news articles for the corresponding stock but also other news items to determine the words to extract. In other words, it removed not only the words that appeared in all the increase and decrease but also the words that appeared common in the news for other stocks. When the prediction accuracy was compared, the suggested method showed higher accuracy. The limitation of this study is that the stock price prediction was set up to classify the rise and fall, and the experiment was conducted only for the top ten stocks. The 10 stocks used in the experiment do not represent the entire stock market. In addition, it is difficult to show the investment performance because stock price fluctuation and profit rate may be different. Therefore, it is necessary to study the research using more stocks and the yield prediction through trading simulation.

Studies on Dry Matter Yields , Chemical Composition and Net Energy Accumulation in Three Leading Temperate Grass Species I. Influence of meteorolgical factors on the dry matter productivity and net energy value under different cutting management (주요 북방형목초의 건물수량 , 화학성분 및 Net Energy 축적에 관한 연구 I. 기상환경 및 예취관리에 따른 건물 및 에너지 생산성 변화)

  • F. Muhlschlegel;G. Voigtlander
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.6 no.2
    • /
    • pp.103-110
    • /
    • 1986
  • The experiments were carried out to study the influence of meteorological factors and cutting management on dry matter accumulation and net energy value in orchardgrass (Dactlylis glomerata L.) cv. Potomac and Baraula, perennial ryegrass (Lolium perenne L.) cv. Reveille and Semperweide and meadow fescue (Festuca pratensis Huds.) cv. Cosmos 11 and N.F.G.. The field trials were designed as a split plot design with three cutting regimes of 6-7 cuts at grazing stage, 4-5 cuts at silage stage and 3 cuts at hat stage in Korea and West Germany from 1975 to 1979. The results obtained are summarized as follows: 1. Productivity of orchardgrass, perennial ryegrass and meadow fescue were mainly affected by cutting systems and meteorological factors, especially air temperature, rainfalls, solar radiation and their interactions. In West Germany, cutting frequency was to be found asan most important factor influenced to dry matter yield and net energy value. 2. Orchardgrass, taken as average of all experimental sites in Korea, produced high yield of 875 kg/10 a in dry matter, which was as much as 32% and 27% higher than those of perennial ryegrass and meadow fescue, respectively. The annual dry matter yields of orchardgrass from 1976 to 1977 were shown a little variation. Dry matter yields in Freising and Braunschweig in West Germany were increased in all grass species continuously. 3. Orchardgrass, perennial ryegrass and meadow fescue showed different response to cutting frequency. The highest dry matter yields were found under 3 cuts at hay stage for orchardgrass and 4-5 cuts at silage stage for perennial ryegrass and meadow fescue. In West Germany, dry matter yields, as average of all grass species under different cutting systems, were 1326 kg, 1175 kg and 1098 kg/10a for 3 cuts, 4-5 cuts and 6-7 cuts, respectively. 4. Chemical composition and net energy concentration of temperate grasses were influenced by cutting managements. The highest yields of digestible crude protein were obtained under 6-7 cuts at grazing stage both in Korea and West Germany. In net energy yields, 3 cutting system produced the highest yield with 694 (orchardgrass), 665 (perennial ryegrass) an 623 kStE/10 a (meadow fescue). However, frequent cutting at grazing and silage stage produced higher yields than 3 cuts at hay stage in Cheju, Suweon and Taekwalyong.

  • PDF

Studies on Dry Matter Yields , Chemical Composition and Net Energy Accumulation in Three Leading Temperate Grass Species II. Synthesis and accumulation pattern of nonstructural carbohydrate (주요 북방형목초의 건물수량 , 화학성분 및 New Energy 축적에 관한 연구 II. 비구조성탄수화물의 합성 및 축적형태)

  • ;;F. Muhlschlegel
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.6 no.2
    • /
    • pp.111-118
    • /
    • 1986
  • Sysnthesis and accumulation pattern or nonstructural carbohydrates in orchardgrass (Dactylis glomerata L.) cv. Potomac and Baraula, perennial ryegrass (Lolium perenne L.) cv. Reveille and Semperweide and meadow fescue (Festuca pratensis Huds.) cv. Cosmos 11 and N.F.G. were studied under different meteorological environments and cutting managements. The field experiments were conducted as a split plot design with three cutting regimes of 6-7 cuts at grzing stage, 4-5 cuts at silage stage and 3 cuts at hay stage in Korea and West Germany from 1975 to 1979. The results obtained are summarized as follows: 1. Accumlation of nonstructural carbohydrates in temperate grasses was influenced by grass species and regional climatic environments. Total nonstructural carbohydrates (TNC) of orchardgrass, perennial ryegrass and meadow fescue in Korea, taken as average of all cutting regimes, were shown a value of 4.39%, 6.08% and 8.01%, respectively, while those under cool summer climatic condition in West Germany accumulated to 10.42% (orchardgrass), 18.02% (perennial ryegrass) and 12.73% (meadow fescue). 2. Nonstructural carbohydrates in orchardgrass were accumulated mainly as mono-and disaccharose, while those in perennial ryegrass resreved as fructosan. The contents of fructosan and mono-and disaccharose were 1.34% and 3.04% for orchardgrass, 3.25% and 2.83% for perenninal ryegrass, respectively. Meadow fescue had a concentration of 3.93% fructosan and 4.08% mono-and disaccharose. 3. Synthesis and accumulation of nonstructural carbohydrates in temperate grasses were negative associated with increasing of air temperature (P$\leq$ 0.1%). Under hot stress during summer season in Korea, the contents of fructosan, mono-and disaccharose were decreased to about 0.34% nd 1.28% from a value of 1.34% and 2.69% in spring season. In Freising and Braunschweig, the concentration of reserved carbohydrates was less influenced by growing season. 4. Synthesis and accumulation pattern of nonstructural carbohydrates were shown a great respons to cutting frequency of the plants. Frequent cutting system under high temperature lowered the accumulation of reserved carbohydrates, especially fructosan and also caused to decrease the plant regrowth. However, under cool temperature, it shows a less differences of tructosan, mono-and disaccharose in the plants at all cutting systems.

  • PDF

Full mouth Rehabilitation with Orthognathic Surgery in Facial Asymmetry Patient : Case Report (안면 비대칭환자의 악교정 수술을 동반한 완전구강회복)

  • Im, So-Min;Shin, Hyoung-Joo;Kim, Dae-Gon;Park, Chan-Jin;Cho, Lee-Ra
    • Journal of Dental Rehabilitation and Applied Science
    • /
    • v.26 no.3
    • /
    • pp.359-371
    • /
    • 2010
  • Facial asymmetry has been found with a higher frequency (70~84%) in skeletal class III malocclusion patients. Anticipating the poor prognosis of prosthesis due to malocclusion, occlusal stability must be obtained by orthodontic treatment. Moreover, orthodontic surgery would be needed in some severe cases for better functional and esthetic results. The orthognathic surgery is performed on one jaw or two jaw depending on the results of facial diagnosis. Genioplasty may change the vertical, horizontal, sagittal position of chin by osteotomy or augmentation using implants, also. This case is about a 24 year-old male patient who visited our clinic to solve the facial asymmetry and mandibular prognathism. Skeletal class III malocclusion, maxillary canting and menton deviation to left by 13 mm were detected. Multiple ill-fitting prostheses, unesthetic maxillary anterior prostheses, and several dental caries were found. After pre-operative orthodontic treatment, Le-Fort I osteotomy, sagittal split ramus osteotomy, genioplasty, right mandibular angle augmentation were done for the correction of jaw relation and asymmetry. By diagnostic wax-up after post-operative orthodontic treatment, maxillary full mouth rehabilitation and mandibular posterior restorations were planned out. For better result, clinical crown lengthening procedure was done on #11, 12 and implant was placed on left mandibular first molar area. The patient was satisfied with the final prostheses. Because of his high caries risk, long-term prognosis will depend on the consistent maintenance of oral hygiene and periodic follow-up.

Study on the Technological System of the Cooperative Cultivation of Paddy Rice in Korea (수도집단재배의 기술체계에 관한 연구)

  • Min-Shin Cho
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.8 no.1
    • /
    • pp.129-177
    • /
    • 1970
  • For the purpose of establishing the systematized technical scheme of the cooperative rice cultivation which has most significant impact to improve rice productivity and the farm management, the author have studied the cultivation practices, and the variation of rice growth and yield between the cooperative rice cultivation and the individual rice cultivation at random selected 18 paddy fields. The author also have investigated through comparative method on the cultivation practices, management, organization and operation scheme of the two different rice cultivation methods at 460 paddy fields. The economic feasibility has been ana lysed and added in this report. The results obtained from this study are summarized as follows; 1. In the nursery, the average amount of fertilizer application, especially, phosphate and potassium, and the frequency of chemicals spray for the disease, insect and pest control at the cooperative rice cultivation are significantly higher than those of the individual rice cultivation. 2. The cultivation techniques of the cooperative rice farming after the transplanting can be characterized by a) the earlier transplanting of rice, b) the denser hills per unit area and the lesser number of seedlings per hill, c) the application of larger quantities of fertilizer including nitrogen, phosphate and potassium, d) more divided application of fertilizers, split doses of the nitrogen and potassium, e) the increased frequencies of the chemicals spray for the prevention of disease, insect and pest damages. 3. The rate of lodging in the cooperative rice cultivation was slightly higher than that of the individual rice cultivation, however, the losses of rice yield owing to the occurrence of rice stem borer and grass leaf roller in the cooperative rice cultivation were lower than that of the individual rice cultivation. 4. The culm length, panicle length, straw weight and grain-straw ratio are respectively higher at the cooperative rice cultivation, moreover, the higher variation of the above factors due to different localities of the paddy fields found at the individual rice cultivation. 5. The number of panicles, number of flowers per panicle and the weight of 1, 000 grains, those contributing components to the rice yield were significantly greater in the cooperative rice cultivation, however, not clear difference in the maturing rate was observed. The variation coefficient of the yield component in the cooperative cultivation showed lower than that or the individual rice cultivation. 6. The average yield of brown rice per 10 are in the cooperative rice cultivation obtained 459.0 kilograms while that of the individual rice cultivation brought 374.8 kilograms. The yield of brown rice in the cooperative rice cultivation increased 84.2 kilogram per 10 are over the individual rice cultivation. With lower variation coefficient of the brown rice yield in the cooperative rice cultivation, it can be said that uniformed higher yield could be obtained through the cooperative rice cultivation. 7. Highly significant positive correlations shown between the seeding date and the number of flowers per panicle, the chemical spray and the number of flowers per panicle, the transplanting date and the number of flowers per panicle, phosphate application and yield, potassium application and maturing rate, the split application of fertilizers and yield. Whilst the significant negative correlation was shown between the transplanting date and the maturing rate 8. The results of investigation from 480 paddy fields obtained through comparative method on the following items are identical in general with those obtained at 18 paddy fields: Application of fertilizers, chemical spray for the control of disease, insects and pests both in the nursery and the paddy field, transplanting date, transplanting density, split application of fertilizers and yield n the paddy fields. a) The number of rice varieties used in the cooperative rice cultivation were 13 varieties while the individual rice cultivation used 47 varieties. b) The cooperative rice cultivation has more successfully adopted improved cultivation techniques such as the practice of seed disinfection, adoption of recommended seeding amount, fall ploughing, application of red soil, introduction of power tillers, the rectangular-type transplanting, midsummer drainage and the periodical irrigation. 9. The following results were also obtained from the same investigation and they are: a) In the cooperative rice cultivation, the greater part of the important practices have been carried out through cooperative operation including seed disinfection, ploughing, application of red soil and compost, the control of disease, insects and pests, harvest, threshing and transportation of the products. b) The labor input to the nursery bed and water control in the cooperative rice cultivation was less than that of the individual rice cultivation while the higher rate of labor input was resulted in the red soil and compost application. 10. From the investigation on the organization and operation scheme of the cooperative rice cultivation, the following results were obtained: a) The size of cooperative rice cultivation farm was varied from. 3 ha to 7 ha and 5 ha farm. occupied 55.9 percent of the total farms. And a single cooperative farm was consisted of 10 to 20 plots of paddies. b) The educational back ground of the staff members involved in the cooperative rice cultivation was superior than that of the individual rice cultivation. c) All of the farmers who participated to the questionaires have responded that the cooperative rice cultivation could promise the increased rice yield mainly through the introduction of the improved method of fertilizer application and the effective control of diseases, insects and pests damages. And the majority of farmers were also in the opinion that preparation of the materials and labor input can be timely carried out and the labor requirement for the rice cultivation possibly be saved through the cooperative rice cultivation. d) The farmers who have expressed their wishes to continue and to make further development of the cooperative rice cultivation was 74.5 percent of total farmers participated to the questionaires. 11. From the analysis of economical feasibility on the two different methods of cultivation, the following results were obtained: a) The value of operation cost for the compost, chemical fertilizers, agricultural chemicals and labor input in the cooperative rice cultivation was respectively higher by 335 won, 199 won, 288 won and 303 won over the individual rice cultivation. However, the other production costs showed no distinct differences between the two cultivation methods. b) Although the total value of expenses for the fertilizers, agricultural chemicals, labor input and etc. in the cooperative rice cultivation were approximately doubled to the amount of the individual rice cultivation, the net income, substracted operation costs from the gross income, was obtained 24, 302 won in the cooperative rice cultivation and 20, 168 won was obtained from the individual rice cultivation. Thereby, it can be said that net income from the cooperative rice cultivation increased 4, 134 won over the individual rice cultivation. It was revealed in this study that the cooperative rice cultivation has not only contributed to increment of the farm income through higher yield but also showed as an effective means to introduce highly improved cultivation techniques to the farmers. It may also be concluded, therefore, the cooperative rice cultivation shall continuously renovate the rice production process of the farmers.

  • PDF

Performance Analysis of Frequent Pattern Mining with Multiple Minimum Supports (다중 최소 임계치 기반 빈발 패턴 마이닝의 성능분석)

  • Ryang, Heungmo;Yun, Unil
    • Journal of Internet Computing and Services
    • /
    • v.14 no.6
    • /
    • pp.1-8
    • /
    • 2013
  • Data mining techniques are used to find important and meaningful information from huge databases, and pattern mining is one of the significant data mining techniques. Pattern mining is a method of discovering useful patterns from the huge databases. Frequent pattern mining which is one of the pattern mining extracts patterns having higher frequencies than a minimum support threshold from databases, and the patterns are called frequent patterns. Traditional frequent pattern mining is based on a single minimum support threshold for the whole database to perform mining frequent patterns. This single support model implicitly supposes that all of the items in the database have the same nature. In real world applications, however, each item in databases can have relative characteristics, and thus an appropriate pattern mining technique which reflects the characteristics is required. In the framework of frequent pattern mining, where the natures of items are not considered, it needs to set the single minimum support threshold to a too low value for mining patterns containing rare items. It leads to too many patterns including meaningless items though. In contrast, we cannot mine any pattern if a too high threshold is used. This dilemma is called the rare item problem. To solve this problem, the initial researches proposed approximate approaches which split data into several groups according to item frequencies or group related rare items. However, these methods cannot find all of the frequent patterns including rare frequent patterns due to being based on approximate techniques. Hence, pattern mining model with multiple minimum supports is proposed in order to solve the rare item problem. In the model, each item has a corresponding minimum support threshold, called MIS (Minimum Item Support), and it is calculated based on item frequencies in databases. The multiple minimum supports model finds all of the rare frequent patterns without generating meaningless patterns and losing significant patterns by applying the MIS. Meanwhile, candidate patterns are extracted during a process of mining frequent patterns, and the only single minimum support is compared with frequencies of the candidate patterns in the single minimum support model. Therefore, the characteristics of items consist of the candidate patterns are not reflected. In addition, the rare item problem occurs in the model. In order to address this issue in the multiple minimum supports model, the minimum MIS value among all of the values of items in a candidate pattern is used as a minimum support threshold with respect to the candidate pattern for considering its characteristics. For efficiently mining frequent patterns including rare frequent patterns by adopting the above concept, tree based algorithms of the multiple minimum supports model sort items in a tree according to MIS descending order in contrast to those of the single minimum support model, where the items are ordered in frequency descending order. In this paper, we study the characteristics of the frequent pattern mining based on multiple minimum supports and conduct performance evaluation with a general frequent pattern mining algorithm in terms of runtime, memory usage, and scalability. Experimental results show that the multiple minimum supports based algorithm outperforms the single minimum support based one and demands more memory usage for MIS information. Moreover, the compared algorithms have a good scalability in the results.

Edge to Edge Model and Delay Performance Evaluation for Autonomous Driving (자율 주행을 위한 Edge to Edge 모델 및 지연 성능 평가)

  • Cho, Moon Ki;Bae, Kyoung Yul
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.191-207
    • /
    • 2021
  • Up to this day, mobile communications have evolved rapidly over the decades, mainly focusing on speed-up to meet the growing data demands of 2G to 5G. And with the start of the 5G era, efforts are being made to provide such various services to customers, as IoT, V2X, robots, artificial intelligence, augmented virtual reality, and smart cities, which are expected to change the environment of our lives and industries as a whole. In a bid to provide those services, on top of high speed data, reduced latency and reliability are critical for real-time services. Thus, 5G has paved the way for service delivery through maximum speed of 20Gbps, a delay of 1ms, and a connecting device of 106/㎢ In particular, in intelligent traffic control systems and services using various vehicle-based Vehicle to X (V2X), such as traffic control, in addition to high-speed data speed, reduction of delay and reliability for real-time services are very important. 5G communication uses high frequencies of 3.5Ghz and 28Ghz. These high-frequency waves can go with high-speed thanks to their straightness while their short wavelength and small diffraction angle limit their reach to distance and prevent them from penetrating walls, causing restrictions on their use indoors. Therefore, under existing networks it's difficult to overcome these constraints. The underlying centralized SDN also has a limited capability in offering delay-sensitive services because communication with many nodes creates overload in its processing. Basically, SDN, which means a structure that separates signals from the control plane from packets in the data plane, requires control of the delay-related tree structure available in the event of an emergency during autonomous driving. In these scenarios, the network architecture that handles in-vehicle information is a major variable of delay. Since SDNs in general centralized structures are difficult to meet the desired delay level, studies on the optimal size of SDNs for information processing should be conducted. Thus, SDNs need to be separated on a certain scale and construct a new type of network, which can efficiently respond to dynamically changing traffic and provide high-quality, flexible services. Moreover, the structure of these networks is closely related to ultra-low latency, high confidence, and hyper-connectivity and should be based on a new form of split SDN rather than an existing centralized SDN structure, even in the case of the worst condition. And in these SDN structural networks, where automobiles pass through small 5G cells very quickly, the information change cycle, round trip delay (RTD), and the data processing time of SDN are highly correlated with the delay. Of these, RDT is not a significant factor because it has sufficient speed and less than 1 ms of delay, but the information change cycle and data processing time of SDN are factors that greatly affect the delay. Especially, in an emergency of self-driving environment linked to an ITS(Intelligent Traffic System) that requires low latency and high reliability, information should be transmitted and processed very quickly. That is a case in point where delay plays a very sensitive role. In this paper, we study the SDN architecture in emergencies during autonomous driving and conduct analysis through simulation of the correlation with the cell layer in which the vehicle should request relevant information according to the information flow. For simulation: As the Data Rate of 5G is high enough, we can assume the information for neighbor vehicle support to the car without errors. Furthermore, we assumed 5G small cells within 50 ~ 250 m in cell radius, and the maximum speed of the vehicle was considered as a 30km ~ 200 km/hour in order to examine the network architecture to minimize the delay.