Search | Korea Science

A novel classification approach based on Naïve Bayes for Twitter sentiment analysis

Song, Junseok;Kim, Kyung Tae;Lee, Byungjun;Kim, Sangyoung;Youn, Hee Yong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.11 no.6
- /
- pp.2996-3011
- /
- 2017
With rapid growth of web technology and dissemination of smart devices, social networking service(SNS) is widely used. As a result, huge amount of data are generated from SNS such as Twitter, and sentiment analysis of SNS data is very important for various applications and services. In the existing sentiment analysis based on the $Na{\ddot{i}}ve$ Bayes algorithm, a same number of attributes is usually employed to estimate the weight of each class. Moreover, uncountable and meaningless attributes are included. This results in decreased accuracy of sentiment analysis. In this paper two methods are proposed to resolve these issues, which reflect the difference of the number of positive words and negative words in calculating the weights, and eliminate insignificant words in the feature selection step using Multinomial $Na{\ddot{i}}ve$ Bayes(MNB) algorithm. Performance comparison demonstrates that the proposed scheme significantly increases the accuracy compared to the existing Multivariate Bernoulli $Na{\ddot{i}}ve$ Bayes(BNB) algorithm and MNB scheme.
https://doi.org/10.3837/tiis.2017.06.011 인용 PDF KSCI

P2P Traffic Classification using Advanced Heuristic Rules and Analysis of Decision Tree Algorithms (개선된 휴리스틱 규칙 및 의사 결정 트리 분석을 이용한 P2P 트래픽 분류 기법)

Ye, Wujian;Cho, Kyungsan
- Journal of the Korea Society of Computer and Information
- /
- v.19 no.3
- /
- pp.45-54
- /
- 2014
In this paper, an improved two-step P2P traffic classification scheme is proposed to overcome the limitations of the existing methods. The first step is a signature-based classifier at the packet-level. The second step consists of pattern heuristic rules and a statistics-based classifier at the flow-level. With pattern heuristic rules, the accuracy can be improved and the amount of traffic to be classified by statistics-based classifier can be reduced. Based on the analysis of different decision tree algorithms, the statistics-based classifier is implemented with REPTree. In addition, the ensemble algorithm is used to improve the performance of statistics-based classifier Through the verification with the real datasets, it is shown that our hybrid scheme provides higher accuracy and lower overhead compared to other existing schemes.
https://doi.org/10.9708/jksci.2014.19.3.045 인용 PDF KSCI

Analysis and Implementation of Speech/Music Classification for 3GPP2 SMV Codec Based on Support Vector Machine (SMV코덱의 음성/음악 분류 성능 향상을 위한 Support Vector Machine의 적용)

Kim, Sang-Kyun;Chang, Joon-Hyuk
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.45 no.6
- /
- pp.142-147
- /
- 2008
In this paper, we propose a novel a roach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using the support vector machine (SVM). The SVM makes it possible to build on an optimal hyperplane that is separated without the error where the distance between the closest vectors and the hyperplane is maximal. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then feature vectors which are a lied to the SVM are selected from relevant parameters of the SMV for the efficient speech/music classification. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.
PDF KSCI

Efficient Channel Management to Maximize Spectrum Holes in Cognitive Radio Networks (CR 네트워크에서의 유휴자원 증대를 위한 효율적인 채널 관리 방법)

Jeong, Pil-Jung;Shin, Yo-An;Lee, Won-Cheol;Yoo, Myung-Sik
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.32 no.10B
- /
- pp.621-629
- /
- 2007
In cognitive radio (CR) network, the channels are generally classified into either the unavailable channels that are occupied by incumbent users or the available channels that are not occupied. The conventional channel classification scheme may result in poor utilization of spectrum holes since it does not take the spatial relationship between CR node and incumbent users into consideration. In this paper, we propose an efficient channel management scheme for the centralized CR network to maximize the spectrum holes by overcoming the shortcomings of conventional scheme. In addition, we mathematically analyze the effectiveness of proposed scheme. Based on the proposed channel management scheme, we also propose the rendezvous algorithm, which can establish the control channels between base station and CR node under the dynamically changing spectrum environment.
PDF KSCI

Region adaptive demosaicing algorithm (영역 적응적 컬러보간 알고리듬)

Kim, Chang-Won;Oh, Hyun-Mook;Kang, Moon-Gi
- Proceedings of the IEEK Conference
- /
- 2008.06a
- /
- pp.849-850
- /
- 2008
Region adaptive demosaicing scheme based on the spectral spatial correlation is prosed in this paper. Region classification and adaptive demosaicing makes the decision of the edge direction more accurately.
PDF

Adaptive Object Classification using DWT and FI (이산웨이블릿 변환과 퍼지추론을 이용한 적응적 물체 분류)

Kim, Yoon-Ho
- Journal of Advanced Navigation Technology
- /
- v.10 no.3
- /
- pp.219-225
- /
- 2006
This paper presents a method of object classification based on discrete wavelet transform (DWT) and fuzzy inference(FI). It concentrated not only on the design of fuzzy inference algorithm which is suitable for low speed uninhabited transportation such as, conveyor but also on the minimize the number of fuzzy rule. In the preprocess of feature extracting, feature parameters are extracted by using characteristics of the coefficients matrix of DWT. Such feature parameters as area, perimeter and a/p ratio are used obtained from DWT coefficients blocks. Secondly, fuzzy if - then rules that can be able to adapt the variety of surroundings are developed. In order to verify the performance of proposed scheme, In the middle of fuzzy inference, the Mamdani's and the Larsen 's implication operators are utilized. Experimental results showed that proposed scheme can be applied to the variety of surroundings.
PDF

A Deep Learning Based Over-Sampling Scheme for Imbalanced Data Classification (불균형 데이터 분류를 위한 딥러닝 기반 오버샘플링 기법)

Son, Min Jae;Jung, Seung Won;Hwang, Een Jun
- KIPS Transactions on Software and Data Engineering
- /
- v.8 no.7
- /
- pp.311-316
- /
- 2019
Classification problem is to predict the class to which an input data belongs. One of the most popular methods to do this is training a machine learning algorithm using the given dataset. In this case, the dataset should have a well-balanced class distribution for the best performance. However, when the dataset has an imbalanced class distribution, its classification performance could be very poor. To overcome this problem, we propose an over-sampling scheme that balances the number of data by using Conditional Generative Adversarial Networks (CGAN). CGAN is a generative model developed from Generative Adversarial Networks (GAN), which can learn data characteristics and generate data that is similar to real data. Therefore, CGAN can generate data of a class which has a small number of data so that the problem induced by imbalanced class distribution can be mitigated, and classification performance can be improved. Experiments using actual collected data show that the over-sampling technique using CGAN is effective and that it is superior to existing over-sampling techniques.
https://doi.org/10.3745/KTSDE.2019.8.7.311 인용 PDF KSCI HTML

A FASTER LU DECOMPOSITION FOR PARALLEL C PROGRAMS

Lee, Sang-Moon;Lee, Chin-Young
- Journal of applied mathematics & informatics
- /
- v.3 no.2
- /
- pp.217-234
- /
- 1996
This report introduces a faster parallel LU decomposi-tion algorithm that gives a speedup almost equal to the number of nodes used. The new algorithm takes an advantage of an important C feature that lays out a matrix using a row major scheme and is based on the currently widely used LU decomposition algorithm with one major modification to eliminate most of the communication overhead. Empirical results are included in this report. For example solving a dense matrix that contains 100,000,000 elements gives a speedup of 50 when executed on 50 nodes of an intel Paragon in parallel.

Neuro-Fuzzy System and Its Application Using CART Algorithm and Hybrid Parameter Learning (CART 알고리즘과 하이브리드 학습을 통한 뉴로-퍼지 시스템과 응용)

Oh, B.K.;Kwak, K.C.;Ryu, J.W.
- Proceedings of the KIEE Conference
- /
- 1998.07b
- /
- pp.578-580
- /
- 1998
The paper presents an approach to the structure identification based on the CART (Classification And Regression Tree) algorithm and to the parameter identification by hybrid learning method in neuro-fuzzy system. By using the CART algorithm, the proposed method can roughly estimate the numbers of membership function and fuzzy rule using the centers of decision regions. Then the parameter identification is carried out by the hybrid learning scheme using BP (Back-propagation) and RLSE (Recursive Least Square Estimation) from the numerical data. Finally, we will show it's usefulness for fuzzy modeling to truck backer upper control.
PDF

A Corner Matching Algorithm with Uncertainty Handling Capability

Lee, Kil-jae;Zeungnam Bien
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 1997.11a
- /
- pp.228-233
- /
- 1997
An efficient corner matching algorithm is developed to minimize the amount of calculation. To reduce the amount of calculation, all available information from a corner detector is used to make model. This information has uncertainties due to discretization noise and geometric distortion, and this is represented by fuzzy rule base which can represent and handle the uncertainties. Form fuzzy inference procedure, a matched segment list is extracted, and resulted segment list is used to calculate the transformation between object of model and scene. To reduce the false hypotheses, a vote and re-vote method is developed. Also an auto tuning scheme of the fuzzy rule base is developed to find out the uncertainties of features from recognized results automatically. To show the effectiveness of the developed algorithm, experiments are conducted for images of real electronic components.
PDF

Search Result 144, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)