• Title/Summary/Keyword: Matrix vector

Search Result 759, Processing Time 0.029 seconds

Performance Analysis of the Array Shape Estimation Methods Based on the Nearfield Signal Modeling (근거리 신호 모델링을 기반으로 한 어레이 형상 추정 기법들의 성능 분석)

  • Park, Hee-Young;Lee, Chung-Yong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.5
    • /
    • pp.221-228
    • /
    • 2008
  • To estimate array shape with reference sources in SONAR systems, nearfield signal modeling is required for the reference sources near a towed array. Array shape estimation method based on the nearfield signal modeling generally exploits the spatial covariance matrix of the received reference sources. Among those method, nearfield eigenvector method uses the eigenvector corresponding to the maximum eigenvalue as a steering vector of the reference source. In this paper, we propose a simplified subspace fitting method based on the nearfield signal modeling with spherical wave modeling. Furthermore, we analyze performance of the array shape estimation methods based on the nearfield signal modeling for various environments. The results of the numerical experiments indicate that the simplified subspace fitting method and the nearfield eigenvector method with single reference source shows almost similar performance. Furthermore, the simplified subspace fitting method with 2 reference sources consistently estimates the shape of the array regardless of the incident angle of the reference sources, whereas the nearfield eigenvector method cannot apply for the case of 2 reference sources.

Label Embedding for Improving Classification Accuracy UsingAutoEncoderwithSkip-Connections (다중 레이블 분류의 정확도 향상을 위한 스킵 연결 오토인코더 기반 레이블 임베딩 방법론)

  • Kim, Museong;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.175-197
    • /
    • 2021
  • Recently, with the development of deep learning technology, research on unstructured data analysis is being actively conducted, and it is showing remarkable results in various fields such as classification, summary, and generation. Among various text analysis fields, text classification is the most widely used technology in academia and industry. Text classification includes binary class classification with one label among two classes, multi-class classification with one label among several classes, and multi-label classification with multiple labels among several classes. In particular, multi-label classification requires a different training method from binary class classification and multi-class classification because of the characteristic of having multiple labels. In addition, since the number of labels to be predicted increases as the number of labels and classes increases, there is a limitation in that performance improvement is difficult due to an increase in prediction difficulty. To overcome these limitations, (i) compressing the initially given high-dimensional label space into a low-dimensional latent label space, (ii) after performing training to predict the compressed label, (iii) restoring the predicted label to the high-dimensional original label space, research on label embedding is being actively conducted. Typical label embedding techniques include Principal Label Space Transformation (PLST), Multi-Label Classification via Boolean Matrix Decomposition (MLC-BMaD), and Bayesian Multi-Label Compressed Sensing (BML-CS). However, since these techniques consider only the linear relationship between labels or compress the labels by random transformation, it is difficult to understand the non-linear relationship between labels, so there is a limitation in that it is not possible to create a latent label space sufficiently containing the information of the original label. Recently, there have been increasing attempts to improve performance by applying deep learning technology to label embedding. Label embedding using an autoencoder, a deep learning model that is effective for data compression and restoration, is representative. However, the traditional autoencoder-based label embedding has a limitation in that a large amount of information loss occurs when compressing a high-dimensional label space having a myriad of classes into a low-dimensional latent label space. This can be found in the gradient loss problem that occurs in the backpropagation process of learning. To solve this problem, skip connection was devised, and by adding the input of the layer to the output to prevent gradient loss during backpropagation, efficient learning is possible even when the layer is deep. Skip connection is mainly used for image feature extraction in convolutional neural networks, but studies using skip connection in autoencoder or label embedding process are still lacking. Therefore, in this study, we propose an autoencoder-based label embedding methodology in which skip connections are added to each of the encoder and decoder to form a low-dimensional latent label space that reflects the information of the high-dimensional label space well. In addition, the proposed methodology was applied to actual paper keywords to derive the high-dimensional keyword label space and the low-dimensional latent label space. Using this, we conducted an experiment to predict the compressed keyword vector existing in the latent label space from the paper abstract and to evaluate the multi-label classification by restoring the predicted keyword vector back to the original label space. As a result, the accuracy, precision, recall, and F1 score used as performance indicators showed far superior performance in multi-label classification based on the proposed methodology compared to traditional multi-label classification methods. This can be seen that the low-dimensional latent label space derived through the proposed methodology well reflected the information of the high-dimensional label space, which ultimately led to the improvement of the performance of the multi-label classification itself. In addition, the utility of the proposed methodology was identified by comparing the performance of the proposed methodology according to the domain characteristics and the number of dimensions of the latent label space.

Comparative study of flood detection methodologies using Sentinel-1 satellite imagery (Sentinel-1 위성 영상을 활용한 침수 탐지 기법 방법론 비교 연구)

  • Lee, Sungwoo;Kim, Wanyub;Lee, Seulchan;Jeong, Hagyu;Park, Jongsoo;Choi, Minha
    • Journal of Korea Water Resources Association
    • /
    • v.57 no.3
    • /
    • pp.181-193
    • /
    • 2024
  • The increasing atmospheric imbalance caused by climate change leads to an elevation in precipitation, resulting in a heightened frequency of flooding. Consequently, there is a growing need for technology to detect and monitor these occurrences, especially as the frequency of flooding events rises. To minimize flood damage, continuous monitoring is essential, and flood areas can be detected by the Synthetic Aperture Radar (SAR) imagery, which is not affected by climate conditions. The observed data undergoes a preprocessing step, utilizing a median filter to reduce noise. Classification techniques were employed to classify water bodies and non-water bodies, with the aim of evaluating the effectiveness of each method in flood detection. In this study, the Otsu method and Support Vector Machine (SVM) technique were utilized for the classification of water bodies and non-water bodies. The overall performance of the models was assessed using a Confusion Matrix. The suitability of flood detection was evaluated by comparing the Otsu method, an optimal threshold-based classifier, with SVM, a machine learning technique that minimizes misclassifications through training. The Otsu method demonstrated suitability in delineating boundaries between water and non-water bodies but exhibited a higher rate of misclassifications due to the influence of mixed substances. Conversely, the use of SVM resulted in a lower false positive rate and proved less sensitive to mixed substances. Consequently, SVM exhibited higher accuracy under conditions excluding flooding. While the Otsu method showed slightly higher accuracy in flood conditions compared to SVM, the difference in accuracy was less than 5% (Otsu: 0.93, SVM: 0.90). However, in pre-flooding and post-flooding conditions, the accuracy difference was more than 15%, indicating that SVM is more suitable for water body and flood detection (Otsu: 0.77, SVM: 0.92). Based on the findings of this study, it is anticipated that more accurate detection of water bodies and floods could contribute to minimizing flood-related damages and losses.

A Study on Music Summarization (음악요약 생성에 관한 연구)

  • Kim Sung-Tak;Kim Sang-Ho;Kim Hoi-Rin;Choi Ji-Hoon;Lee Han-Kyu;Hong Jin-Woo
    • Journal of Broadcast Engineering
    • /
    • v.11 no.1 s.30
    • /
    • pp.3-14
    • /
    • 2006
  • Music summarization means a technique which automatically generates the most importantand representative a part or parts ill music content. The techniques of music summarization have been studied with two categories according to summary characteristics. The first one is that the repeated part is provided as music summary and the second provides the combined segments which consist of segments with different characteristics as music summary in music content In this paper, we propose and evaluate two kinds of music summarization techniques. The algorithm using multi-level vector quantization which provides a repeated part as music summary gives fixed-length music summary is evaluated by overlapping ration between hand-made repeated parts and automatically generated summary. As results, the overlapping ratios of conventional methods are 42.2% and 47.4%, but that of proposed method with fixed-length summary is 67.1%. Optimal length music summary is evaluated by the portion of overlapping between summary and repeated part which is different length according to music content and the result shows that automatically-generated summary expresses more effective part than fixed-length summary with optimal length. The cluster-based algorithm using 2-D similarity matrix and k-means algorithm provides the combined segments as music summary. In order to evaluate this algorithm, we use MOS test consisting of two questions(How many similar segments are in summarized music? How many segments are included in same structure?) and the results show good performance.

Phase Transformation of 2 Components(CaO-, $Y_2O_3$-, MgO-$ZrO_2$) and 3 Components(MgO-$ZrO_2-Al_2O_3)$ Zirconia by X-ray Diffraction and Raman Spectroscopy (X-선회절과 Raman 분광분석을 이용한 2성분계(CaO-, $Y_2O_3$-, MgO-$ZrO_2$) 및 3성분계(MgO-$ZrO_2-Al_2O_3)$ Zirconia의 상전이연구)

  • 은희태;황진명
    • Journal of the Korean Ceramic Society
    • /
    • v.34 no.2
    • /
    • pp.145-156
    • /
    • 1997
  • ZrO2 phase transformations depending on the type and amount of dopants and the sintering temperatures were studied for the 2 components (CaO-, Y2O3-, MgO-ZrO2) and the 3 components(MgO-ZrO2-Al2O3)ZrO2 powder by X-ray diffraction and Raman spectroscopy. In the CaO- and Y2O3-ZrO2 systems, as the CaO and Y2O3 contents increased to 6~15mol% and 3~15mol% respectively, we were not able to identify between tetragonal and cubic in the X-ray diffraction patterns. On the other hand, all Raman modes shifted to lower wavenumbers, decreasing in intensity and the number of bands, markedly. These phenomena were caused by tetragonallongrightarrowcubic phase transformation and interpreted by the breakdown of the wave vector selection rule(k=0) and the structural disorder associated with the formation of oxygen sublattice which was caused by the substitution between Zr4+ ion and Ca2+ or Y3+ ion in ZrO2 matrix. The monoclinic to cubic phase transformation occurred in 10mol% MgO-ZrO2 system. As the Al2O3 content increased from 0 to 20mol% in the MgO-ZrO2-Al2O3 systems, cubic phase transformed to monoclinic phase, this is because the MgO didn't play a role in a stabilizer because of the formation of the spinel(MgAl2O4) by the reaction between MgO and Al2O3, Also, the ZrO2 phase transformation was explained by the change of it's lattice parameters depending on the type and amount of dopants. Namely, as the amount of dopant increased to 10~13mol%, the axial ra-tio c/a came close to unity with increasing the lattice parameter a and decreasing the lattice parameter c. At that time, the tetragonallongrightarrowcubic phase transformation occurred.

  • PDF

Amino Acid Biosynthesis and Gene Regulation in Seed (종자내 아미노산 합성 조절 유전자에 관한 연구)

  • ;;;;;Fumio Takaiwa
    • Proceedings of the Botanical Society of Korea Conference
    • /
    • 1996.07a
    • /
    • pp.61-74
    • /
    • 1996
  • Human and monogastric animals can not synthesize 10 out of the 20 amino asids and therefor need to obtain these from their diet. The plant seed is a major source of dietary protein. It is particular important in their study to increase nutritional quality of the seed storage proteins. The low contents of lysine, asparagine and threonenein various cereal seeds and of cystein and methionine. In legume seeds is due to the low proportions of these amino acids in the major storage proteins, we have tried to apply the three strategies; (1) mutagenesis and selection of specific amino acid analogue resistance, (2) cloning and expression study of lysine biosynthesis related gene, (3) transfomation of lysine rich soybean glycinin gene. The 5-methyltryptophan (5MT) resistant cell lines, SAR1, SAR2 and SAR3 were selected from anther derived callus of rice (Oryza sativa L. "Sasanishiki"). Among these selected cell lines, two (SAR1 and SAR3) were able to grow stably at 200 mg/L of 5MT. Analysis of the freed amino acids in callus shows that 5MT resistant cells (SAR3) accumulated free tryptophan at least up to 50 times higher than those that of the higher than of SAS. These results indicated that the 5MT resistant cell lines are useful in studies of amino acid biosynthesis. Tr75, a rice (Oryza sativa L., var. Sasanishiki) mutant resistant to 5MT was segregated from the progenies of its initial mutant line, TR1. The 5MT resistant of TR75 was inherited in the M8 generations as a single dominant nuclear gene. The content of free amino acids in the TR75 homozygous seeds increased approximately 1.5 to 2.0 fold compared to wild-type seeds. Especially, the contents of tryptophan, phenylalanine and aspartic acid were 5.0, 5.3 and 2.7 times higher than those of wild-type seeds, respectively. The content of lysine is significantly low in rice. The lysine is synthesized by a complex pathway that is predominantly regulated by feedback inhibition of several enzymes including asparginase, aspatate kinase, dihydrodipicolinat synthase, etc. For understanding the regulation mechanism of lysine synthesis in rice, we try to clone the lysine biosynthetic metabolism related gene, DHPS and asparaginase, from rice. We have isolated a rice DHPS genomic clone which contains an ORF of 1044 nucleotides (347 amino acids, Mr. 38, 381 daltons), an intron of 587 nucleotides and 5'and 3'-flanking regions by screening of rice genomic DNA library. Deduced amino acid sequence of mature peptide domain of GDHPS clone is highly conserved in monocot and dicot plants whereas that of transit peptide domain is extremely different depending on plant specie. Southern blot analysis indicated that GDHPS is located two copy gene in rice genome. The transcripts of a rice GDHPS were expressed in leaves and roots but not detected in callus tissues. The transcription level of GDHPS is much higher in leaves indicating enormous chloroplast development than roots. Genomic DNA clones for asparaginase genes were screened from the rice genomic library by using plaque hybridization technique. Twelve different genomic clones were isolated from first and second screening, and 8 of 12 clones were analyzed by restriction patterns and identified by Southern Blotting, Restriction enzyme digestion patterns and Southern blot analysis of 8 clones show the different pattern for asparaginase gene. Genomic Southern blot analysis from rice were done. It is estimated that rice has at least 2-3 copy of asparaginase gene. One of 8 positive clones was subcloned into the pBluescript SK(+) vector, and was constructed the physical map. For transformation of lysine rich storage protein into tobacco, soybean glycinin genes are transformed into tobacco. To examine whether glycinin could be stably accumulated in endosperm tissue, the glycinin cDNA was transcriptionally fused to an endosperm-specific promotor of the rice storage protein glutelin gene and then introduced into tobacco genomic via Agrobacterium-mediated transformation. Consequently the glycinin gene was expressed in a seed-and developmentally-specific manner in transgenic tobacco seeds. Glycinin were targeted to vacuole-derived protein bodies in the endosperm tissue and highly accumulated in the matrix region of many transgenic plant (1-4% of total seed proteins). Synthesized glycinin was processed into mature form, and assembled into a hexamer in a similar manner as the glycinin in soybean seed. Modified glycinin, in which 4 contiguous methionine residues were inserted at the variable regions corresponding to the C - teminal regions of the acidic and basic polypeptides, were also found to be accumulated similarly as in the normal glycinin. There was no apparent difference in the expression level, processing and targeting to protein bodies, or accumulation level between normal and modified glycinin. glycinin.

  • PDF

Research on hybrid music recommendation system using metadata of music tracks and playlists (음악과 플레이리스트의 메타데이터를 활용한 하이브리드 음악 추천 시스템에 관한 연구)

  • Hyun Tae Lee;Gyoo Gun Lim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.145-165
    • /
    • 2023
  • Recommendation system plays a significant role on relieving difficulties of selecting information among rapidly increasing amount of information caused by the development of the Internet and on efficiently displaying information that fits individual personal interest. In particular, without the help of recommendation system, E-commerce and OTT companies cannot overcome the long-tail phenomenon, a phenomenon in which only popular products are consumed, as the number of products and contents are rapidly increasing. Therefore, the research on recommendation systems is being actively conducted to overcome the phenomenon and to provide information or contents that are aligned with users' individual interests, in order to induce customers to consume various products or contents. Usually, collaborative filtering which utilizes users' historical behavioral data shows better performance than contents-based filtering which utilizes users' preferred contents. However, collaborative filtering can suffer from cold-start problem which occurs when there is lack of users' historical behavioral data. In this paper, hybrid music recommendation system, which can solve cold-start problem, is proposed based on the playlist data of Melon music streaming service that is given by Kakao Arena for music playlist continuation competition. The goal of this research is to use music tracks, that are included in the playlists, and metadata of music tracks and playlists in order to predict other music tracks when the half or whole of the tracks are masked. Therefore, two different recommendation procedures were conducted depending on the two different situations. When music tracks are included in the playlist, LightFM is used in order to utilize the music track list of the playlists and metadata of each music tracks. Then, the result of Item2Vec model, which uses vector embeddings of music tracks, tags and titles for recommendation, is combined with the result of LightFM model to create final recommendation list. When there are no music tracks available in the playlists but only playlists' tags and titles are available, recommendation was made by finding similar playlists based on playlists vectors which was made by the aggregation of FastText pre-trained embedding vectors of tags and titles of each playlists. As a result, not only cold-start problem can be resolved, but also achieved better performance than ALS, BPR and Item2Vec by using the metadata of both music tracks and playlists. In addition, it was found that the LightFM model, which uses only artist information as an item feature, shows the best performance compared to other LightFM models which use other item features of music tracks.

Animal Infectious Diseases Prevention through Big Data and Deep Learning (빅데이터와 딥러닝을 활용한 동물 감염병 확산 차단)

  • Kim, Sung Hyun;Choi, Joon Ki;Kim, Jae Seok;Jang, Ah Reum;Lee, Jae Ho;Cha, Kyung Jin;Lee, Sang Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.137-154
    • /
    • 2018
  • Animal infectious diseases, such as avian influenza and foot and mouth disease, occur almost every year and cause huge economic and social damage to the country. In order to prevent this, the anti-quarantine authorities have tried various human and material endeavors, but the infectious diseases have continued to occur. Avian influenza is known to be developed in 1878 and it rose as a national issue due to its high lethality. Food and mouth disease is considered as most critical animal infectious disease internationally. In a nation where this disease has not been spread, food and mouth disease is recognized as economic disease or political disease because it restricts international trade by making it complex to import processed and non-processed live stock, and also quarantine is costly. In a society where whole nation is connected by zone of life, there is no way to prevent the spread of infectious disease fully. Hence, there is a need to be aware of occurrence of the disease and to take action before it is distributed. Epidemiological investigation on definite diagnosis target is implemented and measures are taken to prevent the spread of disease according to the investigation results, simultaneously with the confirmation of both human infectious disease and animal infectious disease. The foundation of epidemiological investigation is figuring out to where one has been, and whom he or she has met. In a data perspective, this can be defined as an action taken to predict the cause of disease outbreak, outbreak location, and future infection, by collecting and analyzing geographic data and relation data. Recently, an attempt has been made to develop a prediction model of infectious disease by using Big Data and deep learning technology, but there is no active research on model building studies and case reports. KT and the Ministry of Science and ICT have been carrying out big data projects since 2014 as part of national R &D projects to analyze and predict the route of livestock related vehicles. To prevent animal infectious diseases, the researchers first developed a prediction model based on a regression analysis using vehicle movement data. After that, more accurate prediction model was constructed using machine learning algorithms such as Logistic Regression, Lasso, Support Vector Machine and Random Forest. In particular, the prediction model for 2017 added the risk of diffusion to the facilities, and the performance of the model was improved by considering the hyper-parameters of the modeling in various ways. Confusion Matrix and ROC Curve show that the model constructed in 2017 is superior to the machine learning model. The difference between the2016 model and the 2017 model is that visiting information on facilities such as feed factory and slaughter house, and information on bird livestock, which was limited to chicken and duck but now expanded to goose and quail, has been used for analysis in the later model. In addition, an explanation of the results was added to help the authorities in making decisions and to establish a basis for persuading stakeholders in 2017. This study reports an animal infectious disease prevention system which is constructed on the basis of hazardous vehicle movement, farm and environment Big Data. The significance of this study is that it describes the evolution process of the prediction model using Big Data which is used in the field and the model is expected to be more complete if the form of viruses is put into consideration. This will contribute to data utilization and analysis model development in related field. In addition, we expect that the system constructed in this study will provide more preventive and effective prevention.

Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

  • Thay, Setha;Ha, Inay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.1-20
    • /
    • 2013
  • Nowadays, social network is a huge communication platform for providing people to connect with one another and to bring users together to share common interests, experiences, and their daily activities. Users spend hours per day in maintaining personal information and interacting with other people via posting, commenting, messaging, games, social events, and applications. Due to the growth of user's distributed information in social network, there is a great potential to utilize the social data to enhance the quality of recommender system. There are some researches focusing on social network analysis that investigate how social network can be used in recommendation domain. Among these researches, we are interested in taking advantages of the interaction between a user and others in social network that can be determined and known as social relationship. Furthermore, mostly user's decisions before purchasing some products depend on suggestion of people who have either the same preferences or closer relationship. For this reason, we believe that user's relationship in social network can provide an effective way to increase the quality in prediction user's interests of recommender system. Therefore, social relationship between users encountered from social network is a common factor to improve the way of predicting user's preferences in the conventional approach. Recommender system is dramatically increasing in popularity and currently being used by many e-commerce sites such as Amazon.com, Last.fm, eBay.com, etc. Collaborative filtering (CF) method is one of the essential and powerful techniques in recommender system for suggesting the appropriate items to user by learning user's preferences. CF method focuses on user data and generates automatic prediction about user's interests by gathering information from users who share similar background and preferences. Specifically, the intension of CF method is to find users who have similar preferences and to suggest target user items that were mostly preferred by those nearest neighbor users. There are two basic units that need to be considered by CF method, the user and the item. Each user needs to provide his rating value on items i.e. movies, products, books, etc to indicate their interests on those items. In addition, CF uses the user-rating matrix to find a group of users who have similar rating with target user. Then, it predicts unknown rating value for items that target user has not rated. Currently, CF has been successfully implemented in both information filtering and e-commerce applications. However, it remains some important challenges such as cold start, data sparsity, and scalability reflected on quality and accuracy of prediction. In order to overcome these challenges, many researchers have proposed various kinds of CF method such as hybrid CF, trust-based CF, social network-based CF, etc. In the purpose of improving the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method which integrates traditional CF technique with social relationship between users discovered from user's behavior in social network i.e. Facebook. We identify user's relationship from behavior of user such as posts and comments interacted with friends in Facebook. We believe that social relationship implicitly inferred from user's behavior can be likely applied to compensate the limitation of conventional approach. Therefore, we extract posts and comments of each user by using Facebook Graph API and calculate feature score among each term to obtain feature vector for computing similarity of user. Then, we combine the result with similarity value computed using traditional CF technique. Finally, our system provides a list of recommended items according to neighbor users who have the biggest total similarity value to the target user. In order to verify and evaluate our proposed method we have performed an experiment on data collected from our Movies Rating System. Prediction accuracy evaluation is conducted to demonstrate how much our algorithm gives the correctness of recommendation to user in terms of MAE. Then, the evaluation of performance is made to show the effectiveness of our method in terms of precision, recall, and F1-measure. Evaluation on coverage is also included in our experiment to see the ability of generating recommendation. The experimental results show that our proposed method outperform and more accurate in suggesting items to users with better performance. The effectiveness of user's behavior in social network particularly shows the significant improvement by up to 6% on recommendation accuracy. Moreover, experiment of recommendation performance shows that incorporating social relationship observed from user's behavior into CF is beneficial and useful to generate recommendation with 7% improvement of performance compared with benchmark methods. Finally, we confirm that interaction between users in social network is able to enhance the accuracy and give better recommendation in conventional approach.