Search | Korea Science

Point-level deep learning approach for 3D acoustic source localization

Lee, Soo Young;Chang, Jiho;Lee, Seungchul
- Smart Structures and Systems
- /
- v.29 no.6
- /
- pp.777-783
- /
- 2022
Even though several deep learning-based methods have been applied in the field of acoustic source localization, the previous works have only been conducted using the two-dimensional representation of the beamforming maps, particularly with the planar array system. While the acoustic sources are more required to be localized in a spherical microphone array system considering that we live and hear in the 3D world, the conventional 2D equirectangular map of the spherical beamforming map is highly vulnerable to the distortion that occurs when the 3D map is projected to the 2D space. In this study, a 3D deep learning approach is proposed to fulfill accurate source localization via distortion-free 3D representation. A target function is first proposed to obtain 3D source distribution maps that can represent multiple sources' positional and strength information. While the proposed target map expands the source localization task into a point-wise prediction task, a PointNet-based deep neural network is developed to precisely estimate the multiple sources' positions and strength information. While the proposed model's localization performance is evaluated, it is shown that the proposed method can achieve improved localization results from both quantitative and qualitative perspectives.
https://doi.org/10.12989/sss.2022.29.6.777 인용 KSCI

Leveraging Deep Learning and Farmland Fertility Algorithm for Automated Rice Pest Detection and Classification Model

Hussain. A;Balaji Srikaanth. P
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.18 no.4
- /
- pp.959-979
- /
- 2024
Rice pest identification is essential in modern agriculture for the health of rice crops. As global rice consumption rises, yields and quality must be maintained. Various methodologies were employed to identify pests, encompassing sensor-based technologies, deep learning, and remote sensing models. Visual inspection by professionals and farmers remains essential, but integrating technology such as satellites, IoT-based sensors, and drones enhances efficiency and accuracy. A computer vision system processes images to detect pests automatically. It gives real-time data for proactive and targeted pest management. With this motive in mind, this research provides a novel farmland fertility algorithm with a deep learning-based automated rice pest detection and classification (FFADL-ARPDC) technique. The FFADL-ARPDC approach classifies rice pests from rice plant images. Before processing, FFADL-ARPDC removes noise and enhances contrast using bilateral filtering (BF). Additionally, rice crop images are processed using the NASNetLarge deep learning architecture to extract image features. The FFA is used for hyperparameter tweaking to optimise the model performance of the NASNetLarge, which aids in enhancing classification performance. Using an Elman recurrent neural network (ERNN), the model accurately categorises 14 types of pests. The FFADL-ARPDC approach is thoroughly evaluated using a benchmark dataset available in the public repository. With an accuracy of 97.58, the FFADL-ARPDC model exceeds existing pest detection methods.
https://doi.org/10.3837/tiis.2024.04.008 인용 PDF HTML

Strength assessment of RC deep beams and corbels

Adrija, D.;Geevar, Indu;Menon, Devdas;Prasad, Meher
- Structural Engineering and Mechanics
- /
- v.77 no.2
- /
- pp.273-291
- /
- 2021
The strut-and-tie method (STM) has been widely accepted and used as a rational approach for the design of disturbed regions ('D' regions) of reinforced concrete members such as in corbels and deep beams, where traditional flexure theory does not apply. This paper evaluates the applicability of the equilibrium based STM in strength predictions of deep beams (with rectangular and circular cross-section) and corbels using the available experiments in literature. STM is found to give fairly good results for corbel and deep beams. The failure modes of these deep members are also studied, and an optimum amount of distribution reinforcement is suggested to eliminate the premature diagonal splitting failure. A comparison with existing empirical and semi empirical methods also show that STM gives more reliable results. The nonlinear finite element analysis (NLFEA) of 50 deep beams and 20 corbels could capture the complete behaviour of deep members including crack pattern, failure load and failure load accurately.
https://doi.org/10.12989/sem.2021.77.2.273 인용 KSCI

AraProdMatch: A Machine Learning Approach for Product Matching in E-Commerce

Alabdullatif, Aisha;Aloud, Monira
- International Journal of Computer Science & Network Security
- /
- v.21 no.4
- /
- pp.214-222
- /
- 2021
Recently, the growth of e-commerce in Saudi Arabia has been exponential, bringing new remarkable challenges. A naive approach for product matching and categorization is needed to help consumers choose the right store to purchase a product. This paper presents a machine learning approach for product matching that combines deep learning techniques with standard artificial neural networks (ANNs). Existing methods focused on product matching, whereas our model compares products based on unstructured descriptions. We evaluated our electronics dataset model from three business-to-consumer (B2C) online stores by putting the match products collectively in one dataset. The performance evaluation based on k-mean classifier prediction from three real-world online stores demonstrates that the proposed algorithm outperforms the benchmarked approach by 80% on average F1-measure.
https://doi.org/10.22937/IJCSNS.2021.21.4.26 인용 PDF KSCI

Music Composition with Collaboratory AI Composers

Kim, Haekwang;You, Younghwan
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2021.06a
- /
- pp.23-25
- /
- 2021
This paper describes an approach of composing music with multiple AI composers. This approach enriches more the creativity space of artificial intelligence music composition than using only one composer. This paper presents a simple example with 2 different deep learning composers working together for composing one music. For the experiment, the two composers adopt the same deep learning architecture of an LSTM model trained with different data. The output of a composer is a sequence of notes. Each composer alternatively appends its output to the resulting music which is input to both the composers. Experiments compare different music generated by the proposed multiple composer approach with the traditional one composer approach.
PDF

Deep Learning Network Approach for Pain Recognition Using Physiological Signals (생리적 신호를 이용한 통증 인식을 위한 딥 러닝 네트워크)

Phan, Kim Ngan;Lee, Guee-Sang;Yang, Hyung-Jeong;Kim, Soo-Hyung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.11a
- /
- pp.1001-1004
- /
- 2021
Pain is an unpleasant experience for the patient. The recognition and assessment of pain help tailor the treatment to the patient, and they are also challenging in the medical. In this paper, we propose an approach for pain recognition through a deep neural network applied to pre-processed physiological. The proposed approach applies the idea of shortcut connections to concatenate the spatial information of a convolutional neural network and the temporal information of a recurrent neural network. In addition, our proposed approach applies the attention mechanism and achieves competitive performance on the BioVid Heat Pain dataset.
https://doi.org/10.3745/PKIPS.y2021m11a.1001 인용 PDF

Group-based speaker embeddings for text-independent speaker verification (문장 독립 화자 검증을 위한 그룹기반 화자 임베딩)

Jung, Youngmoon;Eom, Youngsik;Lee, Yeonghyeon;Kim, Hoirin
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.5
- /
- pp.496-502
- /
- 2021
Recently, deep speaker embedding approach has been widely used in text-independent speaker verification, which shows better performance than the traditional i-vector approach. In this work, to improve the deep speaker embedding approach, we propose a novel method called group-based speaker embedding which incorporates group information. We cluster all speakers of the training data into a predefined number of groups in an unsupervised manner, so that a fixed-length group embedding represents the corresponding group. A Group Decision Network (GDN) produces a group weight, and an aggregated group embedding is generated from the weighted sum of the group embeddings and the group weights. Finally, we generate a group-based embedding by adding the aggregated group embedding to the deep speaker embedding. In this way, a speaker embedding can reduce the search space of the speaker identity by incorporating group information, and thereby can flexibly represent a significant number of speakers. We conducted experiments using the VoxCeleb1 database to show that our proposed approach can improve the previous approaches.
https://doi.org/10.7776/ASK.2021.40.5.496 인용 PDF KSCI

Analysis of Approachs to Learning Based on Student-Student Verbal Interactions according to the Type of Inquiry Experiments Using Everyday Materials (실생활 소재 탐구 실험 형태에 따른 학생-학생 언어적 상호작용에서의 학습 접근 수준 분석)

Kim, Hye-Sim;Lee, Eun-Kyeong;Kang, Seong-Joo
- Journal of The Korean Association For Science Education
- /
- v.26 no.1
- /
- pp.16-24
- /
- 2006
The purpose of this study was to compare student-student verbal interaction from two type's experiments; problem-solving and task-solving. For this study, five 3rd grade middle school students were selected and their verbal interactions recorded via voice and video; and later transcribed. The student-student verbal interactions were classified as questions, explanations, thoughts, or metacognition fields, which were separated into deep versus surface learning approaches. For the problem-solving experiment, findings revealed that the number of verbal interactions is more than doubled and in particular, the number of verbal interactions using deep-approach is more than quadrupled from the point of problem-recognition to problem-solution. As for the task-solving experiment, findings showed that verbal interactions remained evenly distributed throughout the entire experiment. Finally, it was also discovered that students relied upon a more deep learning approach during the problem-solving experiment than the task-solving experiment.
https://doi.org/10.14697/jkase.2006.26.1.16 인용 PDF KSCI

Fuzzy modelling approach for shear strength prediction of RC deep beams

Mohammadhassani, Mohammad;Saleh, Aidi MD.;Suhatril, M;Safa, M.
- Smart Structures and Systems
- /
- v.16 no.3
- /
- pp.497-519
- /
- 2015
This study discusses the use of Adaptive-Network-Based-Fuzzy-Inference-System (ANFIS) in predicting the shear strength of reinforced-concrete deep beams. 139 experimental data have been collected from renowned publications on simply supported high strength concrete deep beams. The results show that the ANFIS has strong potential as a feasible tool for predicting the shear strength of deep beams within the range of the considered input parameters. ANFIS's results are highly accurate, precise and therefore, more satisfactory. Based on the Sensitivity analysis, the shear span to depth ratio (a/d) and concrete cylinder strength ($f_c^{\prime}$) have major influence on the shear strength prediction of deep beams. The parametric study confirms the increase in shear strength of deep beams with an equal increase in the concrete strength and decrease in the shear span to-depth-ratio.
https://doi.org/10.12989/sss.2015.16.3.497 인용 KSCI

Deep-Learning Approach for Text Detection Using Fully Convolutional Networks

Tung, Trieu Son;Lee, Gueesang
- International Journal of Contents
- /
- v.14 no.1
- /
- pp.1-6
- /
- 2018
Text, as one of the most influential inventions of humanity, has played an important role in human life since ancient times. The rich and precise information embodied in text is very useful in a wide range of vision-based applications such as the text data extracted from images that can provide information for automatic annotation, indexing, language translation, and the assistance systems for impaired persons. Therefore, natural-scene text detection with active research topics regarding computer vision and document analysis is very important. Previous methods have poor performances due to numerous false-positive and true-negative regions. In this paper, a fully-convolutional-network (FCN)-based method that uses supervised architecture is used to localize textual regions. The model was trained directly using images wherein pixel values were used as inputs and binary ground truth was used as label. The method was evaluated using ICDAR-2013 dataset and proved to be comparable to other feature-based methods. It could expedite research on text detection using deep-learning based approach in the future.
https://doi.org/10.5392/IJoC.2018.14.1.001 인용 PDF KSCI HTML

Search Result 1,051, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)