Search | Korea Science

Estimation of fruit number of apple tree based on YOLOv5 and regression model (YOLOv5 및 다항 회귀 모델을 활용한 사과나무의 착과량 예측 방법)

Hee-Jin Gwak;Yunju Jeong;Ik-Jo Chun;Cheol-Hee Lee
- Journal of IKEEE
- /
- v.28 no.2
- /
- pp.150-157
- /
- 2024
In this paper, we propose a novel algorithm for predicting the number of apples on an apple tree using a deep learning-based object detection model and a polynomial regression model. Measuring the number of apples on an apple tree can be used to predict apple yield and to assess losses for determining agricultural disaster insurance payouts. To measure apple fruit load, we photographed the front and back sides of apple trees. We manually labeled the apples in the captured images to construct a dataset, which was then used to train a one-stage object detection CNN model. However, when apples on an apple tree are obscured by leaves, branches, or other parts of the tree, they may not be captured in images. Consequently, it becomes difficult for image recognition-based deep learning models to detect or infer the presence of these apples. To address this issue, we propose a two-stage inference process. In the first stage, we utilize an image-based deep learning model to count the number of apples in photos taken from both sides of the apple tree. In the second stage, we conduct a polynomial regression analysis, using the total apple count from the deep learning model as the independent variable, and the actual number of apples manually counted during an on-site visit to the orchard as the dependent variable. The performance evaluation of the two-stage inference system proposed in this paper showed an average accuracy of 90.98% in counting the number of apples on each apple tree. Therefore, the proposed method can significantly reduce the time and cost associated with manually counting apples. Furthermore, this approach has the potential to be widely adopted as a new foundational technology for fruit load estimation in related fields using deep learning.
https://doi.org/10.7471/ikeee.2024.28.2.150 인용 PDF

Development of deep learning network based low-quality image enhancement techniques for improving foreign object detection performance (이물 객체 탐지 성능 개선을 위한 딥러닝 네트워크 기반 저품질 영상 개선 기법 개발)

Ki-Yeol Eom;Byeong-Seok Min
- Journal of Internet Computing and Services
- /
- v.25 no.1
- /
- pp.99-107
- /
- 2024
Along with economic growth and industrial development, there is an increasing demand for various electronic components and device production of semiconductor, SMT component, and electrical battery products. However, these products may contain foreign substances coming from manufacturing process such as iron, aluminum, plastic and so on, which could lead to serious problems or malfunctioning of the product, and fire on the electric vehicle. To solve these problems, it is necessary to determine whether there are foreign materials inside the product, and may tests have been done by means of non-destructive testing methodology such as ultrasound ot X-ray. Nevertheless, there are technical challenges and limitation in acquiring X-ray images and determining the presence of foreign materials. In particular Small-sized or low-density foreign materials may not be visible even when X-ray equipment is used, and noise can also make it difficult to detect foreign objects. Moreover, in order to meet the manufacturing speed requirement, the x-ray acquisition time should be reduced, which can result in the very low signal- to-noise ratio(SNR) lowering the foreign material detection accuracy. Therefore, in this paper, we propose a five-step approach to overcome the limitations of low resolution, which make it challenging to detect foreign substances. Firstly, global contrast of X-ray images are increased through histogram stretching methodology. Second, to strengthen the high frequency signal and local contrast, we applied local contrast enhancement technique. Third, to improve the edge clearness, Unsharp masking is applied to enhance edges, making objects more visible. Forth, the super-resolution method of the Residual Dense Block (RDB) is used for noise reduction and image enhancement. Last, the Yolov5 algorithm is employed to train and detect foreign objects after learning. Using the proposed method in this study, experimental results show an improvement of more than 10% in performance metrics such as precision compared to low-density images.
https://doi.org/10.7472/jksii.2024.25.1.99 인용 PDF HTML

Restoring Omitted Sentence Constituents in Encyclopedia Documents Using Structural SVM (Structural SVM을 이용한 백과사전 문서 내 생략 문장성분 복원)

Hwang, Min-Kook;Kim, Youngtae;Ra, Dongyul;Lim, Soojong;Kim, Hyunki
- Journal of Intelligence and Information Systems
- /
- v.21 no.2
- /
- pp.131-150
- /
- 2015
Omission of noun phrases for obligatory cases is a common phenomenon in sentences of Korean and Japanese, which is not observed in English. When an argument of a predicate can be filled with a noun phrase co-referential with the title, the argument is more easily omitted in Encyclopedia texts. The omitted noun phrase is called a zero anaphor or zero pronoun. Encyclopedias like Wikipedia are major source for information extraction by intelligent application systems such as information retrieval and question answering systems. However, omission of noun phrases makes the quality of information extraction poor. This paper deals with the problem of developing a system that can restore omitted noun phrases in encyclopedia documents. The problem that our system deals with is almost similar to zero anaphora resolution which is one of the important problems in natural language processing. A noun phrase existing in the text that can be used for restoration is called an antecedent. An antecedent must be co-referential with the zero anaphor. While the candidates for the antecedent are only noun phrases in the same text in case of zero anaphora resolution, the title is also a candidate in our problem. In our system, the first stage is in charge of detecting the zero anaphor. In the second stage, antecedent search is carried out by considering the candidates. If antecedent search fails, an attempt made, in the third stage, to use the title as the antecedent. The main characteristic of our system is to make use of a structural SVM for finding the antecedent. The noun phrases in the text that appear before the position of zero anaphor comprise the search space. The main technique used in the methods proposed in previous research works is to perform binary classification for all the noun phrases in the search space. The noun phrase classified to be an antecedent with highest confidence is selected as the antecedent. However, we propose in this paper that antecedent search is viewed as the problem of assigning the antecedent indicator labels to a sequence of noun phrases. In other words, sequence labeling is employed in antecedent search in the text. We are the first to suggest this idea. To perform sequence labeling, we suggest to use a structural SVM which receives a sequence of noun phrases as input and returns the sequence of labels as output. An output label takes one of two values: one indicating that the corresponding noun phrase is the antecedent and the other indicating that it is not. The structural SVM we used is based on the modified Pegasos algorithm which exploits a subgradient descent methodology used for optimization problems. To train and test our system we selected a set of Wikipedia texts and constructed the annotated corpus in which gold-standard answers are provided such as zero anaphors and their possible antecedents. Training examples are prepared using the annotated corpus and used to train the SVMs and test the system. For zero anaphor detection, sentences are parsed by a syntactic analyzer and subject or object cases omitted are identified. Thus performance of our system is dependent on that of the syntactic analyzer, which is a limitation of our system. When an antecedent is not found in the text, our system tries to use the title to restore the zero anaphor. This is based on binary classification using the regular SVM. The experiment showed that our system's performance is F1 = 68.58%. This means that state-of-the-art system can be developed with our technique. It is expected that future work that enables the system to utilize semantic information can lead to a significant performance improvement.
https://doi.org/10.13088/jiis.2015.21.2.131 인용 PDF KSCI

Recognition method using stereo images-based 3D information for improvement of face recognition (얼굴인식의 향상을 위한 스테레오 영상기반의 3차원 정보를 이용한 인식)

Park Chang-Han;Paik Joon-Ki
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.43 no.3 s.309
- /
- pp.30-38
- /
- 2006
In this paper, we improved to drops recognition rate according to distance using distance and depth information with 3D from stereo face images. A monocular face image has problem to drops recognition rate by uncertainty information such as distance of an object, size, moving, rotation, and depth. Also, if image information was not acquired such as rotation, illumination, and pose change for recognition, it has a very many fault. So, we wish to solve such problem. Proposed method consists of an eyes detection algorithm, analysis a pose of face, md principal component analysis (PCA). We also convert the YCbCr space from the RGB for detect with fast face in a limited region. We create multi-layered relative intensity map in face candidate region and decide whether it is face from facial geometry. It can acquire the depth information of distance, eyes, and mouth in stereo face images. Proposed method detects face according to scale, moving, and rotation by using distance and depth. We train by using PCA the detected left face and estimated direction difference. Simulation results with face recognition rate of 95.83% (100cm) in the front and 98.3% with the pose change were obtained successfully. Therefore, proposed method can be used to obtain high recognition rate with an appropriate scaling and pose change according to the distance.
PDF KSCI

A Study on SCOTT Transformer Protection Relay Malfunction Case and Improvement Methodology (스코트 변압기 보호계전기 오동작 사례분석 및 개선방안 고찰)

Lee, Jong-Hwa;Lho, Young-Hwan
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.18 no.7
- /
- pp.394-399
- /
- 2017
In Korean AC power railway substations, SCOTT winding transformers are under operation to have a single phase power supply together with a phase angle of $90^{\circ}$ on the secondary side of the main transformer. In the case of an internal fault of the transformer, the transformer protection relay should be cut off on the primary side, the transformer should be inoperative to the external fault of the transformer or to the normal train operation. Reducing the malfunction of the relay through an exact fault determination is very important for securing a stable power system and improving its reliability. The main transformers are protected using Buchholtz's relay and a differential relay as the internal fault detection devices, but there are some cases of the main transformer operation under the deactivation of this protection function due to a malfunction of the differential relay. In this paper, the characteristics of the SCOTT transformer and differential relay as well as the malfunctioning of the protection relays are presented. The modeling of the SCOTT transformer protection relay was accomplished by the power system analysis program and the Comtrade file from 'A substation', which was used as the input data for the fault wave, and the harmonics were analyzed to determine if the relay operates or not. In addition, an improvement plan for malfunctioning cases through wave form analysis is suggested.
https://doi.org/10.5762/KAIS.2017.18.7.394 인용 PDF KSCI

Detecting Errors in POS-Tagged Corpus on XGBoost and Cross Validation (XGBoost와 교차검증을 이용한 품사부착말뭉치에서의 오류 탐지)

Choi, Min-Seok;Kim, Chang-Hyun;Park, Ho-Min;Cheon, Min-Ah;Yoon, Ho;Namgoong, Young;Kim, Jae-Kyun;Kim, Jae-Hoon
- KIPS Transactions on Software and Data Engineering
- /
- v.9 no.7
- /
- pp.221-228
- /
- 2020
Part-of-Speech (POS) tagged corpus is a collection of electronic text in which each word is annotated with a tag as the corresponding POS and is widely used for various training data for natural language processing. The training data generally assumes that there are no errors, but in reality they include various types of errors, which cause performance degradation of systems trained using the data. To alleviate this problem, we propose a novel method for detecting errors in the existing POS tagged corpus using the classifier of XGBoost and cross-validation as evaluation techniques. We first train a classifier of a POS tagger using the POS-tagged corpus with some errors and then detect errors from the POS-tagged corpus using cross-validation, but the classifier cannot detect errors because there is no training data for detecting POS tagged errors. We thus detect errors by comparing the outputs (probabilities of POS) of the classifier, adjusting hyperparameters. The hyperparameters is estimated by a small scale error-tagged corpus, in which text is sampled from a POS-tagged corpus and which is marked up POS errors by experts. In this paper, we use recall and precision as evaluation metrics which are widely used in information retrieval. We have shown that the proposed method is valid by comparing two distributions of the sample (the error-tagged corpus) and the population (the POS-tagged corpus) because all detected errors cannot be checked. In the near future, we will apply the proposed method to a dependency tree-tagged corpus and a semantic role tagged corpus.
https://doi.org/10.3745/KTSDE.2020.9.7.221 인용 PDF KSCI

Lightening of Human Pose Estimation Algorithm Using MobileViT and Transfer Learning

Kunwoo Kim;Jonghyun Hong;Jonghyuk Park
- Journal of the Korea Society of Computer and Information
- /
- v.28 no.9
- /
- pp.17-25
- /
- 2023
In this paper, we propose a model that can perform human pose estimation through a MobileViT-based model with fewer parameters and faster estimation. The based model demonstrates lightweight performance through a structure that combines features of convolutional neural networks with features of Vision Transformer. Transformer, which is a major mechanism in this study, has become more influential as its based models perform better than convolutional neural network-based models in the field of computer vision. Similarly, in the field of human pose estimation, Vision Transformer-based ViTPose maintains the best performance in all human pose estimation benchmarks such as COCO, OCHuman, and MPII. However, because Vision Transformer has a heavy model structure with a large number of parameters and requires a relatively large amount of computation, it costs users a lot to train the model. Accordingly, the based model overcame the insufficient Inductive Bias calculation problem, which requires a large amount of computation by Vision Transformer, with Local Representation through a convolutional neural network structure. Finally, the proposed model obtained a mean average precision of 0.694 on the MS COCO benchmark with 3.28 GFLOPs and 9.72 million parameters, which are 1/5 and 1/9 the number compared to ViTPose, respectively.
https://doi.org/10.9708/jksci.2023.28.09.017 인용 PDF HTML

Calculation method and application of natural frequency of integrated model considering track-beam-bearing-pier-pile cap-soil

Yulin Feng;Yaoyao Meng;Wenjie Guo;Lizhong Jiang;Wangbao Zhou
- Steel and Composite Structures
- /
- v.49 no.1
- /
- pp.81-89
- /
- 2023
A simplified calculation method of natural vibration characteristics of high-speed railway multi-span bridge-longitudinal ballastless track system is proposed. The rail, track slab, base slab, main beam, bearing, pier, cap and pile foundation are taken into account, and the multi-span longitudinal ballastless track-beam-bearing-pier-cap-pile foundation integrated model (MBTIM) is established. The energy equation of each component of the MBTIM based on Timoshenko beam theory is constructed. Using the improved Fourier series, and the Rayleigh-Ritz method and Hamilton principle are combined to obtain the extremum of the total energy function. The simplified calculation formula of the natural vibration frequency of the MBTIM under the influence of vertical and longitudinal vibration is derived and verified by numerical methods. The influence law of the natural vibration frequency of the MBTIM is analyzed considering and not considering the participation of each component of the MBTIM, the damage of the track interlayer component and the stiffness change of each layer component. The results show that the error between the calculation results of the formula and the numerical method in this paper is less than 3%, which verifies the correctness of the method in this paper. The high-order frequency of the MBTIM is significantly affected considering the track, bridge pier, pile soil and pile cap, while considering the influence of pile cap on the low-order and high-order frequency of the MBTIM is large. The influence of component damage such as void beneath slab, mortar debonding and fastener failure on each order frequency of the MBTIM is basically the same, and the influence of component damage less than 10m on the first fourteen order frequency of the MBTIM is small. The bending stiffness of track slab and rail has no obvious influence on the natural frequency of the MBTIM, and the bending stiffness of main beam has influence on the natural frequency of the MBTIM. The bending stiffness of pier and base slab only has obvious influence on the high-order frequency of the MBTIM. The natural vibration characteristics of the MBTIM play an important guiding role in the safety analysis of high-speed train running, the damage detection of track-bridge structure and the seismic design of railway bridge.
https://doi.org/10.12989/scs.2023.49.1.081 인용

Optimization-based Deep Learning Model to Localize L3 Slice in Whole Body Computerized Tomography Images (컴퓨터 단층촬영 영상에서 3번 요추부 슬라이스 검출을 위한 최적화 기반 딥러닝 모델)

Seongwon Chae;Jae-Hyun Jo;Ye-Eun Park;Jin-Hyoung, Jeong;Sung Jin Kim;Ahnryul Choi
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.16 no.5
- /
- pp.331-337
- /
- 2023
In this paper, we propose a deep learning model to detect lumbar 3 (L3) CT images to determine the occurrence and degree of sarcopenia. In addition, we would like to propose an optimization technique that uses oversampling ratio and class weight as design parameters to address the problem of performance degradation due to data imbalance between L3 level and non-L3 level portions of CT data. In order to train and test the model, a total of 150 whole-body CT images of 104 prostate cancer patients and 46 bladder cancer patients who visited Gangneung Asan Medical Center were used. The deep learning model used ResNet50, and the design parameters of the optimization technique were selected as six types of model hyperparameters, data augmentation ratio, and class weight. It was confirmed that the proposed optimization-based L3 level extraction model reduced the median L3 error by about 1.0 slices compared to the control model (a model that optimized only 5 types of hyperparameters). Through the results of this study, accurate L3 slice detection was possible, and additionally, we were able to present the possibility of effectively solving the data imbalance problem through oversampling through data augmentation and class weight adjustment.
https://doi.org/10.17661/jkiiect.2023.16.5.331 인용 PDF HTML

Detection Fastener Defect using Semi Supervised Learning and Transfer Learning (준지도 학습과 전이 학습을 이용한 선로 체결 장치 결함 검출)

Sangmin Lee;Seokmin Han
- Journal of Internet Computing and Services
- /
- v.24 no.6
- /
- pp.91-98
- /
- 2023
Recently, according to development of artificial intelligence, a wide range of industry being automatic and optimized. Also we can find out some research of using supervised learning for deteceting defect of railway in domestic rail industry. However, there are structures other than rails on the track, and the fastener is a device that binds the rail to other structures, and periodic inspections are required to prevent safety accidents. In this paper, we present a method of reducing cost for labeling using semi-supervised and transfer model trained on rail fastener data. We use Resnet50 as the backbone network pretrained on ImageNet. At first we randomly take training data from unlabeled data and then labeled that data to train model. After predict unlabeled data by trained model, we adopted a method of adding the data with the highest probability for each class to the training data by a predetermined size. Futhermore, we also conducted some experiments to investigate the influence of the number of initially labeled data. As a result of the experiment, model reaches 92% accuracy which has a performance difference of around 5% compared to supervised learning. This is expected to improve the performance of the classifier by using relatively few labels without additional labeling processes through the proposed method.
https://doi.org/10.7472/jksii.2023.24.6.91 인용 PDF HTML

Search Result 383, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)