• Title/Summary/Keyword: Recognition and Detection

Search Result 2,258, Processing Time 0.031 seconds

LASPI: Hardware friendly LArge-scale stereo matching using Support Point Interpolation (LASPI: 지원점 보간법을 이용한 H/W 구현에 용이한 스테레오 매칭 방법)

  • Park, Sanghyun;Ghimire, Deepak;Kim, Jung-guk;Han, Youngki
    • Journal of KIISE
    • /
    • v.44 no.9
    • /
    • pp.932-945
    • /
    • 2017
  • In this paper, a new hardware and software architecture for a stereo vision processing system including rectification, disparity estimation, and visualization was developed. The developed method, named LArge scale stereo matching method using Support Point Interpolation (LASPI), shows excellence in real-time processing for obtaining dense disparity maps from high quality image regions that contain high density support points. In the real-time processing of high definition (HD) images, LASPI does not degrade the quality level of disparity maps compared to existing stereo-matching methods such as Efficient LArge-scale Stereo matching (ELAS). LASPI has been designed to meet a high frame-rate, accurate distance resolution performance, and a low resource usage even in a limited resource environment. These characteristics enable LASPI to be deployed to safety-critical applications such as an obstacle recognition system and distance detection system for autonomous vehicles. A Field Programmable Gate Array (FPGA) for the LASPI algorithm has been implemented in order to support parallel processing and 4-stage pipelining. From various experiments, it was verified that the developed FPGA system (Xilinx Virtex-7 FPGA, 148.5MHz Clock) is capable of processing 30 HD ($1280{\times}720pixels$) frames per second in real-time while it generates disparity maps that are applicable to real vehicles.

An Embedded FAST Hardware Accelerator for Image Feature Detection (영상 특징 추출을 위한 내장형 FAST 하드웨어 가속기)

  • Kim, Taek-Kyu
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.2
    • /
    • pp.28-34
    • /
    • 2012
  • Various feature extraction algorithms are widely applied to real-time image processing applications for extracting significant features from images. Feature extraction algorithms are mostly combined with image processing algorithms mostly for image tracking and recognition. Feature extraction function is used to supply feature information to the other image processing algorithms and it is mainly implemented in a preprocessing stage. Nowadays, image processing applications are faced with embedded system implementation for a real-time processing. In order to satisfy this requirement, it is necessary to reduce execution time so as to improve the performance. Reducing the time for executing a feature extraction function dose not only extend the execution time for the other image processing algorithms, but it also helps satisfy a real-time requirement. This paper explains FAST (Feature from Accelerated Segment Test algorithm) of E. Rosten and presents FPGA-based embedded hardware accelerator architecture. The proposed acceleration scheme can be implemented by using approximately 2,217 Flip Flops, 5,034 LUTs, 2,833 Slices, and 18 Block RAMs in the Xilinx Vertex IV FPGA. In the Modelsim - based simulation result, the proposed hardware accelerator takes 3.06 ms to extract 954 features from a image with $640{\times}480$ pixels and this result shows the cost effectiveness of the propose scheme.

RPCA-GMM for Speaker Identification (화자식별을 위한 강인한 주성분 분석 가우시안 혼합 모델)

  • 이윤정;서창우;강상기;이기용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.7
    • /
    • pp.519-527
    • /
    • 2003
  • Speech is much influenced by the existence of outliers which are introduced by such an unexpected happenings as additive background noise, change of speaker's utterance pattern and voice detection errors. These kinds of outliers may result in severe degradation of speaker recognition performance. In this paper, we proposed the GMM based on robust principal component analysis (RPCA-GMM) using M-estimation to solve the problems of both ouliers and high dimensionality of training feature vectors in speaker identification. Firstly, a new feature vector with reduced dimension is obtained by robust PCA obtained from M-estimation. The robust PCA transforms the original dimensional feature vector onto the reduced dimensional linear subspace that is spanned by the leading eigenvectors of the covariance matrix of feature vector. Secondly, the GMM with diagonal covariance matrix is obtained from these transformed feature vectors. We peformed speaker identification experiments to show the effectiveness of the proposed method. We compared the proposed method (RPCA-GMM) with transformed feature vectors to the PCA and the conventional GMM with diagonal matrix. Whenever the portion of outliers increases by every 2%, the proposed method maintains almost same speaker identification rate with 0.03% of little degradation, while the conventional GMM and the PCA shows much degradation of that by 0.65% and 0.55%, respectively This means that our method is more robust to the existence of outlier.

Individual Ortho-rectification of Coast Guard Aerial Images for Oil Spill Monitoring (유출유 모니터링을 위한 해경 항공 영상의 개별정사보정)

  • Oh, Youngon;Bui, An Ngoc;Choi, Kyoungah;Lee, Impyeong
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1479-1488
    • /
    • 2022
  • Accidents in which oil spills occur intermittently in the ocean due to ship collisions and sinkings. In order to prepare prompt countermeasures when such an accident occurs, it is necessary to accurately identify the current status of spilled oil. To this end, the Coast Guard patrols the target area with a fixed-wing airplane or helicopter and checks it with the naked eye or video, but it was difficult to determine the area contaminated by the spilled oil and its exact location on the map. Accordingly, this study develops a technology for direct ortho-rectification by automatically geo-referencing aerial images collected by the Coast Guard without individual ground reference points to identify the current status of spilled oil. First, meta information required for georeferencing is extracted from a visualized screen of sensor information such as video by optical character recognition (OCR). Based on the extracted information, the external orientation parameters of the image are determined. Images are individually orthorectified using the determined the external orientation parameters. The accuracy of individual orthoimages generated through this method was evaluated to be about tens of meters up to 100 m. The accuracy level was reasonably acceptable considering the inherent errors of the position and attitude sensors, the inaccuracies in the internal orientation parameters such as camera focal length, without using no ground control points. It is judged to be an appropriate level for identifying the current status of spilled oil contaminated areas in the sea. In the future, if real-time transmission of images captured during flight becomes possible, individual orthoimages can be generated in real time through the proposed individual orthorectification technology. Based on this, it can be effectively used to quickly identify the current status of spilled oil contamination and establish countermeasures.

A Comparison of Pre-Processing Techniques for Enhanced Identification of Paralichthys olivaceus Disease based on Deep Learning (딥러닝 기반 넙치 질병 식별 향상을 위한 전처리 기법 비교)

  • Kang, Ja Young;Son, Hyun Seung;Choi, Han Suk
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.3
    • /
    • pp.71-80
    • /
    • 2022
  • In the past, fish diseases were bacterial in aqua farms, but in recent years, the frequency of fish diseases has increased as they have become viral and mixed. Viral diseases in an enclosed space called a aqua farm have a high spread rate, so it is very likely to lead to mass death. Fast identification of fish diseases is important to prevent group death. However, diagnosis of fish diseases requires a high level of expertise and it is difficult to visually check the condition of fish every time. In order to prevent the spread of the disease, an automatic identification system of diseases or fish is needed. In this paper, in order to improve the performance of the disease identification system of Paralichthys olivaceus based on deep learning, the existing pre-processing method is compared and tested. Target diseases were selected from three most frequent diseases such as Scutica, Vibrio, and Lymphocystis in Paralichthys olivaceus. The RGB, HLS, HSV, LAB, LUV, XYZ, and YCRCV were used as image pre-processing methods. As a result of the experiment, HLS was able to get the best results than using general RGB. It is expected that the fish disease identification system can be advanced by improving the recognition rate of diseases in a simple way.

Proposal for Ignition Source and Flammable Material Safety Management through 3D Modeling of Hazardous Area: Focus on Indoor Mixing Processes (폭발위험장소 구분도의 3D Modeling을 통한 점화원 및 가연물 안전관리 방안 제안: 실내 혼합공정을 중심으로)

  • Hak-Jae Kim;Duk-Han Kim;Young-Woo Chon
    • Journal of the Society of Disaster Information
    • /
    • v.20 no.1
    • /
    • pp.47-59
    • /
    • 2024
  • Purpose: This study aims to propose measures for the prevention of fire and explosion accidents within manufacturing facilities by improving the existing classification criteria for hazardous locations based on the leakage patterns of flammable liquids. The objective is to suggest ways to safely manage ignition sources and combustible materials. Method: The hazardous locations were calculated using "KS C IEC 60079-10-1," and the calculated explosion hazard distances were visualized in 3D. Additionally, the formula for the atmospheric dispersion of flammable vapors, as outlined in "P-91-2023," was utilized to calculate the dispersion rates within the hazardous locations represented in 3D. Result: Visualization of hazardous locations in 3D enabled the identification of blind spots in the floor plan, facilitating immediate recognition of ignition sources within these areas. Furthermore, when calculating the time taken for the Lower Explosive Limit (LEL) to reach within the volumetric space of the hazardous locations represented in 3D, it was found that the risk level did not correspond identically with the explosion hazard distances. Conclusion: Considering the atmospheric dispersion of flammable liquids, it was concluded that safety management should be conducted. Therefore, a method for calculating the concentration values requiring detection and alert based on realistically achievable ventilation rates within the facility is proposed.

The Study about Role and Importance of Site Activity Stage in Safety Activity for the International Conference among Several Countries (다자간 국제회의 안전활동에 있어서 현장활동단계의 역할 및 중요성에 관한 연구 : 부산 APEC 행사를 중심으로)

  • Lee, Sun-Ki
    • Korean Security Journal
    • /
    • no.19
    • /
    • pp.105-138
    • /
    • 2009
  • This study's purpose is to present the improvement of effectiveness of security activity for international conference among Several Countries which can be held hereafter. On the basis of security activity problems originating in APEC that had been held in Busan in 2005. I made up questions three times to on the members of the police, military, fire figher and national intelligence service who had experienced in Busan APEC and recognition of possible problem and possibility of improvement on each item of questions was analyzed by Delphi Method. Also interviews with 4 security experts selected from each security agency were conducted to present improvement in each part of problem. The results obtained from the face to face interview with four experts of security-enforcement agency about the role and importance of site activity stage for international conference among several countries are as followings ; First, the system that experienced security-enforcement agents can be selected for the next national security event is needed, by data-basing the security-enforcement agents who were experienced in security event as man power management. Second, the middle-term plan for the introduction of high-tech equipment and joint inspection with relevant security agents are needed for the efficient explosive technical detection. Third, high-tech security equipment could be introduced through the international high-tech security equipment exhibition. Fourth, an anti-terrorism plan should be measured by sharing information through the cooperation with domestic and international intelligence agency. Fifth, public relations should be measured systematically by organization rather than agents' individual public relations. Sixth, political consideration to secure integrative coordination with other agency is needed for security activity, through normal cooperation with fire fighting related agency such as an electric, gas, elevator company. Seventh, a definite press guideline is needed for a convenient news coverage and safety during security event.

  • PDF

Application of Texture Features algorithm using Computer Aided Diagnosis of Papillary Thyroid Cancer in the Ultrasonography (초음파영상에서 갑상선 결절의 컴퓨터자동진단을 위한 Texture Features 알고리즘 응용)

  • Ko, Seong-Jin;Lee, Jin-Soo;Ye, Soo-Young;Kim, Changsoo
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.5
    • /
    • pp.303-310
    • /
    • 2013
  • Thyroid nodular disease is the most frequently appeared in thyroid disease. Thyroid ultrasonography offers location of nodules, size, the number, information of internal echo characteristic. Thus, it makes possible to sort high-risk nodule containing high possibility about thyroid cancer and to induct precisely when take a Fine Needle Biopsy Aspiration. On thyroid nodule, the case which is diagnosed as malignant is less than 5% but screening test is very important on ultrasound and also must be reduced unnecessary procedure. Therefore, in this study an approach for describing a region is to quantity its texture content. We applied TFA algorithm on case which has been pathologically diagnosed as papillary thyroid cancer. we obtained experiment image which set the ROI on ultrasound and cut the $50{\times}50$ pixel size, histogram equalization. Consequently, Disease recognition detection efficiency of GLavg, SKEW, UN, ENT parameter were high as 91~100%. It is suggestion about possibility on CAD which distinguishes thyroid nodule. In addition, it will be helpful to differential diagnosis of thyroid nodule. If the study on additional parameter algorithm is continuously progressed from now on, it is able to arrange practical base on CAD and it is possible to apply various disease in the thyroid US.

Image Analysis of Computer Aided Diagnosis using Gray Level Co-occurrence Matrix in the Ultrasonography for Benign Prostate Hyperplasia (전립선비대증 초음파 영상에서 GLCM을 이용한 컴퓨터보조진단의 영상분석)

  • Cho, Jin-Young;Kim, Chang-Soo;Kang, Se-Sik;Ko, Seong-Jin;Ye, Soo-Young
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.3
    • /
    • pp.184-191
    • /
    • 2015
  • Prostate ultrasound is used to diagnose prostate cancer, BPH, prostatitis and biopsy of prostate cancer to determine the size of prostate. BPH is one of the common disease in elderly men. Prostate is divided into 4 blocks, peripheral zone, central zone, transition zone, anterior fibromuscular stroma. BPH is histologically transition zone urethra accompanying excessive nodular hyperplasia causes a lower urinary tract symptoms(LUTS) caused by urethral closure as causing the hyperplastic nodule characterized finding progressive ambient. Therefore, in this study normal transition zone image for hyperplasia prostate and normal transition zone image is analyzed quantitatively using a computer algorithm. We applied texture features of GLCM to set normal tissue 60 cases and BPH tissue 60cases setting analysis area $50{\times}50pixels$ which was analyzed by comparing the six parameters for each partial image. Consequently, Disease recognition detection efficiency of Autocorrelation, Cluster prominence, entropy, Sum average, parameter were high as 92~98%.This could be confirmed by quantitative image analysis to nodular hyperplasia change transition zone of the prostate. This is expected secondary means to diagnose BPH and the data base will be considered in various prostate examination.

A Study on the DID based Smart Remocon and FIDO Transaction Certification for Home-shopping (DID 기반의 스마트 리모콘과 홈쇼핑 FIDO 거래인증 연구)

  • Yeo, Hyupgoo;Kang, Mingoo;Sonh, Seungil
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.60-66
    • /
    • 2020
  • In this paper, the FIDO (Fast IDentity Online) transaction certification platform was proposed for applying the DID (Decentralized ID) of blockchain with home shopping channels to the IPTV service providers based on the Remocon (Remote Control). In this case, the DID based smart remocon applies biometric identification techniques for personal identification. These individual DID smart remote controls apply distributed ID blockchain, enabling home shopping viewers to conduct reliable ratings surveys through the detection of channel changed information. In addition, this smart remocon utilizes the product purchased information history on home shopping channels, allowing IPTV's home shopping viewers to compare the same broadcasted production information on all channels by blockchain technique and their production characteristics. IPTV service providers can process home shopping order/authorization informations in one-stop service via a number of home shopping broadcasting companies, and DID smart remote controls for home shopping viewers with the checking results of their real-time online access to confirm the FIDO2.0 transaction certification homepage. Thus, the FIDO transaction authentication platforms of IPTV service provider(Telecommunication company) can be expected to improve the benefits of home shopping customers, and to reduce the broadcasting companies' burden of payment, too.