Integrated Search | Korea Science

Intelligent and Robust Face Detection

Park, Min-sick;Park, Chang-woo;Kim, Won-ha;Park, Mignon
- 한국지능시스템학회논문지 / Vol. 11, No. 7 / pp. 641-648 / 2001
Face detection in color images is important for many multimedia applications. It is the first step in face recognition and can be used for classifying specific shots. This paper describes a new method to detect faces in color images based on skin color and hair color. It presents a fuzzy-based method for classifying skin color regions in a complex background under varying illumination. The rule bases of the fuzzy system are generated using a training method such as a genetic algorithm (GA). We find the skin color region and the hair color region using the fuzzy system, apply the convex hull to each region, and find the face from their intersection relationship. To validate the effectiveness of the proposed method, we conduct experiments on various cases.
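The convex-hull step in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the skin and hair regions are already available as lists of pixel coordinates (the fuzzy classifier is not reproduced), and it interprets the "intersection relationship" as a plain overlap test between the two hulls, which is an assumption. The function names `convex_hull` and `hulls_overlap` are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of the cross product (a - o) x (b - o)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points: List[Point]) -> List[Point]:
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # Drop the last point of each chain because it repeats the other chain's first.
    return lower[:-1] + upper[:-1]


def hulls_overlap(hull_a: List[Point], hull_b: List[Point]) -> bool:
    """Separating-axis test for two convex polygons (assumes >= 3 vertices each)."""
    for poly in (hull_a, hull_b):
        n = len(poly)
        for i in range(n):
            x1, y1 = poly[i]
            x2, y2 = poly[(i + 1) % n]
            axis = (y1 - y2, x2 - x1)  # normal of the edge poly[i] -> poly[i+1]
            proj_a = [axis[0] * x + axis[1] * y for x, y in hull_a]
            proj_b = [axis[0] * x + axis[1] * y for x, y in hull_b]
            if max(proj_a) < min(proj_b) or max(proj_b) < min(proj_a):
                return False  # found a separating axis, so the hulls are disjoint
    return True
```

Under this reading, a skin-region hull that overlaps a hair-region hull would be treated as a face candidate; the paper's actual decision rule may be more specific.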

Geometrical Analysis on the Formation Mechanism of Milling Burr on Arbitrary Feature

이제열;안용진;김영진
- 한국CDE학회논문집 / Vol. 6, No. 4 / pp. 222-228 / 2001
In the milling operation, a burr can be generated at the intersection of the cutting tool and the workpiece. Burr formation lowers the efficiency of the operation and increases cost. To understand the burr formation mechanism in milling on an arbitrary feature, we developed an algorithm to analyze and predict the exit burr formation mechanism. First, the arbitrarily shaped workpiece is recognized from the CAD data, which includes point information on the vertices of the workpiece. Second, the CAM data regarding tool geometry, tool path, cutting speed, and material are retrieved to simulate the actual cutting process. Third, we predict exit burr formation on the edge of the workpiece based on geometric analysis. Lastly, the algorithm is implemented in the Windows environment to visualize the burr formation simulation. With this information, we can predict in advance which portion of the workpiece will have an exit burr, so that we can find a way to minimize exit burr formation in the actual cutting.

Digit Recognition for Vehicle License Plate Based on Opened Enclosure

유쟁;김동욱
- 한국정보전자통신기술학회논문지 / Vol. 8, No. 6 / pp. 453-459 / 2015
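This paper proposes a digit recognition method for vehicle license plates based on opened enclosures. The proposed method divides each digit into an upper and a lower part and recognizes the digit by determining the opened enclosure in each part. The division into upper and lower parts is based on a predefined cutting line, whose position is adjusted according to the number of intersection points. Compared with template matching, the proposed method is robust and is not affected by noise, rotation, and similar factors. To evaluate its performance in simulation, the digits used on license plates were processed and the results are presented. The proposed method shows a very high recognition rate for license plate digits.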
https://doi.org/10.17661/jkiiect.2015.8.6.453

Novel View Generation Using Affine Coordinates

Sengupta, Kuntal;Ohya, Jun
- 한국방송∙미디어공학회:학술대회논문집 / 한국방송공학회 1997 Proceedings International Workshop on New Video Media Technology / pp. 125-130 / 1997
In this paper we present an algorithm to generate new views of a scene, starting with images from weakly calibrated cameras. Errors in 3D scene reconstruction are usually reflected in the quality of the new scene generated, so we seek a direct method for reprojection. We use the knowledge of dense point matches and their affine coordinate values to estimate the corresponding affine coordinate values in the new scene. We borrow ideas from the object recognition literature and extend them significantly to solve the problem of reprojection. Unlike the epipolar line intersection algorithms for reprojection, which require at least eight matched points across three images, we need only five matched points. The theory of reprojection is used with hardware-based rendering to achieve fast rendering. We demonstrate our results of novel view generation from stereo pairs for arbitrary locations of the virtual camera.

Color Images Utilizing the Properties Emotional Quantification Algorithm

이연란
- 한국콘텐츠학회논문지 / Vol. 15, No. 11 / pp. 1-9 / 2015
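Attention is focusing on affective computing research that seeks to control and change emotion recognition in a systematic way. A quantified, objective evaluation method for color affective computing is therefore essential. This paper applies digital color affective computing calculations as a numerical representation of image emotion recognition. The approach builds on color attributes, the emotion recognition cues concentrated in an image, and divides the color emotion attributes into hue, brightness, and saturation. Emotion scores and weights that depend on the increase or decrease in the tone proportion of each attribute are applied to the emotion formula. The emotion formula computes the degree of pleasantness (X axis) and the degree of tension (Y axis). The intersection of pleasantness (X axis) and tension (Y axis) is then placed on the emotion coordinate plane as the emotion point. The emotion coordinates use Russell's Core Affect. From the position of the emotion point, the representative emotion magnitude and the emotion correlation are expressed numerically and quantified through color affective computing.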
https://doi.org/10.5392/JKCA.2015.15.11.001

Detection Algorithm of Road Surface Damage Using Adversarial Learning

심승보
- 한국ITS학회 논문지 / Vol. 20, No. 4 / pp. 95-105 / 2021
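Detecting road surface damage is necessary for a comfortable driving environment and for preventing safety accidents. Road management agencies use inspection equipment and systems based on automation technology. Among these technologies, detecting damage to the road surface plays an important role. Recently, research on deep learning based techniques has been actively pursued. Developing such techniques requires road images and label images. However, obtaining label images takes a great deal of time and labor. To address this problem, this paper proposes an adversarial learning method, one of the semi-supervised learning techniques. To implement it, a lightweight deep neural network model was trained using 5,327 road images and 1,327 label images. When tested on 400 road images, the resulting model achieved a mean intersection over union of 80.54% and an F1 score of 77.85%. As a result, we developed a technique that can improve recognition performance by adding only road images, without label images, to training, and we hope it will be used for road surface management in the future.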
https://doi.org/10.12815/kits.2021.20.4.95
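The two metrics reported in this abstract, mean intersection over union and F1 score, can be illustrated for a binary damage mask as follows. This is a sketch under stated assumptions: masks are nested lists of 0/1 values, the mean IoU is averaged over the two classes (damage and background), and the F1 score is computed for the damage class. The abstract does not say how the authors aggregated over classes or images, and the function names are hypothetical.

```python
from typing import List, Tuple

Mask = List[List[int]]  # 1 = damaged pixel, 0 = background


def confusion_counts(pred: Mask, truth: Mask) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) over all pixels of two equally sized masks."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == 1 and t == 1:
                tp += 1
            elif p == 1 and t == 0:
                fp += 1
            elif p == 0 and t == 1:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def mean_iou(pred: Mask, truth: Mask) -> float:
    """Average IoU over the damage and background classes that actually occur."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    ious = []
    if tp + fp + fn > 0:  # damage class appears in prediction or ground truth
        ious.append(tp / (tp + fp + fn))
    if tn + fp + fn > 0:  # background class appears in prediction or ground truth
        ious.append(tn / (tn + fp + fn))
    return sum(ious) / len(ious) if ious else 0.0


def f1_score(pred: Mask, truth: Mask) -> float:
    """F1 for the damage class: harmonic mean of precision and recall."""
    tp, fp, fn, _ = confusion_counts(pred, truth)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0
```

For a 2x3 prediction with one true positive, one false positive, one false negative, and three true negatives, the damage IoU is 1/3, the background IoU is 3/5, and the F1 score is 0.5.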

A Model for Machine Fault Diagnosis based on Mutual Exclusion Theory and Out-of-Distribution Detection

Cui, Peng;Luo, Xuan;Liu, Jing
- KSII Transactions on Internet and Information Systems (TIIS) / Vol. 16, No. 9 / pp. 2927-2941 / 2022
The primary task of machine fault diagnosis is to judge whether the current state is normal or damaged, so it is a typical binary classification problem with mutual exclusion. Mutually exclusive events and out-of-domain detection have one thing in common: there are two types of data with no intersection. We propose a fusion model to improve the accuracy of machine fault diagnosis, based on the mutual exclusivity of events and the commonality with out-of-distribution detection, and generalize it to all binary classification problems. It is reported that the performance of a convolutional neural network (CNN) decreases as the number of recognition types increases, so the variational auto-encoder (VAE) is used as the primary model. Two VAE models are trained, one on the machine's normal sound data and one on its fault sound data. Two reconstruction probabilities are obtained during testing. The smaller value is transformed into a correction value for the other according to the mutually exclusive characteristics. Finally, the classification result is obtained according to the fusion algorithm. Filtering normal data features from fault data features is also proposed, which shields the interference and makes the fault features more prominent. We confirm that good performance improvements are achieved on the machine fault detection data set, and the results are better than most mainstream models.
https://doi.org/10.3837/tiis.2022.09.005

3-D Building Reconstruction from Standard IKONOS Stereo Products in Dense Urban Areas

이석군;박정환
- 대한토목학회논문집 / Vol. 26, No. 3D / pp. 535-540 / 2006
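This paper proposes a sequence of processing steps for efficiently reconstructing 3D building information in urban areas from high-resolution color stereo imagery. The proposed method consists of three stages: building extraction using the BDT technique, image matching using the Hausdorff distance and color indexing, and building reconstruction using photogrammetric techniques. The algorithm was tested on IKONOS color stereo imagery, a representative example of high-resolution satellite imagery. The experiments confirmed that the BDT technique, which increases the variance of brightness values between the background and building portions of the image, performs well for building extraction. In the image matching stage, which uses two building recognition techniques, using both color and boundary information made it possible to recognize most extracted buildings automatically and to carry out image matching smoothly from these initial positions. Finally, 3D building information for the test area was obtained through a forward rational polynomial model, and accuracy was analyzed by comparison with reference data.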
https://doi.org/10.12652/Ksce.2006.26.3D.535

Indoor Path Recognition Based on Wi-Fi Fingerprints

Donggyu Lee;Jaehyun Yoo
- Journal of Positioning, Navigation, and Timing / Vol. 12, No. 2 / pp. 91-100 / 2023
The existing indoor localization method using Wi-Fi fingerprinting has a high collection cost and relatively low accuracy, and thus requires correction through integration with other technologies. This paper proposes a new method that significantly reduces collection costs compared to existing methods using Wi-Fi fingerprinting. Furthermore, it does not require labeling of data at collection and can estimate pedestrian travel paths even in large indoor spaces. The proposed pedestrian movement path estimation process is as follows. Data collection is accomplished by setting up a feature area near each indoor intersection, moving through the feature areas, and collecting data without labels. The collected data are processed using Kernel Linear Discriminant Analysis (KLDA), and the valley points of the Euclidean distance between pairs of data are obtained within the feature space. We build training data by labeling the data corresponding to valley points, and some nearby data, with feature area numbers, and by labeling the data between one valley point and the next as path data between the corresponding feature areas. Finally, for testing, data are collected randomly throughout the indoor space and KLDA is applied in the same way as for the training data to build test data. The K-Nearest Neighbor (K-NN) algorithm is then applied, and the movement path of the test data is estimated with a correction algorithm that considers only routes reachable from the most recently estimated location. Accuracy was verified by comparing the true paths in the indoor space with those estimated by the proposed method, which achieved approximately 90.8% and 81.4% accuracy in two experimental spaces, respectively.
https://doi.org/10.11003/JPNT.2023.12.2.91
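The final estimation step in this abstract, K-NN restricted to locations reachable from the most recent estimate, might look like the sketch below. It is an illustration under assumptions rather than the authors' algorithm: feature vectors are taken to be already projected (the KLDA step is omitted), reachability is given as a hypothetical adjacency map between area labels, and the correction is implemented by simply discarding training samples whose label is not reachable.

```python
import math
from collections import Counter
from typing import Dict, List, Sequence, Set, Tuple

Sample = Tuple[Sequence[float], str]  # (projected feature vector, area or path label)


def knn_with_reachability(
    train: List[Sample],
    query: Sequence[float],
    last_label: str,
    adjacency: Dict[str, Set[str]],
    k: int = 3,
) -> str:
    """Majority vote among the k nearest samples whose label is reachable.

    A label counts as reachable if it equals the last estimate or is listed
    as adjacent to it (assumed rule; the paper's correction may differ).
    """
    allowed = adjacency.get(last_label, set()) | {last_label}
    candidates = [
        (math.dist(features, query), label)
        for features, label in train
        if label in allowed
    ]
    if not candidates:
        return last_label  # nothing reachable in the training set: keep the estimate
    candidates.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in candidates[:k])
    return votes.most_common(1)[0][0]
```

With areas A, B, and C connected in a chain, a query that lies closest to C is still assigned to B when the previous estimate was A, because C cannot be reached from A in one step.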

Video Analysis System for Action and Emotion Detection by Object with Hierarchical Clustering based Re-ID

이상현;양성훈;오승진;강진범
- 지능정보연구 / Vol. 28, No. 1 / pp. 89-106 / 2022
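With the recent surge in video data, demand has also surged for computer vision technologies that process it effectively, such as object detection and tracking, action recognition, facial expression recognition, and re-identification (Re-ID). However, object detection and tracking face many difficulties that degrade performance, such as objects leaving and re-entering the filmed scene and occlusion. As a result, action and expression recognition models built on object detection and tracking models also struggle to extract per-object data. In addition, deep learning architectures that combine multiple models suffer performance degradation from bottlenecks and a lack of optimization. This study proposes a video analysis system for action and facial expression detection. The system uses a YOLOv5-based DeepSORT object tracking model, a SlowFast-based action recognition model, a Torchreid-based re-identification model, and the facial expression recognition model of AWS Rekognition, and applies a Re-ID technique based on single-linkage hierarchical clustering together with a processing technique that maximizes GPU memory throughput. The proposed system achieves higher accuracy than a re-identification model using a simple metric, along with near real-time processing performance. It prevents tracking failures caused by objects leaving and re-entering the scene, occlusion, and similar events, and continuously links per-object action and expression recognition results to the same object so that video can be analyzed efficiently.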
https://doi.org/10.13088/jiis.2022.28.1.089
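To make the single-linkage hierarchical clustering idea in this abstract concrete, the sketch below groups feature vectors so that any two vectors within a distance threshold end up in the same cluster, directly or through a chain of neighbors. Cutting a single-linkage dendrogram at a threshold is equivalent to finding connected components of that threshold graph, which is what the union-find code does. The Euclidean metric, the threshold value, and the idea that each vector is one track's appearance embedding are assumptions; the abstract does not specify them.

```python
import math
from typing import List, Sequence


def single_linkage_clusters(
    vectors: List[Sequence[float]], threshold: float
) -> List[int]:
    """Return a cluster id per vector, merging any pair closer than threshold."""
    n = len(vectors)
    parent = list(range(n))

    def find(i: int) -> int:
        # Follow parent links to the root, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(vectors[i], vectors[j]) <= threshold:
                root_i, root_j = find(i), find(j)
                if root_i != root_j:
                    parent[root_j] = root_i  # merge the two clusters

    # Relabel roots as consecutive ids in order of first appearance.
    ids = {}
    labels = []
    for i in range(n):
        root = find(i)
        labels.append(ids.setdefault(root, len(ids)))
    return labels
```

In a Re-ID setting, tracks that receive the same cluster id would be treated as the same object, so a person who leaves and re-enters the scene can keep one identity as long as the embeddings stay within the threshold chain.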
