통합 검색 | Korea Science

비디오 질의 응답 시스템을 위한 전이 학습 기반의 멀티 모달 퓨전 정답 선택 모델 (Transfer Learning-based Multi-Modal Fusion Answer Selection Model for Video Question Answering System)

박규민;박성배
- 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
- /
- 한국정보과학회언어공학연구회 2021년도 제33회 한글 및 한국어 정보처리 학술대회
- /
- pp.548-553
- /
- 2021
비디오 질의 응답은 입력으로 주어진 비디오와 질문에 적절할 정답을 제공하기 위해 텍스트, 이미지 등 다양한 정보처리가 요구되는 대표적인 multi-modal 문제이다. 질의 응답 시스템은 질의 응답의 성능을 높이기 위해 다수의 서로 다른 응답 모듈을 사용하기도 하며 생성된 정답 후보군 중 가장 적절할 정답을 선택하는 정답 선택 모듈이 필요하다. 정답 선택 모듈은 응답 모듈의 서로 다른 관점을 고려하여 응답 선택을 선택할 필요성이 있다. 하지만 응답 모듈이 black-box 모델인 경우 정답 선택 모듈은 응답 모듈의 parameter와 예측 분포를 통해 지식을 전달 받기 어렵다. 그리고 학습 데이터셋은 응답 모듈이 학습에 사용했기 때문에 과적합 문제로 각 모듈의 관점을 학습하기엔 어려우며 학습 데이터셋 이외 비교적 적은 데이터셋으로 학습해야 하는 문제점이 있다. 본 논문에서는 정답 선택 성능을 높이기 위해 전이 학습 기반의 멀티모달 퓨전 정답 선택 모델을 제안한다. DramaQA 데이터셋을 통해 성능을 측정하여 제안된 모델의 우수성을 실험적으로 증명하였다.
PDF

머신러닝 기반 낙상 인식 알고리즘 (Fall Detection Algorithm Based on Machine Learning)

정준현;김남호
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2021년도 추계학술대회
- /
- pp.226-228
- /
- 2021
구글사에서 출시된 ML Kit API의 Pose detection를 사용한 영상기반 낙상 알고리즘을 제안한다. Pose detection 알고리듬을 사용하여 추출된 신체의 33개의 3차원 특징점을 활용하여 낙상을 인식한다. 추출된 특징점을 분석하여 낙상을 인식하는 알고리듬은 k-NN을 사용한다. 영상의 크기와 영상내의 인체의 크기에 영향을 받지 않도록 정규화과정을 거치며 특징점들의 상대적인 움직임을 분석하여 낙상을 인식한다. 본 실험을 위해 사용한 13개의 테스트 영상중 13개의 영상에서 낙상을 인식하여 100%의 성공률을 보였다.
PDF

Customized Safety Information Delivery System for Unskilled Construction Worker Training

Jo, Junhyeon;Baik, Sangeun;Pedro, Akeem;Lee, Doyeop;Park, Chansik
- 국제학술발표논문집
- /
- The 9th International Conference on Construction Engineering and Project Management
- /
- pp.525-532
- /
- 2022
Accidents at construction sites in Korea account for more than half of all industrial accidents. To solve this problem, a policy to strengthen safety education was implemented to ensure the safety of workers. However, it was analyzed that there is a high possibility of accidents because workers did not receive proper safety information for each risk factor due to general lecture-style education. In addition, statistics show that the accident status of workers with fewer years of period is high, indicating that a customized information delivery method needs to be proposed for unskilled workers with fewer years of period. Research on the importance of education has been conducted, but no information delivery method has been identified. For unskilled workers to effectively receive safety information, appropriate delivery formats (text, photos, illustrations, 4D-BIM, 360-based panorama, video, animation) were analyzed, and a new method of education was proposed. If customized safety information is provided according to this proposal, effective information delivery to unskilled workers will be possible, and it is expected to be verified in various ways.
PDF

딥러닝 영상인식을 이용한 헬멧 미착용 검출 시스템 (System for Detection not Wearing Helmet using Deep Learning Video Recognition)

함경윤;이정우;이장현;강길남;조영준;박동훈;류명춘
- 한국컴퓨터정보학회:학술대회논문집
- /
- 한국컴퓨터정보학회 2022년도 제65차 동계학술대회논문집 30권1호
- /
- pp.277-278
- /
- 2022
최근 전동킥보드 보급이 이루어지면서 이와 관련된 교통사고가 증가하고 있다. 이에 따라 전동킥보드 주행 시 헬멧 착용을 의무화하는 도로교통법 개정안이 시행되고 있지만, 물리적으로 대부분 현장에서 단속이 어렵다. 본 논문에서는 딥러닝 영상인식 기술을 활용한 객체검출(object detection) 모델인 YOLOv4를 기반으로 전동킥보드 사용자의 헬멧 미착용 검출시스템을 제안하였다. 이를 통해 전동킥보드 주행 시 헬멧 착용 여부를 효율적으로 단속하는데 활용 할 수 있을 것으로 기대한다.
PDF

대형 가상현실 공연장을 위한 360 도 비디오 스트리밍 시스템 프로토타입 구현 (Implementing 360-degree VR Video Streaming System Prototype for Large-scale Immersive Displays)

류영일;최이현;류은석
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송∙미디어공학회 2022년도 하계학술대회
- /
- pp.1241-1244
- /
- 2022
최근 K-Pop 을 위시한 예술공연 콘텐츠에 몰입형 미디어를 접목한 온택트 (Ontact) 미디어 스트리밍 서비스가 주목받고 있는 가운데, 본 논문은 일반적으로 사용되는 2D 디스플레이 또는 HMD (Head-Mounted Display) 기반 VR (Virtual Reality, VR) 서비스에서 탈피하여, 대형 가상현실 공연장을 위한 360 도 VR 비디오 스트리밍 시스템을 제안한다. 제안된 시스템은 Phase 1, 2, 3 의 연구개발 단계를 밟아 6DoF (Degrees of Freedom) 시점 자유도를 지원하는 360 도 VR 비디오 스트리밍 시스템을 개발하는 것을 최종목표로 하고 있으며, 현재는 Phase 1: 대형 가상현실 공연장을 위한 3DoF 360 도 VR 비디오 스트리밍 시스템 프로토타입의 개발까지 완료되었다. 구현된 스트리밍 시스템 프로토타입은 서브픽처 기반 Viewport-dependent 스트리밍 기술이 적용되어 있으며, 기존 방식과 비교하였을 때 약 80%의 비트율 감소, 약 543%의 영상 디코딩 속도 향상을 확인하였다. 또한, 단순 구현 및 성능평가에서 그치지 않고, 실제 미국 UCSB 에 위치한 대형 가상현실 공연장 AlloSphere 에서의 시범방송을 수행하여, 향후 Phase 2, 3 연구단계를 위한 연구적 기반을 마련하였다.
PDF

Designing A Concatenated Code To Improve The Error Performance Of Low-Priority Data In T-DMB System With The Hierarchical Modulation

이이극;김성관;김한종
- 한국정보통신학회:학술대회논문집
- /
- 한국해양정보통신학회 2008년도 춘계종합학술대회 A
- /
- pp.689-692
- /
- 2008
Hierarchical modulation has been considered for achieving higher data rates in Terrestrial-DMB(T-DMB) systems. And for achieving a higher data rates transmission, the low-priority (LP) data, which is used to carry additional data, such as video data, audio data and textual data, should be perfectly decoded in a certain value of $E_b/N_o$. Unfortunately, the man-made noise badly affects the high-priority (HP) symbol, which is used to carry the conventional data in the existed T-DMB system; and since the advanced T-DMB system is proposed to fit for the legacy T-DMB receivers, the low-priority symbols in the hierarchical modulation are much worse affected by the neighbors, who are both in the same quadrant. Because of the feature that mentioned previously, the turbo code has been considered to deal with the LP data. And due to the degradation which caused by the shortened symbol distance, the error performance of LP data is not sufficient by only using the turbo code. In this paper, we propose a Reed-Solomon code used outside of turbo code, and with the turbo code, it becomes a concatenated code. In this paper, there are some simulation results, within the comparison of those performances, we can see how a Reed-Solomon code is utilized for degradation of error performance which is caused by the hierarchical constellation, and how to design a Reed-Solomon code which is suitable for improving the degradation of error performance.
PDF

무인 헬기 사진측량시스템을 이용한 Web 상에서의 문화재 관리 정보시스템 구축 (Construction of Information System for Management of Cultural Heritage on the Web Using a Pilotless Helicopter Photogrammetry System)

이종출;양인태;장호식;허종호
- 한국측량학회:학술대회논문집
- /
- 한국측량학회 2004년도 춘계학술발표회논문집
- /
- pp.389-394
- /
- 2004
Structure-typed cultural heritage, objects of preservation are positioned as one of the very important heritage in the nation, and the preservation of prototypical structures become influential in national development and against natural disaster. For this reason, Digital Close Range Photogrammetry has recently been diversely used. Despite its popular use, the measurement has limits that make it unsuitable for photographing precise cultural heritage situated at high mountainous terrain or where people can not approach easily. These high gigantic stone statues are among the preserved structure-typed cultural heritage. In order to supplement the limits, when using the measurement, a camera tripod with +30m, a ladder truck and a shore should be equipped, which means additional equipment leads to it being a waste of cost and time. In this vein, a device was developed in detail, using a RC Helicopter installed with a CCD video camera with ease of control, safety, equipment, carrying, movement and approach, then checked image shot by a wireless modem at real time and considered the economical efficiency without re-photographing. Next, the author digitized the images of the nationally designated structure-typed cultural heritage, used materials on their restoration as the third dimension in order to construct the integrated management-information system for cultural heritage. Through the above processes, this study can provide specific information on 3D images and 3D CAD sections of structured-typed cultural heritage for both the public and specialists on the web. Moreover, it suggests the foundation to restore the damaged cultural heritage in the future by aiming for their effective management and preservation.
PDF

ATM에HDSL 정합 기능 및 서비스 구현 (An Achievement of High-rate Digital Subscriber Lines(HDSL) Interface Function into the ATM Switching System and its Service Implementation)

양충렬;장재득;김진태;강석열;김환우
- 한국정보처리학회논문지
- /
- 제4권9호
- /
- pp.2378-2390
- /
- 1997
본 논문에서는 ATM(Asynchronous Transfer Mode) 교환기에 E1급 HDSL(High-rate Digital Subscriber Lines) 정합 기능을 구현하였다. 26 게이지(0.4mm) 및 24 게이지(0.5mm) 페어 동 선로(copper telephone lines)로 구성되는 CSA(Carrier Serving Areas) 환경에서 누화(crosstalk), 임펄스, 전원 선로 잡음(power line noise) 및 longitudinal 같은 주요 전송 손실이 존재하는 기존 전화 가입자 선로 상에서 E1급 HDSL 데이터를 전송할 때 $10^{-7}$의 셀 손실 성능을 만족하는 가입자 서비스 루프 거리 및 셀 손실율을 평가하였다. 또한 ATM에서 HDSL을 이용한 MPEG-1급 주문형 비디오서비스, 영상 회의${\cdot}$서비스 및 고속 인터넷 서비스 기능을 확인하였다.
PDF

시각 장애인을 위한 Smart Portable Navigation System 개발과 1:N 서비스 구현 (Smart Portable Navigation System Development and Implementation of 1:N Service for Visually impaired Persons)

변재령;김영길
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2012년도 춘계학술대회
- /
- pp.191-193
- /
- 2012
기존의 개발된 시각 장애인을 위한 길 안내 서비스를 위한 보조기구는 지팡이에 장착된 RFID 태그를 이용, 표지블록과 RF통신을 하는 정도의 간단한 보행 안내 서비스였습니다. 이는 RFID의 리더기의 인식거리가 짧고, 명확한 장애물의 위치, 크기 및 형태를 판단 할 수 없다. 이에 위험 사항이나 길안내 중 경로 이탈 발생 시 대책방안이 시급히 필요하다. 오늘 날 스마트 디바이스 개발로 인해 사용자들에게 다양한 혜택과 편리성을 제공 하고 있다. 이에 안드로이드 플랫폼 Client 와 Server(PC)간의 소켓 스트림을 이용, 실시간 영상정보와 음성, 위치정보를 전송하여 시각장애인의 위험 상황에 즉각적인 조치를 취할 수 있는 시스템 및 1:N 서비스를 구현하고자 한다.
PDF

HTTP를 활용한 동적 적응적 스트리밍 시스템 (Dynamic adaptive streaming system using HTTP)

반태학;박상노;김태승;이병권;정회경
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2012년도 춘계학술대회
- /
- pp.488-490
- /
- 2012
오늘날 QoS/QoE 기술의 일환인 HTTP를 활용한 동적 적응적 스트리밍 기술이 이슈화 되고 있다. 본 논문에서는 HTTP를 활용한 동적 적응적 스트리밍 기술에 대해 연구하였다. 이를 기반으로 HTTP를 활용한 동적 적응적 스트리밍 시스템을 설계 및 구현하였다. 본 시스템에서는 MPEG2-TS 파일에 대해 비트율별 변환, 일정 시간 별 Segment의 분할, 앞에서의 파일들을 참고하여 최종적으로 전송에서 활용되는 MPD(Media Presentation Description) File의 생성과 서버와 클라이언트 간의 스트리밍에 대해 HTTP를 활용한 동적이고 적응적인 네트워크 환경에서의 비트율별 스트리머로 구성된다. 이는 불특정 다수의 네트워크 환경에서 끊김없는 지속적인 영상의 재생을 위한 다양한 스트리밍 분야와 멀티미디어 분야에 활용될 것이다.
PDF

검색결과 888건 처리시간 0.028초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)