통합 검색 | Korea Science

Fast Depth Video Coding with Intra Prediction on VVC

Wei, Hongan;Zhou, Binqian;Fang, Ying;Xu, Yiwen;Zhao, Tiesong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제14권7호
- /
- pp.3018-3038
- /
- 2020
In the stereoscopic or multiview display, the depth video illustrates visual distances between objects and camera. To promote the computational efficiency of depth video encoder, we exploit the intra prediction of depth videos under Versatile Video Coding (VVC) and observe a diverse distribution of intra prediction modes with different coding unit sizes. We propose a hybrid scheme to further boost fast depth video coding. In the first stage, we adaptively predict the HADamard (HAD) costs of intra prediction modes and initialize a candidate list according to the HAD costs. Then, the candidate list is further improved by considering the probability distribution of candidate modes with different CU sizes. Finally, early termination of CU splitting is performed at each CU depth level based on the Bayesian theorem. Our proposed method is incorporated into VVC intra prediction for fast coding of depth videos. Experiments with 7 standard sequences and 4 Quantization parameters (Qps) validate the efficiency of our method.
https://doi.org/10.3837/tiis.2020.07.016 인용 PDF KSCI HTML

효율적인 비동기 전송을 지원하기 위한 RTLS 미들웨어의 확장 (API Extension of RTLS Middleware for Efficient Asynchronous Transmission)

박재관;홍봉희;이승철
- 한국공간정보시스템학회 논문지
- /
- 제11권2호
- /
- pp.111-118
- /
- 2009
최근, 많은 기업에서 실시간 자산 관리를 위해 RTLS 시스템을 구축하고 있다. RFID와 달리, RTLS 태그는 이동 과정과 한정되지 않고 임의의 위치에서 지속적으로, 자동적으로 인식된다. 그러나, RTLS 미들웨어의 표준 API는 2가지 한계점이 있다. 미들웨어가 애플리케이션으로 불필요한 데이터를 포함하는 대용량의 데이터를 전달해야 한다는 것과 미들웨어에서 애플리케이션으로 질의 결과를 전달하는 방식에서 동기 방식만을 지원한다는 문제가 그것이다. 이 논문에서는 이러한 문제를 해결하기 위해, 다양한 질의에 대해 애플리케이션으로 전달되는 데이터 량을 줄이기 위한 질의 타입별 정제 조건을 명세할 수 있는 SessionSpec을 정의하고 실시간 이벤트 처리를 위한 비동기 방식 지원 방법을 제안한다. 또한, 이러한 방법을 적용한 RTLS 미들웨어를 설계하고 구현하여 그 결과를 확인하였다.
PDF

A Real-Time Integrated Hierarchical Temporal Memory Network for the Real-Time Continuous Multi-Interval Prediction of Data Streams

Kang, Hyun-Syug
- Journal of Information Processing Systems
- /
- 제11권1호
- /
- pp.39-56
- /
- 2015
Continuous multi-interval prediction (CMIP) is used to continuously predict the trend of a data stream based on various intervals simultaneously. The continuous integrated hierarchical temporal memory (CIHTM) network performs well in CMIP. However, it is not suitable for CMIP in real-time mode, especially when the number of prediction intervals is increased. In this paper, we propose a real-time integrated hierarchical temporal memory (RIHTM) network by introducing a new type of node, which is called a Zeta1FirstSpecializedQueueNode (ZFSQNode), for the real-time continuous multi-interval prediction (RCMIP) of data streams. The ZFSQNode is constructed by using a specialized circular queue (sQUEUE) together with the modules of original hierarchical temporal memory (HTM) nodes. By using a simple structure and the easy operation characteristics of the sQUEUE, entire prediction operations are integrated in the ZFSQNode. In particular, we employed only one ZFSQNode in each level of the RIHTM network during the prediction stage to generate different intervals of prediction results. The RIHTM network efficiently reduces the response time. Our performance evaluation showed that the RIHTM was satisfied to continuously predict the trend of data streams with multi-intervals in the real-time mode.
https://doi.org/10.3745/JIPS.02.0011 인용 PDF KSCI

Inverse SAR에서 속도를 모르는 움직이는 물체의 이미징 알고리즘 (Imaging an Unknown Velocity Target in Inverse SAR)

양훈기;김은수
- 한국통신학회논문지
- /
- 제19권5호
- /
- pp.796-804
- /
- 1994
본 논문은 Inverse SAR를 이용하여 속도를 모르는 움직이는 물체의 영상 이미지를 얻는 이미징 알고리즘을 제시하였고 실제 데이터를 알고리즘에 적용되었다. 실제 데이터는 stepped-frequency 변조된 레이더 신호를 송신하였고 수신된 데이터는 sampling rate이 충분하지 않으나 reference 신호를 mixing 시켜 unaliased 되게 만든 후 interpolation 에 의해서 해결하였다. 알고리즘을 적용시키는데 요구되는 물체의 속도는 subaperture processing 방법에 의해서 얻어졌으며 얻어진 속도에 의해서 squint-mode SAR geometry 로 변환한 후 최근에 제시된 approximation 이 없는 이미징 알고리즘을 사용하여 최종적으로 이미지를 얻게 되었다. 또한 ISAR가 데이터를 송수신 하는 동안 물체의 속도가 변하는 경우 이것을 보상하는 방법을 제시하였다.
PDF

Prioritized Multipath Video Forwarding in WSN

Asad Zaidi, Syed Muhammad;Jung, Jieun;Song, Byunghun
- Journal of Information Processing Systems
- /
- 제10권2호
- /
- pp.176-192
- /
- 2014
The realization of Wireless Multimedia Sensor Networks (WMSNs) has been fostered by the availability of low cost and low power CMOS devices. However, the transmission of bulk video data requires adequate bandwidth, which cannot be promised by single path communication on an intrinsically low resourced sensor network. Moreover, the distortion or artifacts in the video data and the adherence to delay threshold adds to the challenge. In this paper, we propose a two stage Quality of Service (QoS) guaranteeing scheme called Prioritized Multipath WMSN (PMW) for transmitting H.264 encoded video. Multipath selection based on QoS metrics is done in the first stage, while the second stage further prioritizes the paths for sending H.264 encoded video frames on the best available path. PMW uses two composite metrics that are comprised of hop-count, path energy, BER, and end-to-end delay. A color-coded assisted network maintenance and failure recovery scheme has also been proposed using (a) smart greedy mode, (b) walking back mode, and (c) path switchover. Moreover, feedback controlled adaptive video encoding can smartly tune the encoding parameters based on the perceived video quality. Computer simulation using OPNET validates that the proposed scheme significantly outperforms the conventional approaches on human eye perception and delay.
https://doi.org/10.3745/JIPS.03.0002 인용 PDF KSCI

Audio and Video Bimodal Emotion Recognition in Social Networks Based on Improved AlexNet Network and Attention Mechanism

Liu, Min;Tang, Jun
- Journal of Information Processing Systems
- /
- 제17권4호
- /
- pp.754-771
- /
- 2021
In the task of continuous dimension emotion recognition, the parts that highlight the emotional expression are not the same in each mode, and the influences of different modes on the emotional state is also different. Therefore, this paper studies the fusion of the two most important modes in emotional recognition (voice and visual expression), and proposes a two-mode dual-modal emotion recognition method combined with the attention mechanism of the improved AlexNet network. After a simple preprocessing of the audio signal and the video signal, respectively, the first step is to use the prior knowledge to realize the extraction of audio characteristics. Then, facial expression features are extracted by the improved AlexNet network. Finally, the multimodal attention mechanism is used to fuse facial expression features and audio features, and the improved loss function is used to optimize the modal missing problem, so as to improve the robustness of the model and the performance of emotion recognition. The experimental results show that the concordance coefficient of the proposed model in the two dimensions of arousal and valence (concordance correlation coefficient) were 0.729 and 0.718, respectively, which are superior to several comparative algorithms.
https://doi.org/10.3745/JIPS.02.0161 인용 PDF KSCI

GMM-Based Maghreb Dialect Identification System

Nour-Eddine, Lachachi;Abdelkader, Adla
- Journal of Information Processing Systems
- /
- 제11권1호
- /
- pp.22-38
- /
- 2015
While Modern Standard Arabic is the formal spoken and written language of the Arab world; dialects are the major communication mode for everyday life. Therefore, identifying a speaker's dialect is critical in the Arabic-speaking world for speech processing tasks, such as automatic speech recognition or identification. In this paper, we examine two approaches that reduce the Universal Background Model (UBM) in the automatic dialect identification system across the five following Arabic Maghreb dialects: Moroccan, Tunisian, and 3 dialects of the western (Oranian), central (Algiersian), and eastern (Constantinian) regions of Algeria. We applied our approaches to the Maghreb dialect detection domain that contains a collection of 10-second utterances and we compared the performance precision gained against the dialect samples from a baseline GMM-UBM system and the ones from our own improved GMM-UBM system that uses a Reduced UBM algorithm. Our experiments show that our approaches significantly improve identification performance over purely acoustic features with an identification rate of 80.49%.
https://doi.org/10.3745/JIPS.02.0015 인용 PDF KSCI

초고속 통신망에서 비디오 컨퍼런싱을 위한 다중 멀티캐스트 서버 (Multi-Multicast Server for Video Conferencing on Information Super Highway)

안상준;이승로;한선영
- 한국정보처리학회논문지
- /
- 제3권7호
- /
- pp.1858-1867
- /
- 1996
본 논문은 초고속 통신망에서 비디오 컨퍼런싱을 위한 플랫폼을 나타낸다. 이플 랫폼은 ATM(Asynchronous Transfer Mode) 망 상에서 IP 멀티캐스트 데이타를 멀티캐 스팅하기 위해 다중 멀티채스트 서버를 이용한다. 본 논문에서 제안한 MARS(Multicast Address ResolutionServer)를 사용하여 D class IP 주소를 ATM 주소와 매핑하고, 또한 하나의 MCS(MultiCast Server)의 다운에 대한 처리를 수행 하도록 한다. 기존에 제안 된 하나의 MCS 사용 시 문제시 되던 병목현상을 해결한다.
PDF

영상과 GPS 정보를 결합한 Follow-me Selfie 드론 (Visual-GPS combined Drone Follow-me Selfie Drone)

도 딴 뚜안;안희준
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2017년도 추계학술발표대회
- /
- pp.134-137
- /
- 2017
Follow-me function of drones is new and attractive for selfie drone users, where the drone autonomously follows and capture the user. Currently the products use the difference between GPS's in the drone and user side mobile GCS, but the targeting accuracy is not satisfactory owing to the low accuracy of GPS data, often the order of ten meters. We designed a new follow-me mode algorithm that utilizes the accuracy of visual tracking algorithm and the reliability of GPS-based. The experiment shows that proposed follow-me can capture much accurately the target user in the center of video content than GPS-only methods, and recover the vision algorithm failure quickly in 5-10 seconds.
https://doi.org/10.3745/PKIPS.y2017m11a.134 인용 PDF

실내 게이트웨이 설치 환경에서 P2P 기반의 LoRa 통신 성능 측정 실험에 관한 연구 (Performance Measurement of LoRaWAN Communications using P2P Mode with Indoor Gateway Placement)

강경우;이은규
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2017년도 추계학술발표대회
- /
- pp.1254-1257
- /
- 2017
LoRa는 저전력 및 장거리 작동을 위해 설계된 새로운 ISM 대역 무선 기술이며, LoRaWAN은 LoRa에서 정의된 광역 네트워크 프로토콜이다. 본 논문에서는 실제 환경에서 LoRaWAN 기술의 통신 성능을 검증하는 것을 목표로 한다. 이를 위해, 캠퍼스 내에 LoRaWAN 실험을 위한 실제 테스트 베드를 구축했다. 사용자들이 사용하는 실제 환경을 만들기 위해 통신 게이트웨이를 실내에 설치하였고, 캠퍼스의 실내외 다수 위치에서 데이터를 P2P 방식으로 게이트웨이에게 전송한다. 실험에서는 대역폭, 코딩 속도, 확산 계수 및 전송 전력을 변화시켰으며, 성능 검증을 위해 신호대잡음비와 패킷 전송률을 측정하여 결과를 분석한다.
https://doi.org/10.3745/PKIPS.y2017m11a.1254 인용 PDF

검색결과 523건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)