• Title/Summary/Keyword: multi-camera

An Improved Motion/Disparity Vector Prediction for Multi-view Video Coding (다시점 비디오 부호화를 위한 개선된 움직임/변이 벡터 예측)

  • Lim, Sung-Chang;Lee, Yung-Lyul
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.2
    • /
    • pp.37-48
    • /
    • 2008
  • Generally, a motion vector represents the motion of an object within a single camera view, whereas a disparity vector represents the displacement of the same scene between two cameras located at spatially different positions. Conventional H.264/AVC does not use the disparity vector in motion vector prediction because it was developed for single-view video. However, multi-view video coding that uses an inter-view prediction structure based on H.264/AVC can use the disparity vector instead of the motion vector when the current frame refers to a frame of a different view. Therefore, in this paper, we propose an improved motion/disparity vector prediction method that consists of global disparity vector replacement and extended neighboring block prediction. In experiments comparing the proposed method with the conventional motion vector prediction of H.264/AVC, we achieved average BD (Bjontegaard delta) bitrate savings of 1.07% and 1.32% for global vector search ranges of ${\pm}32$ and ${\pm}64$, respectively, when the search range of the motion vector prediction is set to ${\pm}16$.
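
The abstract does not spell out how global disparity vector replacement works, so the following Python sketch is only one plausible reading. It assumes a simplified H.264/AVC-style component-wise median over exactly three neighboring blocks, and it assumes that when the current block refers to a frame of another view, neighbors holding ordinary motion vectors are replaced by a global disparity vector before the median is taken. The names `Neighbor`, `predict_vector`, and `global_disparity` are hypothetical, and the special cases of the real H.264/AVC predictor are omitted.

```python
from statistics import median
from typing import List, NamedTuple, Tuple

Vector = Tuple[int, int]


class Neighbor(NamedTuple):
    vector: Vector      # (dx, dy) stored for the neighboring block
    is_disparity: bool  # True if that block refers to a frame of another view


def median_vector(vectors: List[Vector]) -> Vector:
    """Component-wise median of three candidate vectors."""
    return (median(v[0] for v in vectors), median(v[1] for v in vectors))


def predict_vector(neighbors: List[Neighbor],
                   current_is_inter_view: bool,
                   global_disparity: Vector) -> Vector:
    """Predict the vector of the current block from three neighbors.

    Assumption (not taken from the paper): when the current block uses
    inter-view prediction, a neighbor that holds a temporal motion vector
    is a poor predictor, so it is replaced by the global disparity vector.
    """
    if len(neighbors) != 3:
        raise ValueError("this sketch expects exactly three neighbors")
    candidates = []
    for n in neighbors:
        if current_is_inter_view and not n.is_disparity:
            candidates.append(global_disparity)
        else:
            candidates.append(n.vector)
    return median_vector(candidates)


# Hypothetical neighbors: left, top, top-right.
neighbors = [
    Neighbor((1, 0), False),
    Neighbor((-30, 1), True),
    Neighbor((2, -1), False),
]
print(predict_vector(neighbors, True, (-28, 0)))   # inter-view reference
print(predict_vector(neighbors, False, (-28, 0)))  # temporal reference
```

With a plain median, the two small motion vectors would dominate the prediction even for an inter-view block; replacing them keeps the predictor close to the large horizontal displacement that is typical of disparity.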

A Study on the Restoration of a Low-Resolution Iris Image into a High-Resolution One Based on Multiple Multi-Layered Perceptrons (다중 다층 퍼셉트론을 이용한 저해상도 홍채 영상의 고해상도 복원 연구)

  • Shin, Kwang-Yong;Kang, Byung-Jun;Park, Kang-Ryoung;Shin, Jae-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.3
    • /
    • pp.438-456
    • /
    • 2010
  • Iris recognition uses the unique iris pattern of a user to identify that person. To achieve good iris recognition performance, it has been reported that the diameter of the iris region should be greater than 200 pixels in the captured iris image. For this reason, previous iris systems used a zoom lens camera, which can increase the size and cost of the system. To overcome these problems, we propose a new method of enhancing the accuracy of iris recognition on low-resolution iris images captured without a zoom lens. This research is novel in the following two ways compared to previous works. First, it is the first to analyze the performance degradation of iris recognition as the image resolution decreases, excluding other factors such as image blurring and occlusion by eyelids and eyelashes. Second, in order to restore a high-resolution iris image from a single low-resolution one, we propose a new method based on multiple multi-layered perceptrons (MLPs) that are trained according to the edge direction of the iris patterns. As a result, the accuracy of iris recognition with the restored images was considerably enhanced. Experimental results showed that when iris images down-sampled by 6% compared to the original image were restored into high-resolution ones using the proposed method, the EER of iris recognition was reduced by as much as 0.133% (1.485% - 1.352%) compared with that obtained using bilinear interpolation.

Fast Multi-View Synthesis Using Duplex Forward Mapping and Parallel Processing (순차적 이중 전방 사상의 병렬 처리를 통한 다중 시점 고속 영상 합성)

  • Choi, Ji-Youn;Ryu, Sae-Woon;Shin, Hong-Chang;Park, Jong-Il
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.11B
    • /
    • pp.1303-1310
    • /
    • 2009
  • Glasses-free 3D displays require multiple images taken from different viewpoints to show a scene. The simplest way to obtain multi-view images is to use as many cameras as the number of views required. In that case, however, synchronizing the cameras and computing and transmitting large amounts of data become critical problems. Thus, generating such a large number of viewpoint images efficiently is emerging as a key technique in 3D video technology. Image-based view synthesis is an algorithm for generating various virtual viewpoint images from a limited number of views and depth maps. In this paper, because a virtual view image can be expressed as a real view image transformed under some depth condition, we propose an algorithm that computes multi-view synthesis from two reference view images and their depth maps by stepwise duplex forward mapping. In addition, because the geometrical relationship between the real view and the virtual views is repetitive, we implement our algorithm in the OpenGL Shading Language, which runs on a programmable graphics processing unit (GPU) that allows parallel processing, to reduce computation time. We demonstrate the effectiveness of our algorithm for fast view synthesis through a variety of experiments with real data.
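
As a rough illustration of forward mapping from two reference views, the Python sketch below warps a single scanline from a left and a right reference into a virtual view and merges the two results. It is a CPU sketch under stated assumptions, not the authors' GLSL implementation: the views are assumed rectified, depth is given as a non-negative per-pixel disparity (larger means nearer), and `alpha` is a hypothetical parameter placing the virtual camera between the left view (0) and the right view (1).

```python
from typing import List, Optional, Sequence


def forward_map_row(colors: Sequence, disparities: Sequence[int],
                    shift: float) -> List[Optional[object]]:
    """Forward-warp one scanline: pixel x lands at x + round(shift * d).

    When several source pixels land on the same target, the one with the
    larger disparity (the nearer one) wins. Unfilled targets stay None.
    """
    width = len(colors)
    out: List[Optional[object]] = [None] * width
    nearest = [-1] * width  # disparity of the pixel currently stored
    for x, (color, d) in enumerate(zip(colors, disparities)):
        tx = x + round(shift * d)
        if 0 <= tx < width and d > nearest[tx]:
            out[tx] = color
            nearest[tx] = d
    return out


def synthesize_row(left, left_disp, right, right_disp, alpha: float):
    """Duplex mapping: warp both references, then fill holes of one
    warped view with pixels from the other."""
    from_left = forward_map_row(left, left_disp, -alpha)
    from_right = forward_map_row(right, right_disp, 1.0 - alpha)
    return [l if l is not None else r for l, r in zip(from_left, from_right)]


# Toy scene: foreground F, G (disparity 2) in front of a background.
left = ["a", "b", "F", "G", "e", "f"]
left_disp = [0, 0, 2, 2, 0, 0]
right = ["F", "G", "c", "d", "e", "f"]
right_disp = [2, 2, 0, 0, 0, 0]
print(synthesize_row(left, left_disp, right, right_disp, 0.5))
```

In the toy scene the middle view has a hole at position 3 when warped from the left reference alone; the right reference supplies the background pixel `d` that the foreground hid from the left camera. Each target view depends only on `alpha`, which is why this kind of mapping parallelizes well.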

Multi-point Dynamic Displacement Measurements of Structures Using Digital Image Correlation Technique (Digital Image Correlation기법을 이용한 구조물의 다중 동적변위응답 측정)

  • Kim, Sung-Wan;Kim, Nam-Sik
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.13 no.3
    • /
    • pp.11-19
    • /
    • 2009
  • Recently, concerns about the maintenance of large structures have increased. In addition, the number of large structures that need to be evaluated for structural safety because of natural disasters and structural deterioration has been rapidly increasing. It is common for the structural characteristics of an older large structure to differ from those at the initial design stage, and changes in dynamic characteristics may result from a reduction in stiffness due to cracks in the materials. The process of deterioration of such structures enables the detection of damaged locations, as well as a quantitative evaluation. One of the typical measuring instruments used for monitoring bridges and buildings is the dynamic measurement system. Conventional dynamic measurement systems require considerable cabling to connect the sensors directly to the DAQ logger. For this reason, a method of measuring structural responses remotely, without mounted sensors, is needed. Among non-contact methods applicable to dynamic response measurement, those using the Doppler effect of a laser or GPS are commonly used. However, such methods cannot be generally applied to bridge structures because of their cost and inaccuracy. Alternatively, a method using visual images can be economical as well as feasible for measuring the vibration signals of inaccessible bridge structures and extracting their dynamic characteristics. Many studies have been conducted using camera images instead of conventional mounted sensors. However, these studies have focused on measuring the displacement response by image processing after recording the position of a target mounted on the structure, so the number of measurement targets may be limited. Therefore, in this study, a model experiment was carried out to verify an algorithm for measuring multi-point displacement responses using a DIC (Digital Image Correlation) technique.
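
The core of a DIC measurement can be sketched as subset matching: a small pixel subset around each measurement point in a reference frame is searched for in the current frame, and the offset with the highest correlation is taken as the displacement. The Python sketch below is a minimal integer-pixel version using zero-normalized cross-correlation (ZNCC); the abstract does not state which correlation criterion or sub-pixel scheme the authors used, so those choices, and the function names, are assumptions.

```python
import math
from typing import List, Tuple

Image = List[List[float]]


def subset(img: Image, top: int, left: int, size: int) -> List[float]:
    """Flatten the size x size pixel subset whose corner is (top, left)."""
    return [img[r][c] for r in range(top, top + size)
            for c in range(left, left + size)]


def zncc(a: List[float], b: List[float]) -> float:
    """Zero-normalized cross-correlation of two equally sized subsets."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    denom = math.sqrt(sum(v * v for v in da) * sum(v * v for v in db))
    if denom == 0.0:
        return 0.0  # flat subset: correlation is undefined, treat as no match
    return sum(x * y for x, y in zip(da, db)) / denom


def track_point(ref: Image, cur: Image, top: int, left: int,
                size: int, search: int) -> Tuple[int, int]:
    """Return the (dy, dx) displacement that maximizes ZNCC."""
    template = subset(ref, top, left, size)
    rows, cols = len(cur), len(cur[0])
    best_score, best = -2.0, (0, 0)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            t, l = top + dy, left + dx
            if t < 0 or l < 0 or t + size > rows or l + size > cols:
                continue  # candidate subset would leave the image
            score = zncc(template, subset(cur, t, l, size))
            if score > best_score:
                best_score, best = score, (dy, dx)
    return best


def shift_image(img: Image, dy: int, dx: int) -> Image:
    """Test helper: translate an image by (dy, dx), padding with zeros."""
    rows, cols = len(img), len(img[0])
    return [[img[r - dy][c - dx]
             if 0 <= r - dy < rows and 0 <= c - dx < cols else 0
             for c in range(cols)] for r in range(rows)]
```

Calling `track_point` once per measurement point on every frame yields a displacement time history for each point, which is what makes the approach multi-point without mounting a physical target at each location. A practical system would add sub-pixel interpolation and a pixel-to-length scale factor.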

Discovery of the Dmitri Donskoi ship near Ulleung Island (East Sea of Korea), using geophysical surveys (물리탐사기술을 이용한 침몰선 Dmitri Donskoi호 탐사)

  • Yoo, Hai-Soo;Kim, Su-Jeong;Park, Dong-Won
    • Geophysics and Geophysical Exploration
    • /
    • v.8 no.1
    • /
    • pp.104-111
    • /
    • 2005
  • Dmitri Donskoi, the Russian cruiser launched in 1883, is known to have sunk near Ulleung Island (East Sea, Korea) on May 29, 1905, while it was participating in the Russo-Japanese War. In order to find this ship, information about its possible location was obtained from Russian and Japanese maritime historical records. The supposed location of the ship was identified, and we conducted a five-year geophysical survey from 1999 to 2003. A reconnaissance three-dimensional topographic survey of the sea floor was carried out using a multi-beam echo sounder, a marine magnetometer, and side-scan sonar. An anomalous body identified through the initial reconnaissance survey was then investigated in a detailed survey using a remotely operated vehicle, a deep-sea camera, and the mini-submarine Pathfinder. Interpretation of the acquired data showed that the ship is hanging on the side of a channel, at the bottom of the sea 400 m below sea level. The location is about 2 km from Port Jeodong, Ulleung Island. We discovered 152 mm naval guns and other war materiel still attached to the hull of the ship. In addition, the remnants of the steering gear and other machinery that were burnt during the final action were found near the hull. Strong magnetic fields, resulting from the presence of volcanic rocks in the survey area, affected the resolution of the magnetic data gathered; as a result, we could not locate the ship reliably using the magnetic method. Severe sea floor topography in the gully around the hull gave rise to diffuse reflections in the side-scan sonar data, and this prevented us from identifying the anomalous body with the side-scan sonar technique. However, the sea-floor image obtained from the multi-beam echo sounder was very useful in verifying the location of the ship.

Contactless User Identification System using Multi-channel Palm Images Facilitated by Triple Attention U-Net and CNN Classifier Ensemble Models

  • Kim, Inki;Kim, Beomjun;Woo, Sunghee;Gwak, Jeonghwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.3
    • /
    • pp.33-43
    • /
    • 2022
  • In this paper, we propose an ensemble model that uses multi-channel palm images with attention U-Net models and pretrained convolutional neural networks (CNNs) to establish a contactless palm-based user identification system using conventional inexpensive camera sensors. Attention U-Net models are used to extract the areas of interest, including hands (i.e., with fingers), palms (i.e., without fingers), and palm lines, which are combined to generate three channels that are fed into the ensemble classifier. Then, the proposed palm information-based user identification system predicts the class using a classifier ensemble of the three best-performing pre-trained CNN models. The proposed model achieved a classification accuracy, precision, recall, and F1-score of 98.60%, 98.61%, 98.61%, and 98.61%, respectively, which indicates that it is effective even with inexpensive image sensors. We believe that, under the circumstances of the COVID-19 pandemic, the proposed palm-based contactless user identification system can be an alternative, with high safety and reliability, to the currently prevalent contact-based systems.

A Study on the Digital Holographic Image Acquisition Method using Chroma Key Composition (크로마키 합성을 이용한 디지털 홀로그래피 이미지 획득 방법 연구)

  • Kim, Ho-sik;Kwon, Soon-chul;Lee, Seung-hyun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.3
    • /
    • pp.313-321
    • /
    • 2022
  • As 5G develops, people are becoming interested in immersive content. Some predict that immersive content such as holograms, which used to be possible only in movies, may be realized in real life. Holograms, which have been studied for a long time since Dennis Gabor published the basic theory in 1948, are constantly developing in new directions with digital technology. They are evolving from the traditional optical hologram, which is produced by recording the interference pattern of light, to the computer-generated hologram (CGH) and the digital hologram printer. In order to produce a hologram using a digital hologram printer, holographic element (hogel) images must first be created from multi-view images. There are two ways to acquire a series of multi-view images: directly photographing an actual object, or modeling an object with a 3D graphics production tool and rendering it through the motion of a virtual camera. In this paper, we propose a new image acquisition method that produces multi-view images using chroma key composition, a visual effects (VFX) technique. We shoot an actual object against a green screen, present the overall workflow for compositing it with 3D computer graphics (CG), and explain the role of each step. We expect that applying all or part of the proposed workflow will be helpful in future research on new image acquisition methods.

Preliminary Study on All-in-JPEG with Multi-Content Storage Format extending JPEG (JPEG를 확장한 멀티 콘텐츠 저장 포맷 All-in-JPEG에 관한 예비 연구)

  • Yu-Jin Kim;Kyung-Mi Kim;Song-Yeon Yoo;Chae-Won Park;Kitae Hwang;In-Hwan Jung;Jae-Moon Lee
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.5
    • /
    • pp.183-189
    • /
    • 2023
  • This paper proposes All-in-JPEG, a new format that extends JPEG so that a single file can hold not only multiple photos but also other media such as audio and text. All-in-JPEG adds images, audio, and text to an existing JPEG file and stores meta information in the APP3 segment of JPEG. With All-in-JPEG, smartphone users can save many pictures taken in a burst in one file, which also makes them convenient to share with others. In addition, users can create a live photo, for example by saving a short audio clip recorded when the photo was taken or by making part of the photo move. It can also be used for various applications, such as a photo diary app that stores images, voice, and diary text in a single All-in-JPEG file. In this paper, we developed an app that creates and edits All-in-JPEG files, a photo diary app, and a magic photo function, and verified the feasibility of All-in-JPEG through them.
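
The container idea can be illustrated with standard JPEG marker syntax: an APPn segment is a two-byte marker (0xFFE3 for APP3) followed by a two-byte big-endian length that counts itself, then the payload. The Python sketch below inserts an APP3 segment carrying an index and appends the extra media after the original JPEG data. The abstract does not describe the actual All-in-JPEG metadata layout, so the JSON index, the placement of the extra data at the end of the file, and the function names `pack_container` and `read_extras` are all hypothetical.

```python
import json
import struct
from typing import Dict

SOI = b"\xff\xd8"
APP0 = b"\xff\xe0"
APP1 = b"\xff\xe1"
APP3 = b"\xff\xe3"


def _skip_segments(data: bytes, markers) -> int:
    """Return the offset just past any leading segments with these markers."""
    pos = 2  # skip SOI
    while data[pos:pos + 2] in markers:
        (length,) = struct.unpack(">H", data[pos + 2:pos + 4])
        pos += 2 + length  # marker (2 bytes) + length field and payload
    return pos


def pack_container(jpeg: bytes, extras: Dict[str, bytes]) -> bytes:
    """Insert an APP3 index (hypothetical JSON layout) and append extras."""
    if not jpeg.startswith(SOI):
        raise ValueError("not a JPEG stream")
    index, blob = [], b""
    for name, data in extras.items():
        index.append({"name": name, "offset": len(blob), "size": len(data)})
        blob += data
    payload = json.dumps(index).encode("utf-8")
    if len(payload) > 65533:
        raise ValueError("index too large for one APP3 segment")
    segment = APP3 + struct.pack(">H", len(payload) + 2) + payload
    # Keep existing APP0/APP1 (JFIF/Exif) segments first.
    pos = _skip_segments(jpeg, (APP0, APP1))
    return jpeg[:pos] + segment + jpeg[pos:] + blob


def read_extras(container: bytes) -> Dict[str, bytes]:
    """Recover the appended media using the APP3 index."""
    pos = _skip_segments(container, (APP0, APP1))
    if container[:2] != SOI or container[pos:pos + 2] != APP3:
        raise ValueError("no APP3 index found")
    (length,) = struct.unpack(">H", container[pos + 2:pos + 4])
    index = json.loads(container[pos + 4:pos + 2 + length].decode("utf-8"))
    total = sum(entry["size"] for entry in index)
    blob = container[len(container) - total:]
    return {e["name"]: blob[e["offset"]:e["offset"] + e["size"]]
            for e in index}
```

Because decoders skip APPn segments they do not recognize, a file built this way should still open as an ordinary JPEG in viewers that are unaware of the extra content, which matches the stated goal of extending rather than replacing the format.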

The Individual Discrimination Location Tracking Technology for Multimodal Interaction at the Exhibition (전시 공간에서 다중 인터랙션을 위한 개인식별 위치 측위 기술 연구)

  • Jung, Hyun-Chul;Kim, Nam-Jin;Choi, Lee-Kwon
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.19-28
    • /
    • 2012
  • After the internet era, we are moving toward a ubiquitous society. People are now interested in multimodal interaction technology, which enables an audience to interact naturally with the computing environment at exhibitions such as galleries, museums, and parks. There are also attempts to provide additional services based on the location information of the audience, or to improve and deploy interaction between subjects and audience by analyzing people's usage patterns. In order to provide a multimodal interaction service to the audience at an exhibition, it is important to distinguish individuals and trace their locations and routes. For outdoor location tracking, GPS is widely used. GPS can obtain the real-time location of fast-moving subjects, so it is one of the important technologies in fields requiring location tracking services. However, because GPS relies on satellites, it cannot be used indoors, where the satellite signal cannot be received. For this reason, indoor location tracking is being studied using very short range communication technologies such as ZigBee, UWB, and RFID, as well as mobile communication networks and wireless LAN. However, these technologies have shortcomings: the audience needs to carry an additional sensor device, and tracking becomes more difficult and expensive as the density of the target area increases. In addition, a typical exhibition environment contains many obstacles to the network, which degrades system performance. Above all, the biggest problem is that interaction methods using devices based on these older technologies cannot provide a natural service to users. Moreover, because such systems rely on sensor recognition, every user must be equipped with a device, which limits the number of users who can use the system simultaneously. To make up for these shortcomings, in this study we suggest a technology that obtains the exact location information of users through location mapping that combines the Wi-Fi of smartphones with 3D cameras. We used the signal amplitude of wireless LAN access points (APs) to develop an indoor location tracking system at a lower price. An AP is cheaper than the devices used in other tracking techniques, and once the software is installed on a user's mobile device, that device can be used directly as the tracking device. We used the Microsoft Kinect sensor as the 3D camera. Kinect is equipped with functions for discriminating depth and human information inside the shooting area, so it is suitable for extracting the user's body, vector, and acceleration information at low cost. We confirm the location of the audience using the cell ID obtained from the Wi-Fi signal. By using smartphones as the basic device for the location service, we remove the need for an additional tagging device and provide an environment in which multiple users can receive the interaction service simultaneously. The 3D cameras located in each cell area obtain the exact location and status information of the users. The 3D cameras are connected to the Camera Client, which calculates the mapping information aligned to each cell and obtains the exact location, status, and pattern information of the audience. The location mapping technique of the Camera Client decreases the error rate of the indoor location service, increases the accuracy of individual discrimination in the area through discrimination based on body information, and establishes the foundation of multimodal interaction technology at exhibitions. The calculated data and information enable users to receive the appropriate interaction service through the main server.

An Optimization Method of Measuring Heart Position in Dynamic Myocardial Perfusion SPECT with a CZT-based camera (동적 심근관류 SPECT에서 심장의 위치 측정방법에 대한 고찰)

  • Seong, Ji Hye;Lee, Dong Hun;Kim, Eun Hye;Jung, Woo Young
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.23 no.1
    • /
    • pp.75-79
    • /
    • 2019
  • Purpose: A cadmium-zinc-telluride (CZT) camera with a semiconductor detector is capable of dynamic myocardial perfusion SPECT for coronary flow reserve (CFR). Image acquisition with the heart positioned within 2 cm of the center of the quality field of view (QFOV) is recommended because the CZT detector is based on focused multi-pinhole collimators and uses a stationary gantry without rotation. The aim of this study was to investigate the optimal method for measuring the position of the heart within the center of the QFOV when performing dynamic myocardial perfusion SPECT with the Discovery NM 530c (D530c) camera. Materials and Methods: From June to September 2018, 45 patients underwent dynamic myocardial perfusion SPECT with the D530c. For accurate heart positioning, the patient's heart was scanned with a mobile ultrasound device, and a mark was made at the top of the probe where the mitral valve (MV) was visible in the parasternal long-axis (PLAX) view. In the dynamic stress scan, the marked point on the patient's body was matched with the reference point indicated on the CZT detector. In the rest scan, the heart was positioned at the center of the QFOV. The coordinates of the dynamic stress and rest scans were compared statistically. Results: The couch position coordinates for the dynamic stress scan, positioned using mobile ultrasound, and those for the rest scan were recorded and compared. There were no statistically significant differences in the coordinates of Table in & out, Table up & down, and Detector in & out (P > 0.05). The differences in distance between the two groups were $0.25{\pm}1.00$, $0.24{\pm}0.96$, and $0.25{\pm}0.82$ cm, respectively, with no difference greater than 2 cm in any category. Conclusion: The position of the heart determined using mobile ultrasound did not differ significantly from the center of the QFOV. Therefore, the use of mobile ultrasound in dynamic stress will help to select the correct position of the heart, which will be effective in clinical diagnosis by improving image quality and minimizing the patient's exposure to radiation.