Search | Korea Science

Dimensionality Reduction Based Frequency Domain Audio Signal Compression Method (차원 축소를 이용한 주파수 영역 오디오 신호 압축)

Kim, Min-Je;Beack, Seung-Kwon;Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2008.02a
- /
- pp.179-182
- /
- 2008
본 논문은 오디오 부호화 및 복호화 과정에서, 주파수 영역에서 표현된 오디오 신호를 차원 축소 방법으로 압축하여 표현함으로서 오디오 부호화 효율을 증대시키고자 하는 방식에 관한 것이다. 차원 축소는 행렬을 특정한 조건을 바탕으로 두 개의 행렬의 곱으로 표현하는 방식으로, 특정 행렬로 표현된 데이터를 좀 더 작은 데이터량으로 표현하는 것뿐만 아니라 이 과정에서 데이터에 내재되어 있는 추상적인 정보까지도 함축적으로 얻어낼 수 있기 때문에, 일반적으로 데이터의 압축에 좋은 성능을 보인다. 주파수 영역으로 변환된 신호는 일반적으로 (주파수 밴드의 개수) $\times$ (전체 프레임의 개수)인 행렬로 볼 수 있으며, 이 전체 행렬을 입력으로 간주하고, 차원 축소를 수행하여 신호의 압축 효과를 얻을 수 있다. 그러나 이 경우, 행렬 전체를 입력 신호로 보아야 하기 때문에 실시간 부호화가 불가능하며, 신호 전체 길이만큼의 부호화 지연이 발생한다. 이를 해소하기 위해, 본 논문에서는 특정 개수만큼의 프레임을 묶어서 여러 번의 차원 축소를 순차적으로 수행함으로써 부호화 지연을 최소화하는 방식을 제안한다.
PDF

Developing the framework of level diagnosis for green data center (그린데이터센터의 수준진단 프레임워크 개발)

Ra, Jong-Hei;Lee, Sang-Hak
- Journal of Digital Convergence
- /
- v.9 no.2
- /
- pp.141-152
- /
- 2011
The data center has become an increasingly important part of most business operations. An increasing demand for computation has led to increasing industry energy consumption. Therefore, higher-than-normal rates of energy efficiency have become a core issue in the life cycle of data center. In this paper, we proposed the framework of level diagnosis for green data centre that can be used to diagnose the levels of capability maturity model. This framework contains the 5 key areas such as construction, air-conditioning, electricity, information technology, organization and indicators that can be applied as basic level diagnosis guide for green data center.
https://doi.org/10.14400/JDPM.2011.9.2.141 인용 PDF

Lip and Voice Synchronization Using Visual Attention (시각적 어텐션을 활용한 입술과 목소리의 동기화 연구)

Dongryun Yoon;Hyeonjoong Cho
- The Transactions of the Korea Information Processing Society
- /
- v.13 no.4
- /
- pp.166-173
- /
- 2024
This study explores lip-sync detection, focusing on the synchronization between lip movements and voices in videos. Typically, lip-sync detection techniques involve cropping the facial area of a given video, utilizing the lower half of the cropped box as input for the visual encoder to extract visual features. To enhance the emphasis on the articulatory region of lips for more accurate lip-sync detection, we propose utilizing a pre-trained visual attention-based encoder. The Visual Transformer Pooling (VTP) module is employed as the visual encoder, originally designed for the lip-reading task, predicting the script based solely on visual information without audio. Our experimental results demonstrate that, despite having fewer learning parameters, our proposed method outperforms the latest model, VocaList, on the LRS2 dataset, achieving a lip-sync detection accuracy of 94.5% based on five context frames. Moreover, our approach exhibits an approximately 8% superiority over VocaList in lip-sync detection accuracy, even on an untrained dataset, Acappella.
https://doi.org/10.3745/TKIPS.2024.13.4.166 인용 PDF

Post-processing Method of Point Cloud Extracted Based on Image Matching for Unmanned Aerial Vehicle Image (무인항공기 영상을 위한 영상 매칭 기반 생성 포인트 클라우드의 후처리 방안 연구)

Rhee, Sooahm;Kim, Han-gyeol;Kim, Taejung
- Korean Journal of Remote Sensing
- /
- v.38 no.6_1
- /
- pp.1025-1034
- /
- 2022
In this paper, we propose a post-processing method through interpolation of hole regions that occur when extracting point clouds. When image matching is performed on stereo image data, holes occur due to occlusion and building façade area. This area may become an obstacle to the creation of additional products based on the point cloud in the future, so an effective processing technique is required. First, an initial point cloud is extracted based on the disparity map generated by applying stereo image matching. We transform the point cloud into a grid. Then a hole area is extracted due to occlusion and building façade area. By repeating the process of creating Triangulated Irregular Network (TIN) triangle in the hall area and processing the inner value of the triangle as the minimum height value of the area, it is possible to perform interpolation without awkwardness between the building and the ground surface around the building. A new point cloud is created by adding the location information corresponding to the interpolated area from the grid data as a point. To minimize the addition of unnecessary points during the interpolation process, the interpolated data to an area outside the initial point cloud area was not processed. The RGB brightness value applied to the interpolated point cloud was processed by setting the image with the closest pixel distance to the shooting center among the stereo images used for matching. It was confirmed that the shielded area generated after generating the point cloud of the target area was effectively processed through the proposed technique.
https://doi.org/10.7780/kjrs.2022.38.6.1.4 인용 PDF KSCI HTML

Efficient Speaker Identification based on Robust VQ-PCA (강인한 VQ-PCA에 기반한 효율적인 화자 식별)

Lee Ki-Yong
- Journal of Internet Computing and Services
- /
- v.5 no.3
- /
- pp.57-62
- /
- 2004
In this paper, an efficient speaker identification based on robust vector quantizationprincipal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification, Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region, Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.
PDF

A Study on the Flattening of 3D Mesh data of Shoes (신발 곡면의 3차원 격자 데이터의 평면화에 관한 연구)

Kim Young-Bong;Lee Yun-Jung
- Journal of Game and Entertainment
- /
- v.2 no.1
- /
- pp.64-70
- /
- 2006
CAD system is a very important technology in designing many products which we are using today. This CAD technology have enlarging its area into 3D CAD systems with the development of computer graphics technologies. In particular, such advances have also been realized in special area such as the CAD system for designing shoes. 3D CAD systems for shoes design must provide compatibility between 3D and 2D data because shoes are made using 2D parts of pieces of leather or cloth. Many designers get high performances using 2D shoe CAD systems because they have had long practices with the 2D systems. Therefore, to get the mapping between 2D modeling and 3D modeling is one of very important components in 3D CAD system. In this paper, we proposed a flattening method that convert 3D shoes data to 2D data.
PDF

Imaging Method in Time Domain for Bistatic Forward-Looking Radar in Short Range Application (근거리 Bistatic 전방 관측 레이다의 시간 영역 영상화 기법)

Sun, Sun-Gu;Cho, Byung-Lae;Lee, Jung-Soo;Park, Gyu-Churl;Ha, Jong-Soo;Han, Seung-Hoon
- The Journal of Korean Institute of Electromagnetic Engineering and Science
- /
- v.22 no.11
- /
- pp.1054-1062
- /
- 2011
This study describes the time domain imaging algorithm which can be well applied to short-range UWB(ultra wideband) bistatic radar. In the imaging method of SAR technology, the frequency domain method is well applied to the areas which satisfy far-field condition. However in the near-field environment, the image quality is not good due to phase error. However back-projection method based on time domain is well applied to short-range imaging radar. Meanwhile because its processing time is very long, real time-processing is very difficult. To resolve this problem FFBP(Fast Factorized Back-Projection) was proposed. Using the raw data gathered on field we implemented back-projection and FFBP method. Then image quality and processing time were analyzed using these methods.
https://doi.org/10.5515/KJKIEES.2011.22.11.1054 인용 PDF KSCI

Concept and Application of Groundwater's Platform Concurrency and Digital Twin (지하수의 플랫폼 동시성과 Digital Twin의 개념과 적용)

Doo Houng Choi;Byung-woo Kim;E Jae Kwon;Hwa-young Kim;Cheol Seo Ki
- Proceedings of the Korea Water Resources Association Conference
- /
- 2023.05a
- /
- pp.13-13
- /
- 2023
디지털 기술은 오늘날 플랫폼과 디지털 트윈의 기술도입을 통해 현실 세계를 네트워크와 가상세계와의 연결이 통합되어진 가상 현실 세계의 입문 도약이다. 현실에서 가상현실의 사이의 디지털 전환(digital transformation)에는 디지털 기술과 솔루션을 비즈니스의 모든 영역에 통합하는 것이 포함된다. 이러한 디지털 전환의 핵심은 데이터에 관한 것이며, 데이터를 활용하여 가치를 창출하고 고객경험과 비즈니스 영역을 극대화하는 방식을 제공한다. 최적의 데이터를 제공하기 위한 플랫폼과 가상 현실세계 구현을 위한 디지털 트윈의 상호연계 관한 기본 개념은 데이터 수집, 데이터 분석, 데이터 시각화 및 데이터 보고와 같은 데이터 비즈니스이다. 현장 데이터는 디지털 양식을 통해 수집, 기록, 저장된다. 현장 IoT 기반 데이터(사진 및 비디오 매체 등)는 지속적으로 수집되고 종종 다른 데이터베이스에 저장되지만 지리 공간적 위치에 연결되지 않는다. 모든 디지털 발전을 조화시키고 지하수 데이터에서 더 빠른 이해를 도출하기 위해서는 디지털 트윈이 시작되어야 한다. 단일 지하수플랫폼에서 현장 조건을 시각화하고 실시간 데이터를 스트리밍하며, 과거 3D 데이터와 상호작용하여지질 또는 지화학 데이터를 선택적 사용을 위해 지하수 플랫폼과 디지털 트윈이 연계되어야 한다. 데이터를 디지털 정보모델과 연결하면 디지털 트윈에 생명을 불어넣을 수 있지만 디지털 트윈의 가치를 극대화하려면 여전히 데이터 플랫폼 서비스와 전달 방식을 선택해야 한다. 지하수 플랫폼동시성을 갖는 디지털 트윈은 정적 및 동적 데이터를 저장하는 데이터베이스 또는 크라우드 서비스에서 데이터를 가져오는 API(애플리케이션 프로그래밍 인터레이스), 디지털 트윈을 위한 호스팅 공간, 디지털 대상을 구축하는 소프트웨어, 구성 요소 간 읽기/쓰기를 위한 스크립트, chatGPT 및 API를 활용할 수 있다. 이를 통해 수집된 데이터의 실시간 양방향 통신기술인 지하수 플랫폼 기술을 활용하여 디지털 트윈을 적용하고 완성할 수 있고, 이를 지하수 분야에도 그대로 적용할 수 있다. 지하수 분야의 디지털 트윈 기술의 근간은 지하수 모니터링을 위한 관측장치와 이를 활용한 지하수 플랫폼의 구축 및 양방향 자료전송을 통한 분석 및 예측기술이다. 특히 낙동강과 같이 유역면적이 넓고 유역 내 지자체가 많아 이해관계가 다양하며, 가뭄과 홍수/태풍 등 기후위기에 따른 극한 기상이변가 자주 발생하고, 또한 보 및 하굿둑 개방 등 정부정책 이행에 따른 민원이 다수 발생하는 지역의 경우 하천과 유역에 대한 지하수 플랫폼과 디지털 트윈의 동시성 기술적용 시 지하수 데이터에 대한 고려가 반드시 수반되어야 한다.
PDF

The segmentation system of brain in MRI based on 3-D region growing algorithm (3 차원 영역확장 알고리즘 기반의 MRI 에서의 뇌 영상 분할 시스템)

Lee, Joung-Min;Yun, Hyun-Joo;Kim, Myeong-Hee
- Proceedings of the Korea Information Processing Society Conference
- /
- 2005.05a
- /
- pp.1769-1772
- /
- 2005
본 논문에서는 사용자의 작업을 최소화하고 결과의 정확성을 높일 수 있는 3 차원 영역 분할 알고리즘을 제시하고 있다. 경계선을 강화하고 유사영역을 평탄화하는 SRAD(Speckle Reducing Anisotropic Diffusion) 필터링은 잡음에 의한 3 차원 영역확장의 오류를 줄이고 분할 대상의 경계부분까지 안정적으로 영역을 확장시켜준다. 3 차원 영역확장 방법은 사용자에 의해 입력된 시작점을 기반으로 영역의 유사성과 집합성을 판단하는 평가함수(cost Function)를 계산하여 3 차원으로 영역을 확장시킨다. 이러한 방법을 이용할 때에 보다 효과적으로 3D MRI 데이터에 대한 영상 분할을 수행할 수 있다. 또한 논문에서 제시한 알고리즘의 검증을 위해서 분할 결과에 대한 의료진의 검증을 수행하였다.
PDF

Meta-data Configuration and Wellness Feature Analysis Technique for Wellness Content Recommendation (웰니스 콘텐츠 추천을 위한 메타데이터 구성 및 웰니스 특성 분석 기법)

Hong, Min-Sung;Lee, O-Joun;Lee, Won-Jin;Lee, Jae-Dong
- Journal of the Korea Society of Computer and Information
- /
- v.19 no.8
- /
- pp.83-93
- /
- 2014
Research into recommendation systems for wellness content has focused on representative research on the convergence of wellness and information technology, as interest in wellness has recently increased. But existing research is not suitable because it uses only one or two of the five wellness areas: physical, emotional, social, intellectual, and spiritual. And It cause decline of reliability and satisfaction for recommendation. Thus, a wellness areal feature analysis and integration management technique is needed. In this paper, suggest meta-data configuration and feature analysis technique of content. Also Cosine similarity of wellness areal features of the content was analyzed by applying a wellness areal score calculated in this way and by suggested wellness areal detailed properties and a measurement system to verify the efficiency of this research. This allows the wellness features of contents analyzed, and even will be able to personalized recommendations service for wellness.
https://doi.org/10.9708/jksci.2014.19.8.083 인용 PDF KSCI

Search Result 960, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)