Search | Korea Science

Parallel Speech Recognition on Distributed Memory Multiprocessors (분산 메모리 다중 프로세서 상에서의 병렬 음성인식)

윤지현;홍성태;정상화;김형순
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10a
- /
- pp.747-749
- /
- 1998
본 논문에서는 음성과 자연언어의 통합처리를 위한 효과적인 병렬 계산 모델을 제안한다. 음소모델은 continuous HMM에 기반을 둔 문맥종속형 음소를 사용하며, 언어모델은 knowledge-based approach를 사용한다. 또한 계층구조의 지식베이스상에서 다수의 가설을 처리하기 위해 memory-based parsing기술을 사용하였다. 본 연구의 병렬 음성인식 알고리즘은 분산메모리 MIMD 구조의 다중 Transputer 시스템을 이용하여 구현되었다. 실험을 통하여 음성인식 과정에서 발생하는 speech-specific problem의 해를 제공하고 음성인식 시스템의 병렬화를 통하여 실시간 음성인식의 가능성을 보여준다.
PDF

Parallel Speedup of NTGST on SIMD type Multiprocessor (SIMD 구조의 다중 프로세서를 이용한 NTGST의 병렬고속화)

김복만;서경석;김종화;최흥문
- Proceedings of the IEEK Conference
- /
- 2001.06d
- /
- pp.127-130
- /
- 2001
본 논문에서는 SIMD (Single Instruction stream and Multiple Data stream)형 병렬 구조의 다중 프로세서를 이용하여 NTGST (noise-tolerant generalized symmetry transform)를 병렬 고속화하였다. 먼저 NTGST의 화소 및 영상 영역간의 계산 독립성을 이용하여 영상을 분할하여 P개의 프로세서에 할당하고, 이들 각각을 N개의 데이터를 한번에 처리하는 SIMD 구조로 병렬화하여 NP에 비례하는 속도 향상을 얻었다. 실험에서 MMX 기술의 펜티엄 Ⅲ 프로세서를 2개 사용하여 제안한 알고리즘이 기존의 NTGST 보다 8배 가까이 고속으로 처리됨을 확인하였다.
PDF

Task Allocation Methods and Performance Comparison for Parallel Computation Based on Multi-Agent System (다중 에이전트 시스템 기반의 병렬 계산을 위한 작업 할당 기법과 성능비교)

김경하;김영학;오길호
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.10c
- /
- pp.502-504
- /
- 2000
최근 높은 컴퓨팅 파워를 요구하는 응용문제 처리를 위해 고 비용의 슈퍼컴퓨터 대신 인터넷상에 분산된 다수의 일반 컴퓨터들을 이용하는 병렬처리에 관한 연구가 진행되고 있다. 본 연구에서는 다중 에이전트 시스템을 기반으로 하여 이질적인 성능을 갖는 컴퓨터들을 병렬 컴퓨팅 환경으로 구성하고 각 호스트의 성능측정 결과에 따라 효율적으로 작업을 분산하는 기법을 제안한다. 또한 본 연구에서 제안한 방법을 다중 에이전트 시스템인 IBM의 AgletsTM을 사용하여 실험적으로 성능을 평가하고, 기존 연구와 비교한다.
PDF

Parallel Computation for Extended Edit Distances Using the Shared Memory on GPU (GPU의 공유메모리를 활용한 확장편집거리 병렬계산)

Kim, Youngho;Na, Joong Chae;Sim, Jeong Seop
- KIPS Transactions on Computer and Communication Systems
- /
- v.4 no.7
- /
- pp.213-218
- /
- 2015
Given two strings X and Y (|X|=m, |Y|=n) over an alphabet ${\Sigma}$, the extended edit distance between X and Y can be computed using dynamic programming in O(mn) time and space. Recently, a parallel algorithm that takes O(m+n) time and O(mn) space using m threads to compute the extended edit distance between X and Y was presented. In this paper, we present an improved parallel algorithm using the shared memory on GPU. The experimental results show that our parallel algorithm runs about 19~25 times faster than the previous parallel algorithm.
https://doi.org/10.3745/KTCCS.2015.4.7.213 인용 PDF KSCI

Parallel Computing of Large Scale FE Model based on Explicit Lagrangian FEM (외연 Lagrangian 유한요소법 기반의 대규모 유한요소 모델 병렬처리)

백승훈;김승조;이민형
- Journal of the Korean Society for Aeronautical & Space Sciences
- /
- v.34 no.8
- /
- pp.33-40
- /
- 2006
A parallel computing strategy for finite element(FE) processing is described and implemented in nonlinear explicit FE code and its parallel performances are evaluated. A self-made linux-cluster supercomputer with 520 CPUs is used as a bench mark test bed. It is observed that speed-up is increased almost idealy even up to 256 CPUs for a large scale model. A communication over head and its effect on the parallel performance is also examined. Parallel performance is compare with the commercial code and developed code shows superior performance as the number of CPUs used are increased.
https://doi.org/10.5139/JKSAS.2006.34.8.033 인용 PDF KSCI

SOAP-based Distributed Processing Scheduling Framework: pyBubble (SOAP기반의 분산처리 스케줄링 프레임웍: pyBubble)

;;;R.S.Ramakrishna
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.04a
- /
- pp.742-744
- /
- 2004
본 논문은 웹 서비스 프로토콜인 SOAP기반의 병렬처리 프레임웍인 pyBubble의 설계와 구현에 관한 것이다. 그리드 어플리케이션 프로그래밍의 어려움을 덜기 위해 그리드 미들웨어들로부터의 복잡성에 투명성을 제공하는 것을 본 논문의 목표로 한다. 이는 RPC스타일의 프로그래밍 인터페이스를 지원하면서 파이썬 스크립트 언어의 이식성과 확장성을 통해 기존 병렬처리 어플리케이션의 그리드화와 다양한 자원 스케줄링을 연구 할 수 있도록 하는 스케줄링 프레임웍이 주요 기능적 요소이다. 병렬처리를 위해 비동기 SOAP과 이를 이용한 Task-Farming과 DAG기반의 스케줄링의 지원함으로써 고성능의 그리드 계산환경을 제공하고자 한다.
PDF

An Iterative Algorithm for the Bottom Up Computation of the Data Cube using MapReduce (맵리듀스를 이용한 데이터 큐브의 상향식 계산을 위한 반복적 알고리즘)

Lee, Suan;Jo, Sunhwa;Kim, Jinho
- Journal of Information Technology and Architecture
- /
- v.9 no.4
- /
- pp.455-464
- /
- 2012
Due to the recent data explosion, methods which can meet the requirement of large data analysis has been studying. This paper proposes MRIterativeBUC algorithm which enables efficient computation of large data cube by distributed parallel processing with MapReduce framework. MRIterativeBUC algorithm is developed for efficient iterative operation of the BUC method with MapReduce, and overcomes the limitations about the storage size and processing ability caused by large data cube computation. It employs the idea from the iceberg cube which computes only the interesting aspect of analysts and the distributed parallel process of cube computation by partitioning and sorting. Thus, it reduces data emission so that it can reduce network overload, processing amount on each node, and eventually the cube computation cost. The bottom-up cube computation and iterative algorithm using MapReduce, proposed in this paper, can be expanded in various way, and will make full use of many applications.
KSCI

인라인 타입 마그네트론 스퍼터링 장치에서 증착 두께 분포 병렬 계산

Ju, Jeong-Hun
- Proceedings of the Korean Vacuum Society Conference
- /
- 2014.02a
- /
- pp.225-225
- /
- 2014
일반적인 Cosine law를 이용한 증착 두께의 분포에 대한 계산은 적분의 형태로 이루어져있다. LCD 8G 급의 경우 마그네트론 스퍼터링 타겟의 크기가 깊이 3 m, 폭 25 cm정도인데 대략 6~8개를 설치하여 공정 시간을 줄이고 있다. 이 때 한 쪽 방향으로 이동하는 기판이 타겟 표면과 이루는 각도는 아주 작은 각에서 수직으로 다시 음의 각도로 변화한다. 이 때 발생하는 박막의 미세 조직 변화는 박막 특성에 많은 영향을 준다. 이에 대한 연구를 위한 1단계로 타겟 표면과 기판 표면을 모두 미소 면적소로 구분하고 각각의 면적소 간에 이루어지는 증착 원자의 비행을 충돌이 없다는 가정하에 direct flux 알고리즘으로 처리하였다. 이 때 소요되는 계산 시간은 매우 길어서 single core CPU에서 serial job으로 처리하는 경우 여러 시간이 소요된다. 이에 대한 대안으로 OpenMP를 이용한 작업의 병렬화를 시도하였다. 4 core machine에서 최대 96%의 병렬 효율을 달성하였다.
PDF

A Study on the CAM Designed by Adopting Best-Match Method using Parallel Processing Architecture (병렬 처리 구조를 이용한 최적 정합 방식 CAM 설계에 관한 연구)

김상복;박노경;차균현
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.19 no.6
- /
- pp.1056-1063
- /
- 1994
In this paper a content addressable memory (CAM) is designed by adopting best-match method. It has a single processing element(PE) architecture with high computational efficiency and throughput. It is composed of three main functional blocks(input MUX, best-match CAM, control part). It support fully parallel processing. Logic simulation is completed by using QUICKSIM, Circuit simulation is performanced by using HSPICE. Its layout is based on the ETRI 3 m n-well process design rules. Its maximum operating frequency is 20 MHz.
PDF

Collective I/O with Process grouping (프로세스 그룹화를 이용한 집합 I/O)

차광호;홍정우;이지수
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.10a
- /
- pp.442-444
- /
- 2003
병렬 처리를 요구하는 계산 과학 분야의 문제들 중에는 대용량 데이터 처리를 필요로 하는 경우가 많다. 그러나 기존의 파일 시스템을 그대로 병렬처리 환경에 적용하기에는 많은 문제가 따른다. 이를 위해서 병렬처리를 지원하는 파일 시스템에 대한 연구와 개발이 진행되어 오고 있다. 이와 같은 연구 중 하나인 집합 I/O(Collective I/O)를 본 논문에서 다루고자 한다. 이 집합 I/O는 여러 프로세스의 파일 I/O 요청을 효과적으로 처리하는 방법으로 MPl2의 MPI-10에도 포함되어 있다. 본 논문에서는 어플리케이션 프로그램 측면에서 MPI-10의 집합 I/O를 효과적으로 사용하기 위한 방안을 제시하며, 보편적으로 사용되는 NFS를 이용한 클러스터 시스템에서의 실험 결과를 분석한다.
PDF

Search Result 443, Processing Time 0.036 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)