통합 검색 | Korea Science

CC-NUMA 시스템에서의 동기화 기법에 대한 성능 비교 (Performance Comparison of Synchronization Methods for CC-NUMA Systems)

문의선;장성태;전주식
- 한국정보과학회논문지:시스템및이론
- /
- 제27권4호
- /
- pp.394-400
- /
- 2000
동기화는 병렬 프로그램의 수행이 정확하게 이루어지도록 하기 위해 공유 데이타나 프로그램상의 임계구간(critical section)에 대해 배타적인 수행을 보장하는 것을 목적으로 한다. 배타적인 프로그램의 수행은 병렬 프로그램의 병렬성을 제한하므로 효율적인 동기화는 높은 성능의 병렬 프로그램 수행을 위해 반드시 필요하다. 이런 필요에 의해 응용 프로그램이나 시스템의 특성을 이용하여 동기화의 성능을 높이는 기법들이 고안되었다. 본 논문에서는 모의실험을 통해 캐시에 기반을 둔 NUMA(Non-Uniform Memory Access) 시스템에서 나타나는 기존 동기화의 비효율성을 분석하여 제시하고, 이 비효율성을 제거할 수 있는 Freeze&Melt 동기화 기법과의 성능을 비교한다. 제시된 결과를 통해 Test-and-Test&Set 동기화는 동기화 과정에서 발생하는 방송(broadcast) 작업에 의해 비효율이 발생하고, QOLB(Queue-On-Lock-Bit) 동기화는 공유 데이타나 임계구간을 수행할 프로세서의 순서가 미리 정해져 있다는 점에 의해 비효율이 발생함을 확인할 수 있다. 이와 같은 단점들을 극복하고자 제안된 Freeze&Melt 동기화를 이용하여 임계구간을 수행하기까지 대기하는 시간과 임계구간을 수행하는 시간을 줄이고, 클러스터간의 통신량(traffic)을 감소시킴으로써 성능의 향상을 이룰 수 있다.
PDF

Design and Implementation of a Massively Parallel Multithreaded Architecture: DAVRID

Sangho Ha;Kim, Junghwan;Park, Eunha;Yoonhee Hah;Sangyong Han;Daejoon Hwang;Kim, Heunghwan;Seungho Cho
- Journal of Electrical Engineering and information Science
- /
- 제1권2호
- /
- pp.15-26
- /
- 1996
MPAs(Massively Parallel Architectures) should address two fundamental issues for scalability: synchronization and communication latency. Dataflow architecture faces problems of excessive synchronization overhead and inefficient execution of sequential programs while they offer the ability to exploit massive parallelism inherent in programs. In contrast, MPAs based on von Neumann computational model may suffer from inefficient synchronization mechanism and communication latency. DAVRID (DAtaflow/Von Neumann RISC hybrID) is a massively parallel multithreaded architecture which takes advantages of von Neumann and dataflow models. It has good single thread performance as well as tolerates synchronization and communication latency. In this paper, we describe the DAVRID architecture in detail and evaluate its performance through simulation runs over several benchmarks.
PDF

불변 및 가변 종속거리를 갖는 루프의 병렬처리를 위한 새로운 동기화 기법 (A New Synchronization Scheme for Parallel Processing of Loop with Constant and Variable Dependence Distance)

이광형;황종선;박두순
- 전자공학회논문지B
- /
- 제32B권5호
- /
- pp.693-701
- /
- 1995
In most application programs, loops usually comprise most of the computation in a program and are the most important source of parallelism. When loops are executed on multiprocessors, the cross iteration data dependences need to be enforced by synchronization between processors. Existing synchronization schemes have been studied mainly on the loop with constant dependence distance. When these schemes are applied to the loop with variable dependence distance, there exists lots of overhead by the use of unnecessary synchronization variables and execution of unuseful synchronization instructions. Even though there exist various variable synchronization schemes, they have a lot of run-time overhead to compute synchronization information. In this paper, we present a new synchronization scheme, Synch-Free/Synch-Hold for managing synchronization efficiently on the loop with constant and variable dependence distance.
PDF

An Improving Method of Restructuring Parallel Programs for Data Race Detection

Ha, Keum-Sook;Lee, Sung woo;Yoo, Kee-Young
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 ITC-CSCC -2
- /
- pp.715-718
- /
- 2000
Although shared memory parallel programs are designed to be deterministic both in their final results and intermediate states, the races that occur when different processes access a common memory location in an order not guaranteed by synchronization could result in unintended non-deterministic executions of the program. So, Detecting races, particularly first data races, is important for debugging explicit shared memory parallel programs. It is possible that all data races reported by other on-the-fly algorithms would disappear once the first races were removed. To detect races parallel programs with nested loops and inter-thread coordination, it must guarantee the order of synchronization operations in an execution instance. In this paper, we propose an improved restructuring method that guarantee ordering execution instance and preserve the semantics of original program. This method requires O(np) time and (s + up) space, where n is the number of total operations, s is the number of synchronization operations and p is the number of parallelism in the execution. Also, this method makes on-the-fly detection of parallel program with nested loops and inter-thread coordination more easily in space and time complexity.
PDF

완전 중첩 루프에서 병렬처리를 위한 새로운 동기화 기법 (A New Synchronization Scheme for Parallel Processing on Perfectly Nested Do Loops)

이광형;황종선;박두순;김병수
- 전자공학회논문지B
- /
- 제31B권10호
- /
- pp.1-10
- /
- 1994
In most application programs, loops usually contain most of the computation in a program and are the most improtant source of parallelism. When loops are executed on multiprocessors, the cross iteration data dependences need to be enforced by synchronization between processors. In this paper, we propose a new synchronization scheme(Free/Hold) for reducing overgeads occured by synchronization variables in data oriented scheme and delay of time occured by synchronization instruction in statement oriented scheme. The Free/Hold mechanism enforces the correct execution order by inserting synchronization instruction between each instance with data dependence relationship using the RD(Real dependence Distance). We also present an algorithm for removing unnecessary dependences in one-to-many dependences.
PDF

스레드 동기화가 없는 OpenMP 디렉티브 프로그램을 위한 효율적인 경합검증 도구 (An Efficient Tool for Verifying Races in OpenMP Directive Programs without Interthread Synchronization)

하옥균;강문혜;김영주;전용기
- 한국정보과학회논문지:컴퓨팅의 실제 및 레터
- /
- 제14권3호
- /
- pp.301-305
- /
- 2008
OpenMP 디렉티브 프로그램에서 경합은 의도하지 않은 비결정적인 수행결과를 초래하므로 디버깅을 위해서 반드시 탐지되어야 한다. 하지만 이러한 경합을 탐지하는 기존의 도구인 Intel Thread Checker는 경합의 존재를 검증하지 못하며 경합을 탐지하는 비용이 크므로 비실용적이다. 이러한 문제를 해결하기 위해서 본 연구팀은 프로그램의 특성 및 사용자 요구사항의 분석결과를 이용하여 경합을 검증하는 도구를 개발하였으나 스레드 동기화가 없는 모델에서는 최적화되지 못하였다. 본 논문에서는 이러한 선행연구의 결과를 확장하기 위해서 스레드 동기화가 없는 프로그램 모델을 위한 효율적 레이블링 기법과 경합탐지 프로토콜 기법을 적용한다. 합성프로그램을 이용하여 실험한 결과로는 스레드 동기화가 없는 프로그램 모델에서 경합검증 시간이 기존의 도구보다 평균 250배 이상 빠르고, 총 접근사건 수가 동일하면 최대병렬성이 증가하여도 경합검증 시간이 감소함을 보인다.
PDF KSCI

데이터 병렬 프로그램에서 배리어 대기시간의 분석 (Analysis of Barrier Waiting Times in Data Parallel Programs)

정인범
- 산업기술연구
- /
- 제21권A호
- /
- pp.73-80
- /
- 2001
Barrier is widely used for synchronization in parallel programs. Since the process arrived earlier than others should wait at the barrier, the total processor utilization decreases. In this paper, to find the sources of the barrier waiting time, parallel programs are executed on the various grain sizes through execution-driven simulations. In simulation studies, we found that even if approximately equal amounts of work are distributed to each processor, all processes may not arrive at a barrier at the same time. The reasons are that the different numbers of cache misses and instructions within partitioned grains result in the difference in arrival time of processors at the barrier.
PDF

멀티미디어 통신에서 미디어스트림 전송을 위한 적응형 멀티미디어 동기화 기법 (An Adaptive Multimedia Synchronization Scheme for Media Stream Delivery in Multimedia Communication)

이기성
- 정보처리학회논문지C
- /
- 제9C권6호
- /
- pp.953-960
- /
- 2002
실시간 응용 프로그램은 미디어 데이터간에 만족되어야 할 동기화 제약조건(synchronization constraints)을 가지고 있다. 멀티미디어 데이터의 가변적 전송 시연 시간을 흡수하여 피드백 제어와 재생정책에 의한 동기화 기법을 수행한다. 버퍼의 수위가 정상레벨을 유지하는가에 대한 문제는 재생률과 QoS 서비스에 중요한 영향을 주게 된다. 본 논문에서는 버퍼의 수위를 안정상태고 유지하기 위해 피트백을 위한 필터링함수를 적용하고, 재생시간을 적응적으로 적용하여 미디어의 재생 시 끊어짐이 없는 유연한 재생을 한다. 또한 버퍼의 주 미디어인 오디오 프레임의 버퍼가 상위임계레벨에 수위가 존재 할 경우는 적응적으로 재생시간을 줄이고, 하위임계레벨에 버퍼의 수위가 있을 경우는 점차적으로 재생시간을 늘리는 시스템이다.
https://doi.org/10.3745/KIPSTC.2002.9C.6.953 인용 PDF KSCI

미디어 스트림 전송을 위한 적응형 멀티미디어 동기화 기법에 관한 연구 ((A Study on an Adaptive Multimedia Synchronization Scheme for Media Stream Transmission))

지정규
- 한국컴퓨터산업학회논문지
- /
- 제3권9호
- /
- pp.1251-1260
- /
- 2002
실시간 응용 프로그램은 미디어 데이터간에 만족되어야 할 동기화 제약조건을 가지고 있다. 클라이언트의 버퍼를 버퍼레벨기법의 가상버퍼와 네트워크 상태를 포함한 피드백기법을 제안한다. 가변적 전송 지연 시간을 흡수하면서 피드백 제어에 의한 동기화를 수행한다. 버퍼의 수위가 정상레벨을 유지하는가에 대한 문제는 재생률과 QoS 서비스에 중요한 영향을 주게 된다. 본 논문에서는 이러한 문제를 해결하기 위해서 필터링함수, 제어함수, 네트워크 평가 함수 등을 적용하여 멀티미디어 서버에서 전송 시작시점을 결정하고 미디어의 재생 시 끊어짐이 없는 유연한 재생을 처리하게 된다. 주 미디어인 오디오 프레임이 상위임계레벨에 버퍼의 수위가 존재 할 경우는 점차적으로 재생시간을 줄이고, 오디오 프레임이 하위임계레벨에 버퍼의 수위가 있을 경우는 점차적으로 재생시간을 늘리는 시스템을 제안한다.
PDF

대기시간을 이용한 적응형 멀티미디어 동기화 기법 (Adaptive Multimedia Synchronization Using Waiting Time)

이기성;이근왕;이종찬;오해석
- 한국정보처리학회논문지
- /
- 제7권2S호
- /
- pp.649-655
- /
- 2000
Real-time application programs have constraints which need to be met between media-data. These constraints represents the delay time ad quality of service between media-data to be presented. In order to efficiently describe the delay time and quality of service, a new synchronization mechanism is needed. Proposed paper is a dynamic synchronization that minimized the effects of adaptive transmission delay time. That is, the method meets the requirements of synchronization between media-dat by handling dynamically the adaptive waiting time resulted from variations of delay time. In addition, the mechanism has interval adjustment using maximum delay jitter time. This paper decreases the data loss resulted from variation of delay time and from loss time of media-data by means of applying delay jitter in order to deal with synchronization interval adjustment. Plus, the mechanism adaptively manages the waiting time of smoothing buffer, which leads to minimize the gap from the variation of delay time. The proposed paper is suitable to the system which requires the guarantee of high quality of service and mechanism improves quality of services such as decrease of loss rate, increase of playout rate.
PDF

검색결과 43건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)