통합 검색 | Korea Science

메모리 지연을 감추는 기법들 (Memory Latency Hiding Techniques)

기안도
- 전자통신동향분석
- /
- 제13권3호통권51호
- /
- pp.61-70
- /
- 1998
The obvious way to make a computer system more powerful is to make the processor as fast as possible. Furthermore, adopting a large number of such fast processors would be the next step. This multiprocessor system could be useful only if it distributes workload uniformly and if its processors are fully utilized. To achieve a higher processor utilization, memory access latency must be reduced as much as possible and even more the remaining latency must be hidden. The actual latency can be reduced by using fast logic and the effective latency can be reduced by using cache. This article discusses what the memory latency problem is, how serious it is by presenting analytical and simulation results, and existing techniques for coping with it; such as write-buffer, relaxed consistency model, multi-threading, data locality optimization, data forwarding, and data prefetching.
https://doi.org/10.22648/ETRI.1998.J.130305 인용 PDF

하이퍼미디어 데이터를 위한 차별화된 서비스 연구 (Differentiated Service for Hypermedia data on the Web)

이윤정;김태윤
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2001년도 추계학술발표논문집 (하)
- /
- pp.1481-1484
- /
- 2001
Most implementations of HTTP servers do not distinguish among requests for hypermedia data from different clients. Commercialization of Web site is becoming increasingly common. Therefore providing quality of service with members paying to the site is often an important issue for the hosts. For some uses, such as web prefetching or multiple priority schemes, different levels of service are desirable. We propose server-side TCP connection management mechanisms to provide two different levels of Web service, high and regular levels by setting different timeout for inactive connection. Therefore this mechanism can effectively provide different service classes even in the absence of operating system and network support.
PDF

Scheduling of Concurrent Transactions in Broadcasting Environment

Al-Qerem, Ahmad;Hamarsheh, Ala;Al-Lahham, Yaser A.;Eleyat, Mujahed
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제12권4호
- /
- pp.1655-1673
- /
- 2018
Mobile computing environment is subject to the constraints of bounded network bandwidth, frequently encountered disconnections, insufficient battery power, and system asymmetry. To meet these constraints and to gain high scalability, data broadcasting has been proposed on data transmission techniques. However, updates made to the database in any broadcast cycle are deferred to the next cycle in order to appear to mobile clients with lower data currency. The main goal of this paper is to enhance the transaction performance processing and database currency. The main approach involves decomposing the main broadcast cycle into a number of sub-cycles, where data items are broadcasted as they were originally sequenced in the main cycle while appearing in the most current versions. A concurrency control method AOCCRBSC is proposed to cope well with the cycle decomposition. The proposed method exploits predeclaration and adapts the AOCCRB method by customizing prefetching, back-off, and partial backward and forward validation techniques. As a result, more than one of the conflicting transactions is allowed to commit at the server in the same broadcast cycle which empowers the processing of both update and read-only transactions and improves data currency.
https://doi.org/10.3837/tiis.2018.04.014 인용 PDF KSCI

그래프 프로세싱을 위한 GRU 기반 프리페칭 (Gated Recurrent Unit based Prefetching for Graph Processing)

시바니 자드하브;파만 울라;나정은;윤수경
- 반도체디스플레이기술학회지
- /
- 제22권2호
- /
- pp.6-10
- /
- 2023
High-potential data can be predicted and stored in the cache to prevent cache misses, thus reducing the processor's request and wait times. As a result, the processor can work non-stop, hiding memory latency. By utilizing the temporal/spatial locality of memory access, the prefetcher introduced to improve the performance of these computers predicts the following memory address will be accessed. We propose a prefetcher that applies the GRU model, which is advantageous for handling time series data. Display the currently accessed address in binary and use it as training data to train the Gated Recurrent Unit model based on the difference (delta) between consecutive memory accesses. Finally, using a GRU model with learned memory access patterns, the proposed data prefetcher predicts the memory address to be accessed next. We have compared the model with the multi-layer perceptron, but our prefetcher showed better results than the Multi-Layer Perceptron.
PDF

선인출 전용 캐시를 이용한 적극적 선인출 필터링 기법 (An Active Prefetch Filtering Schemes using Exclusive Prefetch Cache)

전영숙;김석일;전중남
- 정보처리학회논문지A
- /
- 제12A권1호
- /
- pp.41-52
- /
- 2005
캐시 미스에 의한 메모리 참조 명령어는 응용 프로그램의 고속 수행을 방해하는 주 원인이다. 캐시 선인출 기법은 캐시 미스에 따른 지연시간을 줄이는 효과적인 방법이다. 그러나 너무 적극적으로 선인출을 할 경우에는 캐시 오염을 유발시켜 오히려 선인출에 의한 장점을 상쇄시킨다. 본 연구에서는 선인출로 인한 캐시의 오염을 줄이기 위해 필터 테이블을 참조하여 선인출 명령을 수행한 지의 여부를 동적으로 판단하는 적극적 선인출 필터링 기법을 제시한다. 정교한 필터링을 위하여 저장되어 있는 불필요한 선인출 데이터의 주소를 직접 사용하는 축출 주소 참조 방시을 제안하였다. 또한 동적 필터링의 정확성을 늘이기 위하여 선인출 데이터의 캐시로부터의 출입을 증가 시키도록 작은 크기의 선인출 전용 캐시를 사용하였다. 선인출 전용 캐시의 사용으로 인해 유용한 요구 데이터들이 선인출 데이터들로 인하여 밀려나가지 않게 되었고, 또한 직접 주소 참조 방식을 통하여 필터링 정확성이 증가됨으로써 선인출 전용 캐시 내에도 유용한 선인출 데이터들만이 존재하게 되어 캐시 미스 수가 크게 감소되었다. 일반적으로 많이 사용되는 일반 벤치마크 프로그램과 멀티미디어 벤치마크 프로그램들에 대하여 실험한 결과, 제안된 방식의 캐시 미스율은 $13.3{\%}$ 감소하였고,, 기존 방식에 비해 우수한 필터링 정확도를 가짐을 보였다.
https://doi.org/10.3745/KIPSTA.2005.12A.1.041 인용 PDF KSCI

멀티미디어 데이터를 위한 피어-투-피어 전송모델 (Peer-to-Peer Transfer Scheme for Multimedia Partial Stream using Client Initiated with Prefetching)

신광식;윤완오;정진하;최상방
- 한국통신학회논문지
- /
- 제29권7B호
- /
- pp.598-612
- /
- 2004
DSL 이나 케이블 모뎀 등과 같은 광 대역 접속기술이 보급됨에 따라 멀티미디어 기반 컨텐츠를 선호하는 사용자 수가 증가되었다. 반면 서버 네트워크 자원은 늘어나는 사용자의 요청 수에 맞춰 끊임없이 확충 할 수는 없다. 그러므로 서버 네트워크 자원을 얼마나 효율적으로 사용하는가는 서비스의 품질을 결정짓는 중요한 요인이 된다. 본 논문은 클라이언트 자원을 이용하여 서버자원이용을 줄여주는 CIWP (Client Initiated With Prefetching) 모델을 기본으로 부분 스트림에 대해서는 P2P 모델을 적용함으로써 멀티캐스트 채널 이외의 추가적인 서버자원소비를 줄이는 새로운 모델을 제시하였다. 특히 threshold based multicast 알고리즘을 적용하여 모든 사용자는 서버로부터 대부분의 데이터를 전송받고 일부 앞부분의 데이터에 대해서만 다른 사용자를 통하여 받게 된다. 또한 사용자간의 데이터 전송은 선행 사용자의 서비스 시간을 초과하지 않도록 한다. 사용자간의 Peer-to-Peer전송은 데이터전송 주체간의 네트워크 자원에 제한을 덜 받도록 ISP 단위로 그룹으로 나누어 그룹 내에서만 데이터 전송을 허용한다. 해석적 방법을 사용하여 추가로 요구되는 클라이언트 측 자원과 이를 통해 절감되는 서버 네트워크 자원을 분석하였다. 또한 시뮬레이션을 통해 지연시간과 전체 요청자중 지연 비율을 통해 성능 향상을 검증하였다. 결과적으로 제시된 모델은 서버 네트워크 자원이용을 비교할 때 35%의 대역폭 절감 효과를 얻을 수 있다.
PDF KSCI

변위 히스토리 버퍼를 이용한 명령어 및 데이터 프리페치 기법 (Instructions and Data Prefetch Mechanism using Displacement History Buffer)

정용수;김진혁;조태환;최상방
- 전자공학회논문지
- /
- 제52권10호
- /
- pp.82-94
- /
- 2015
본 논문에서는 변위 필드를 이용해 히스토리 레코드를 생성하는 방법과 히스토리 레코드의 기준이 되는 트리거 블록에 우선 순위를 부여하여 효율적인 캐시 교체를 가능하게 하는 하드웨어 프리페치 기법을 제안한다. 히스토리 레코드의 트리거 블록을 기준으로 히스토리를 생성하기 때문에 프로그램의 시퀀스를 고려할 수 있으며, 히스토리를 변위 값으로 저장하기 때문에 트리거 주소와 변위필드에 저장된 값을 더해 빠르게 명령어 또는 데이터 주소를 프리페치 할 수 있다. 또한, 트리거 블록에 우선순위를 부여하고 캐시 교체 정책으로 랜덤 교체 방법을 사용해 캐시 공간이 가득 찼을 때 우선순위가 낮은 블록부터 랜덤하게 교체하는 방법을 제안한다. 제안하는 하드웨어 프리페처의 성능을 평가하기 위해 메모리 분석 시뮬레이터인 gem5와 PARSEC 벤치마크 프로그램을 사용하였다. 그 결과 비트벡터를 이용해 공간영역을 생성하는 기존의 하드웨어 프리페처와 비교해 L1 데이터 캐시의 미스율은 평균 약 44.5% 감소하였고 L1 명령어 캐시의 미스율은 평균 약 31% 감소하였다. 또한 IPC (Instruction Per Cycle)는 평균 약 23.7% 향상을 보였다.
https://doi.org/10.5573/ieie.2015.52.10.082 인용 PDF KSCI

M-RPT: 데이터의 주소 간격을 이용한 적극적인 캐시 선인출 방법 (An Eager Cache Prefetching Scheme Using Stride between Successive Data Reference)

전영숙;문현주;전중남;김석일
- 한국정보과학회:학술대회논문집
- /
- 한국정보과학회 2003년도 가을 학술발표논문집 Vol.30 No.2 (1)
- /
- pp.217-219
- /
- 2003
멀티미디어 응용 프로그램은 방대한 양의 데이터를 실시간으로 고속 처리해야 한다. 적재/저장과 같은 메모리 참조 명령어는 프로세서의 고속 수행에 방해가 되는 주요인이다. 본 논문에서는 메모리 참조 속도를 향상시키기 위해 멀티미디어 데이터의 주소간격이 규칙적으로 참조되는 특성을 활용하여 다음에 참조될 데이터를 미리 캐시로 선인출 함으로써 실행시 캐시 미스율을 줄이고 또한 전체 수행시간을 줄이는 효과적인 방법을 제안한다. 제안한 방법은 캐시 미스율을 줄이는 방법으로서 데이터 선인출 기법을 사용하는데 주소간격을 이동한 기존 연구들에 비해 캐시 미스율에 있어서 평균적으로 27%향상되었다.
PDF

스트리밍 데이터의 선인출에 사용되는 참조예측표 교체 전략 (A Replacement Strategy for Reference Prediction Table Used in Prefetching Streaming Data)

임철후;김석일;전중남
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2003년도 추계학술발표논문집 (상)
- /
- pp.135-138
- /
- 2003
멀티미디어 응용프로그램은 처리데이터를 참조할 때 대부분 간격이 일정한 스트리밍 패턴으로 참조한다. 이 특성을 선인출 방법에 적용하여 멀티미디어 응용프로그램의 성능을 항상 시킬 수 있다. 이 논문에서는 하드웨어기반의 규칙 선인출 방법에서 참조예측표에 운영하는 방법을 제안한다. 크기가 제한되어 있는 참조예측표에 메모리 참조 명령어를 추가할 때 주소간격이 0인 행을 우선적으로 제거함으로써 비용절감의 효과를 가져올 수 있다. 실험 결과 제안한 방법과 기존의 참조예측표를 FIFO 방식으로 운영하는 방법을 비교할 때 제안한 방법의 경우 참조예측표의 크기를 반으로 줄여도 거의 같은 효과를 볼 수 있었다.
PDF

Reducing the User-perceived Latency of Browsers with NVRAM

Kim, Kyusik;Cho, Yongwoon;Kim, Seongmin;Kim, Taeseok
- JSTS:Journal of Semiconductor Technology and Science
- /
- 제17권1호
- /
- pp.23-28
- /
- 2017
Non-volatile RAM (NVRAM) provides many opportunities to improve the performance of computing devices. In this paper, we present an approach that reduces the user-perceived latency of browsers by using NVRAM. To this end, we first analyze the browser launch process, and then employ several techniques that improve the performance of each step by using NVRAM. Specially, we focus on minimizing the launch time of browser by 1) prefetching the block sequence required for browser launch, 2) caching the web resources in the fast NVRAM, and 3) reusing the displayed bitmap data in the frame buffer. Through implementation, we show that our scheme significantly reduces the launch time of browsers.
https://doi.org/10.5573/JSTS.2017.17.1.023 인용 PDF KSCI

검색결과 66건 처리시간 0.023초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)