Search | Korea Science

MI-MESI Write-invalidate Snooping Cache Coherence Protocol (MI-MESI 쓰기-무효화 스누핑 캐쉬 일관성 유지 프로토콜)

Jang, Seong-Tae
- The Transactions of the Korea Information Processing Society
- /
- v.2 no.5
- /
- pp.757-767
- /
- 1995
In this paper, we present MI-MESI write-invalidate snooping cache coherence protocol which addresses several significant drawbacks of MESI and MI-MESI write -invalidate snooping cache coherence protocols under the split transaction bus based multiprocessor environment. In this protocol, each cache block maintains one of six cache states which represent Modified-shared, Invalid-by-other, Modified, Exclusive, Shared and Invalid states. By using these cache states, our protocol reduces both the access contention and unnecessary updates for the memory modules significantly, and thus providing the fast memory access time.
PDF

NAWM Bus Architecture of High Performance for SoC (SoC를 위한 고성능 NAWM 버스 아키텍처)

Lee, Kook-Pyo;Yoon, Yung-Sup
- Journal of the Institute of Electronics Engineers of Korea SD
- /
- v.45 no.9
- /
- pp.26-32
- /
- 2008
The conventional shared bus architecture is capable of processing only one data transaction in same time. In this paper, we propose the NAWM (No Arbitration Wild Master) bus architecture that is capable of processing several data transactions in same time. After designing the master and the slave wrappers of NAWM bus architecture about AMBA system, we confirm that most of IPs of AMBA system can be a lied without modification and the added timing delay can be neglected. from simulation we deduce that more than 50% parallel processing is possible when several masters initiate slaves in NAWM bus architecture.
PDF KSCI

A Lock Mechanism for HiPi-bus Based Multiprocessor Systems (HiPi-bus 구조의 다중 프로세서 시스템에서의 잠금장치)

윤용호;임인칠
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.2
- /
- pp.33-43
- /
- 1993
Lock mechanism is essential for synchronization on the multiprocessor systems. Lock mechanism needs to reduce the time for lock operation in low lock contention. Lock mechanism must consider the case of the high lock contention. The conventional lock control scheme in memory results in the increase of bus traffic and memory utilization in lock operation. This paper suggests a lock scheme which stores the lock data in cache and manages it efficiently to reduce the time spent in lock operation when the lock contention is low on a multiprocessor system built on HiPi-bus(Highly Pipelined bus). This paper also presents the design of the HIPi-CLOCK (Highly Pipelined bus Cache LOCK mechanism) which transfere the data from on cache to another when the lock contention is high. The designed simulator compares the conventional lock scheme which controls the lock in memory with the suggested HiPi-CLOCK scheme in terms of the RMW(Read-Modify-Write) operation time using simulated trace. It is shown that the suggested lock control scheme performance is over twice than that of the conventional method in low lock contention. When the lock contention is high, the performance of the suggested scheme increases as the number of the shared lock data increases.
PDF

Performance Analysis of Bandwidth-Awared Bus Arbitration Method (점유율을 고려한 버스 중재방식의 성능 분석)

Lee, Kook-Pyo;Koh, Si-Young
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.14 no.9
- /
- pp.2078-2082
- /
- 2010
The general bus system architecture consists of masters, slaves, arbiter, decoder and so on in shared bus. As several masters can't use a bus concurrently, arbiter plays an role in bus arbitration. In compliance with the selection of arbitration method, The efficiency of bus usage can be determined. Fixed Priority, Round-Robin, TDMA, Lottery arbitration are studied in conventional arbitration method. Conventional arbitration method is considered bus priority primarily, actual bus utilization didn't considered. In this paper, we propose arbitration method using bus utilization operating block of each master, we verify the performance compared with the other arbitration methods through throughput performance. From the result of performance verification, we confirm that proposed arbitration method, matched bus utilization set by the user 40%, 20%, 20%, 20%.
https://doi.org/10.6109/jkiice.2010.14.9.2078 인용 PDF KSCI

A Power System Economic Operation using Bus Distributed Transmission Loss Information (분산 송전손실정보에 의한 전력시스템의 경제운용)

이봉용;심건보
- The Transactions of the Korean Institute of Electrical Engineers
- /
- v.39 no.4
- /
- pp.333-340
- /
- 1990
분산 송전송실정보에 의한 전력시스템의 경제운용=The transmission loss information produced in a line may be shared by both end buses connected to the line. Then, the loss may be seen as if it is discretely produced at both buses. Likewise, all transmission losses can be considered as if they are discretely produced at every bus distributed. The bus transmission loss equation can be defined, in which the loss information about connected lines are contained. This formulation can greatly enhance the computational efficiency for the economic control of both real powers and voltages. It requires solutions of two linear matrix equations, one for the calculation of incremental transmission losses and the other for the determination of voltage levels to be controlled. The Proposed approach is demonstrated through three sample systems and it is found that the solutions can be obtained after three iterations regardless of system sizes. This implies that only one-step search would be required for the solution if real informations would be available. Results are compared with those of optimal power flows.
PDF

Implementation of AHB1-AHB2 Multi-Bus Architecture Using Memory Selector (메모리 셀렉터를 이용한 AHB1-AHB2 다중버스 아키텍처 구조 구현)

Lee, Keun-Hwan;Lee, Kook-Pyo;Yoon, Yung-Sup
- Proceedings of the IEEK Conference
- /
- 2008.06a
- /
- pp.527-528
- /
- 2008
In this paper, several cases of multi-shared bus architecture are discussed and in order to decrease the bridge latency, the architecture introducing a memory decoder is proposed. Finally, a LCD controller using DMA master is integrated in this bus architecture that is verified due to RTL simulation and FPGA board test. DMA, LCD line buffer and SDRAM controller are normally operated in the timing simulation using ModelSim tool, and the LCD image is confirmed in the real FPGA board containing LCD panel.
PDF

An Analysis of Multi-processor System Performance Depending on the Input/Output Types (입출력 형태에 따른 다중처리기 시스템의 성능 분석)

Moon, Wonsik
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.12 no.4
- /
- pp.71-79
- /
- 2016
This study proposes a performance model of a shared bus multi-processor system and analyzes the effect of input/output types on system performance and overload of shared resources. This system performance model reflects the memory reference time in relation to the effect of input/output types on shared resources and the input/output processing time in relation to the input/output processor, disk buffer, and device standby places. In addition, it demonstrates the contribution of input/output types to system performance for comprehensive analysis of system performance. As the concept of workload in the probability theory and the presented model are utilized, the result of operating and analyzing the model in various conditions of processor capability, cache miss ratio, page fault ratio, disk buffer hit ratio (input/output processor and controller), memory access time, and input/output block size. A simulation is conducted to verify the analysis result.
https://doi.org/10.17662/ksdim.2016.12.4.071 인용 PDF KSCI

Method for NoC Bottleneck Relaxation Using Proxy (프록시를 이용한 NoC의 병목현상 해소 방법)

Kim, Kyu-Chull;Kwon, Tai-Hwan
- The KIPS Transactions:PartA
- /
- v.18A no.1
- /
- pp.25-32
- /
- 2011
NoC is actively being studied recently in order to overcome the limitations of shared-bus architecture. We proposed an NoC architecture which employs a buffer that plays a similar role of a proxy server in a computer network to enhance the communication efficiency of NoC architecture. In the proposed NoC architecture, whenever the master has a difficulty in communicating with the slave directly, the master communicates with the proxy server which is able to communicate with the slave on behalf of the master. With the proposed scheme in NoC, we can increase the speed and the bandwidth of communication channel. The experimental results showed that overall communication efficiency was significantly improved by sending the packets to the proxy server rather than holding them in the switch buffer.
https://doi.org/10.3745/KIPSTA.2011.18A.1.025 인용 PDF KSCI

Performance Analysis of A Distributed Shared Memory Multiprocessor System Using PASEC (PARSEC을 이용한 분산공유메모리 다중프로세서 시스템의 성능분석)

Park, Joon-Seok;Jeon, Chang-Ho
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.10
- /
- pp.3049-3054
- /
- 2000
In this paper, the effects of the hardware components and runtime environments on the overall performance of a distributed shared memory system are analyzed through simulation. In simulation, the system is modeled using PARSE[1.2] closely to the real runtime environment and the 2D FFT is virtually executed on it. The results of simulation show that the minor hardware components such as bus interfaces and local bus of a processor, which are usuallyignored or neglected when analyzing performance. have significant impacts on the overall system performance. Performance variations caused from runtime environments such as loop overhead and code optimuzatio are also analyzed quantitatively.
PDF

Scheduler for parallel processing with finely grained tasks

Hosoi, Takafumi;Kondoh, Hitoshi;Hara, Shinji
- 제어로봇시스템학회:학술대회논문집
- /
- 1991.10b
- /
- pp.1817-1822
- /
- 1991
A method of reducing overhead caused by the processor synchronization process and common memory accesses in finely grained tasks is described. We propose a scheduler which considers the preparation time during searching to minimize the redundant accesses to shared memory. Since the suggested hardware (synchronizer) determines the access order of processors and bus arbitration simultaneously by including the synchronization process into the bus arbitration process, the synchronization time vanishes. Therefore this synchronizer has no overhead caused by the processor synchronization[l]. The proposed scheduler algorithm is processed in parallel. The processes share the upper bound derived by each searching and the lower bound function is built considering the preparation time in order to eliminate as many searches as possible. An application of the proposed method to a multi-DSP system to calculate inverse dynamics for robot arms, showed that the sampling time can be twice shorter than that of the conventional one.
PDF

Search Result 84, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)