• Title/Abstract/Keywords: Parallel-flow

Search results: 1,066 items (processing time: 0.055 s)

Abstract Visualization for Effective Debugging of Parallel Programs Based on Multi-threading (멀티 스레딩 기반 병렬 프로그램의 효과적인 디버깅을 위한 추상적 시각화)

  • Kim, Young-Joo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • Vol. 20, No. 3
    • /
    • pp.549-557
    • /
    • 2016
  • Effective visualization must summarize not only a large amount of debugging information but also the mental models of abstract ideas. This paper presents an abstract visualization tool that effectively visualizes thread structure and race information for OpenMP programs with critical sections and nested parallelism, using a partial-order execution graph that captures the logical concurrency among threads. The tool is supported by an on-the-fly trace-filtering technique that reduces the space complexity of the visualization information, and by a graph abstraction technique that reduces the visual complexity of nested parallelism and critical sections in the filtered trace. The graph abstraction of the partial-order relation and race information is effective for understanding program execution and for detecting and eliminating races, because the user can examine the program's control flow and the locations of races in a structured fashion.
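The idea of detecting races from logical concurrency can be sketched as follows. This is a minimal illustration, not the tool described in the abstract: it assumes each memory access carries a vector-clock timestamp (a stand-in for the paper's partial-order execution graph), it ignores critical sections, and the names `Access`, `happens_before`, and `is_race` are hypothetical.

```python
from typing import NamedTuple, Tuple


class Access(NamedTuple):
    thread: int              # id of the thread that performed the access
    location: str            # shared variable that was touched
    is_write: bool           # True for a write, False for a read
    clock: Tuple[int, ...]   # assumed vector-clock timestamp of the access


def happens_before(a: Tuple[int, ...], b: Tuple[int, ...]) -> bool:
    """True if timestamp a is ordered strictly before timestamp b."""
    return a != b and all(x <= y for x, y in zip(a, b))


def is_race(first: Access, second: Access) -> bool:
    """Two accesses race if they come from different threads, touch the same
    location, include at least one write, and are logically concurrent."""
    if first.thread == second.thread or first.location != second.location:
        return False
    if not (first.is_write or second.is_write):
        return False
    ordered = (happens_before(first.clock, second.clock)
               or happens_before(second.clock, first.clock))
    return not ordered


write_x = Access(thread=0, location="x", is_write=True, clock=(1, 0))
read_x_concurrent = Access(thread=1, location="x", is_write=False, clock=(0, 1))
read_x_after_join = Access(thread=1, location="x", is_write=False, clock=(1, 1))
```

Here `is_race(write_x, read_x_concurrent)` holds because neither timestamp is ordered before the other, while `read_x_after_join` is ordered after the write and therefore does not race with it.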

Design and Evaluation of Flexible Thread Partitioning System (융통성 있는 스레드 분할 시스템 설계와 평가)

  • Jo, Sun-Moon
    • Journal of Internet Computing and Services
    • /
    • Vol. 8, No. 3
    • /
    • pp.75-83
    • /
    • 2007
  • The multithreaded model is an effective parallel system in that it can reduce long memory reference latency and solve synchronization problems. When compiling non-strict functional programs for a multithreaded parallel machine, the most important task is to find sets of sequentially executable instructions and to partition them into threads. The existing partitioning algorithm splits a conditional expression into three basic blocks (the condition, the true expression, and the false expression) and applies local partitioning to each of them. Better partitioning is possible if the definition of a thread is modified to allow branching within the thread. Branching within a thread does not reduce parallelism, does not increase the number of synchronizations, and does not violate the basic rules of thread partitioning. On the contrary, it can lengthen threads and reduce the number of synchronizations. In this paper, we enhance the thread partitioning method by combining the three basic blocks into one or two blocks.


An experimental study on the operating performance of facade installed natural circulation type solar thermal system (수직벽면형 무동력 태양열 시스템 작동성능에 관한 실험적 연구)

  • Baek, Nam-Choon;Lee, Wang-Je;Lee, Jin-Kook;Lee, Soon-Myung
    • Journal of the Korean Solar Energy Society
    • /
    • Vol. 35, No. 4
    • /
    • pp.1-7
    • /
    • 2015
  • The operation of natural circulation type solar heating systems with facade-integrated collectors was analyzed experimentally. Two types of flat plate solar collectors were used: a normal flat plate collector measuring 1 m × 2 m and a large collector of $4m^2$ (1 m × 4 m). The experiments investigated the effect of series or parallel connection on collector performance. The results indicate that a solar thermal system installed on a wall or facade can operate by natural circulation if the system design reflects various parameters, including the collector connection method (series or parallel), sufficient vertical height between the collector and the storage tank, and reduced pressure loss in the collector and piping network. The natural circulation solar thermal system proposed in this study can increase system reliability by removing or minimizing the use of components such as pumps, controllers, and sensors, which may cause serious trouble during long-term operation.

Performance Evaluation of QoS-based Web Services Selection Models (QoS 기반 웹 서비스 선택 모형의 성능 평가)

  • Seo, Sang-Koo
    • Journal of the Korea Society of Computer and Information
    • /
    • Vol. 12, No. 4
    • /
    • pp.43-52
    • /
    • 2007
  • As the number of public Web Services increases, there will be many services with the same functionality. These services, however, will vary in their QoS properties, such as price, response time, and availability, so it is important to choose the best service while satisfying the given QoS constraints. This paper focuses on parallel branching and response time constraints in business processes and investigates several service selection plans based on the multidimensional multiple-choice knapsack model. Specifically, the paper proposes a plan with a response time constraint for each execution flow, a plan with a single constraint over all service types, and a plan with a constraint on a particular execution path of a composite Web Service. Experiments are conducted to observe the performance of each plan while varying the number of services, the number of branches, and the value of the response time constraint. The experimental results show that reducing the number of candidate services using Pareto dominance is very effective, and that the plan with a constraint over all service types is efficient in both time and solution quality for small to medium-sized problems.
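The Pareto dominance step mentioned in the abstract can be illustrated with a short sketch. It assumes the three QoS properties named in the abstract (price and response time to be minimized, availability to be maximized); the `Service` type, the function names, and the sample values are hypothetical and are not taken from the paper.

```python
from typing import List, NamedTuple


class Service(NamedTuple):
    name: str
    price: float          # lower is better
    response_time: float  # lower is better
    availability: float   # higher is better


def dominates(a: Service, b: Service) -> bool:
    """a dominates b if a is no worse on every property and better on one."""
    no_worse = (a.price <= b.price
                and a.response_time <= b.response_time
                and a.availability >= b.availability)
    strictly_better = (a.price < b.price
                       or a.response_time < b.response_time
                       or a.availability > b.availability)
    return no_worse and strictly_better


def pareto_filter(candidates: List[Service]) -> List[Service]:
    """Keep only the candidates that no other candidate dominates."""
    return [s for s in candidates
            if not any(dominates(other, s) for other in candidates)]


candidates = [
    Service("a", price=10.0, response_time=100.0, availability=0.99),
    Service("b", price=12.0, response_time=120.0, availability=0.95),
    Service("c", price=8.0, response_time=150.0, availability=0.99),
]
survivors = pareto_filter(candidates)
```

In this example service `b` is dropped because `a` is cheaper, faster, and more available, while `a` and `c` both survive because each is better than the other on one property. Only the surviving candidates would then be passed to the selection model.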


Experimental Study on the Two Phase Thermosyphone Loop with Parallel Connected Multiple Evaporators under Partial Load and Low Temperature Operating Condition (병렬 연결된 다중 증발기 구조 2상 유동 순환형 열사이폰의 부분부하 및 저온운전 특성에 관한 실험적 연구)

  • Kang In-Seak;Choi Dong-Kyu;Kim Taig-young
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • Vol. 16, No. 11
    • /
    • pp.1051-1059
    • /
    • 2004
  • A two-phase thermosyphon loop for electronics cooling was designed and manufactured to test its performance under partial load and low ambient temperature conditions. The thermosyphon device has six evaporators connected in parallel in order to cool six power amplifier units (PAUs) independently. The heater modules that simulate the PAUs are attached to the evaporator plates with thermal pads to reduce contact resistance. The liquid refrigerant is distributed unevenly among differently heated evaporators because of the vapor pressure difference. To reduce the vapor pressure differences caused by partial heating, two evaporators are connected to each other by a copper tube. This pressure regulation tube successfully reduces the imbalance and is a good candidate for field-distributed systems. Under low ambient temperature operating conditions, such as $-30^{\circ}C$, unexpected subcooling may occur in the condenser. This leads to a very low saturation pressure, under which explosive boiling occurs in the evaporator. The abrupt pressure rise caused by the explosive boiling inhibits the supply of liquid refrigerant to the evaporator for continuous cooling, and the cooling cycle eventually breaks down. For normal circulation of the refrigerant, there may be an optimum cooling air flow rate in the condenser for a given heat load.

Numerical modeling of two parallel tunnels interaction using three-dimensional Finite Elements Method

  • Nawel, Bousbia;Salah, Messast
    • Geomechanics and Engineering
    • /
    • Vol. 9, No. 6
    • /
    • pp.775-791
    • /
    • 2015
  • The expansion of transport routes (metro, highways, railways) and the need to improve traffic flow often impose difficult crossings that generally lead to the construction of underground works (tunnels, water conveyance tunnels...), which play a major role in the redevelopment of urban areas. This study assesses the interaction response of parallel tunnels and uses results from the simulation of two tunnels to illustrate a few observations that may aid practical design. The simultaneous drilling of a highway's twin tunnels is simulated by means of the Finite Element Method (FEM) implemented in the Plaxis program. The subject arises in geotechnical settings where several tunnels may have to be constructed, sometimes in ground with weak mechanical characteristics. The objective of the study is to simulate numerically the interaction effects caused by the construction of two parallel tunnels, which is an important factor in the overall response of interacting parallel underground works. The magnitude of the transmitted effects depends on several parameters, such as the type of works and the mechanical characteristics (tunnel size, depth, relative position of the two tunnels, lining thickness...). The study is applied to a real case, a section of tunnel T4 of the East-West highway (Algeria), and comprises a series of numerical simulations of two tunnels using the finite element program Plaxis.

AANet: Adjacency auxiliary network for salient object detection

  • Li, Xialu;Cui, Ziguan;Gan, Zongliang;Tang, Guijin;Liu, Feng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 15, No. 10
    • /
    • pp.3729-3749
    • /
    • 2021
  • Deep convolutional network-based salient object detection (SOD) has achieved impressive performance. However, it remains challenging to make full use of the multi-scale information in the extracted features and to choose an appropriate feature fusion method for processing the feature maps. In this paper, we propose a new adjacency auxiliary network (AANet) based on multi-scale feature fusion for SOD. First, we design a parallel connection feature enhancement module (PFEM) for each layer of feature extraction, which improves feature density by connecting different dilated convolution branches in parallel, and we add a channel attention flow to fully extract the context information of the features. Then, adjacent-layer features with a similar degree of abstraction but different characteristic properties are fused through the adjacent auxiliary module (AAM) to eliminate ambiguity and noise in the features. In addition, to refine the features effectively and obtain more accurate object boundaries, we design an adjacency decoder (AAM_D) based on the AAM, which concatenates the features of adjacent layers, extracts their spatial attention, and then combines them with the output of the AAM. The AAM_D outputs, which carry the semantic information and spatial detail obtained from each feature, are used as saliency prediction maps for multi-level joint feature supervision. Experimental results on six benchmark SOD datasets demonstrate that the proposed method outperforms similar previous methods.

A Study on Ring Buffer for Efficiency of Mass Data Transmission in Unstable Network Environment (불안정한 네트워크 환경에서 대용량 데이터의 전송 효율화를 위한 링 버퍼에 관한 연구)

  • Song, Min-Gyu;Kim, Hyo-Ryoung
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • Vol. 15, No. 6
    • /
    • pp.1045-1054
    • /
    • 2020
  • In this paper, we design a TCP/IP-based ring buffer system that can stably transfer bulk data streams in unstable network environments. In the proposed scheme, the observation data stream that each radio observatory's backend system outputs as UDP frames is stored as UDP packets in a large-capacity ring buffer via a socket buffer in the client system. Then, for stable transmission to the remote destination, the packets are sent over TCP to the socket buffer of the server system at the correlation center, where they are stored in a large-capacity ring buffer if they arrive without problems. In case of errors such as loss, duplication, or out-of-order delivery, the packets are retransmitted through TCP flow control, which guarantees the reliability of the data arriving at the correlation center. We also suggest that, when congestion avoidance is triggered by unstable network performance, performance degradation can be minimized by applying parallel streams.
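As a rough illustration of the data structure at the center of this scheme, the sketch below shows a fixed-capacity ring buffer for packets. The abstract does not describe the buffer's internals, so the class name, the method names, and the policy of rejecting new packets when the buffer is full are all assumptions made for the example.

```python
from typing import List, Optional


class RingBuffer:
    """Fixed-capacity FIFO that reuses its slots in circular order."""

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._slots: List[Optional[bytes]] = [None] * capacity
        self._head = 0    # index of the oldest stored packet
        self._count = 0   # number of packets currently stored

    def put(self, packet: bytes) -> bool:
        """Store a packet; return False if the buffer is full (assumed policy)."""
        if self._count == len(self._slots):
            return False
        tail = (self._head + self._count) % len(self._slots)
        self._slots[tail] = packet
        self._count += 1
        return True

    def get(self) -> Optional[bytes]:
        """Remove and return the oldest packet, or None if the buffer is empty."""
        if self._count == 0:
            return None
        packet = self._slots[self._head]
        self._slots[self._head] = None
        self._head = (self._head + 1) % len(self._slots)
        self._count -= 1
        return packet


buffer = RingBuffer(capacity=2)
accepted = [buffer.put(b"frame-1"), buffer.put(b"frame-2"), buffer.put(b"frame-3")]
oldest = buffer.get()
```

With a capacity of two, the third `put` is rejected until a `get` frees a slot. A receiver that must never drop data would instead block or signal the sender at that point, which is a design choice the abstract leaves open.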

A Fast Processor Architecture and 2-D Data Scheduling Method to Implement the Lifting Scheme 2-D Discrete Wavelet Transform (리프팅 스킴의 2차원 이산 웨이브릿 변환 하드웨어 구현을 위한 고속 프로세서 구조 및 2차원 데이터 스케줄링 방법)

  • Kim Jong Woog;Chong Jong Wha
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • Vol. 42, No. 4
    • /
    • pp.19-28
    • /
    • 2005
  • In this paper, we propose a fast parallel hardware architecture for the 2-D discrete wavelet transform based on the lifting scheme. The proposed architecture improves 2-D processing speed and reduces the internal memory buffer size. Previous lifting-scheme-based parallel 2-D wavelet transform architectures consist of row-direction and column-direction modules, each a pair of prediction and update filter modules. In the 2-D wavelet transform, column-direction processing uses the row-direction results, which are generated in row order rather than column order, so most hardware architectures need internal buffer memory. The proposed architecture focuses on reducing the internal memory buffer size and the total calculation time. To reduce the total calculation time, we propose 4-way data flow scheduling and a memory-based parallel hardware architecture. The 4-way data flow scheduling increases row-direction parallel performance and reduces the initial latency before the row-direction calculation starts. In this hardware architecture, the internal buffer memory is not used to store the results of the row-direction calculation; instead, it holds intermediate values of the column-direction calculation. This method is very effective in column-direction processing, because the column-direction input data are not generated in column order. The proposed architecture was implemented in VHDL on an Altera Stratix device. The implementation results show that the overall calculation time is reduced from $N^2/2+\alpha$ to $N^2/4+\beta$ and that the internal buffer memory size is reduced by around $50\%$ compared with previous works.
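The predict and update steps of the lifting scheme, and the way the column pass consumes the row pass results, can be sketched in software. The abstract does not name the wavelet filter, so this example assumes the simplest case (Haar lifting) and makes no attempt to model the 4-way scheduling or the hardware; all function names are hypothetical.

```python
from typing import List, Tuple


def lifting_forward(signal: List[float]) -> Tuple[List[float], List[float]]:
    """One level of Haar lifting on an even-length 1-D signal."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    even, odd = signal[0::2], signal[1::2]
    # Predict: each odd sample is predicted by its even neighbour.
    detail = [o - e for e, o in zip(even, odd)]
    # Update: adjust the even samples so they hold the pairwise average.
    approx = [e + d / 2 for e, d in zip(even, detail)]
    return approx, detail


def lifting_inverse(approx: List[float], detail: List[float]) -> List[float]:
    """Undo lifting_forward by reversing the update and predict steps."""
    even = [s - d / 2 for s, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    signal: List[float] = []
    for e, o in zip(even, odd):
        signal.extend([e, o])
    return signal


def lifting_2d(image: List[List[float]]) -> List[List[float]]:
    """Row pass followed by column pass; the column pass reads row results."""
    row_pass = []
    for row in image:
        approx, detail = lifting_forward(row)
        row_pass.append(approx + detail)
    col_pass = []
    for column in zip(*row_pass):
        approx, detail = lifting_forward(list(column))
        col_pass.append(approx + detail)
    # Transpose back so the result is indexed [row][column] again.
    return [list(row) for row in zip(*col_pass)]


approx, detail = lifting_forward([2, 4, 6, 8])
restored = lifting_inverse(approx, detail)
flat = lifting_2d([[1, 1], [1, 1]])
```

In `lifting_2d` the column pass cannot start until the row results it needs exist. That dependency is why the architectures discussed in the abstract need an internal buffer or a careful data schedule.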

A Study on the Airflow Distribution in the Diagonal Ventilation Circuit for the Design of a High Level Radioactive Waste Repository (고준위 방사성 폐기물 처분장 설계를 위한 Diagonal 환기 회로 내 공기량 분배에 관한 연구)

  • Hwang, In-Phil;Choi, Heui-Joo;Roh, Jang-Hoon;Kim, Jin
    • Tunnel and Underground Space
    • /
    • Vol. 22, No. 3
    • /
    • pp.173-180
    • /
    • 2012
  • This study examines diagonal ventilation circuits, which are advantageous for controlling air flow direction. The results show that air volumes in diagonal ventilation circuits can also be calculated using numerical formulas or programs, as with other series/parallel circuits, if the air volumes and air flow directions in the diagonal branches are determined in advance. To apply the results, design plans for high-level radioactive waste repositories were prepared with diagonal ventilation circuits and with parallel ventilation circuits. To compare the design plans and obtain expected operating results, ventilation network simulations were conducted with Ventsim, a ventilation network program. In the diagonal repository, which was expected to cause a large increase in resistance, fan pressure was 1,570 Pa, total flux was 84 $m^3/s$, fan efficiency was 76.4%, fan power consumption was 181.2 kW, and annual fan operating costs were 178,710,838; the maximum differences were around 8% in pressure and flux and around 1.5% in operating costs.
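For the series/parallel circuits that the abstract uses as the baseline, air volumes follow from standard airway resistance formulas. The sketch below assumes the square law commonly used in mine ventilation (pressure drop equals resistance times flow squared); it does not reproduce the paper's diagonal-circuit calculation, and the function names and sample values are hypothetical.

```python
import math
from typing import List


def series_resistance(resistances: List[float]) -> float:
    """Airways in series carry the same flow, so resistances add."""
    return sum(resistances)


def parallel_resistance(resistances: List[float]) -> float:
    """Airways in parallel share the same pressure drop:
    1/sqrt(R_eq) = sum of 1/sqrt(R_i) under the square law p = R * Q**2."""
    return 1.0 / sum(1.0 / math.sqrt(r) for r in resistances) ** 2


def split_flow(total_flow: float, resistances: List[float]) -> List[float]:
    """Distribute a total flow over parallel airways in proportion to 1/sqrt(R_i)."""
    weights = [1.0 / math.sqrt(r) for r in resistances]
    total_weight = sum(weights)
    return [total_flow * w / total_weight for w in weights]


r_series = series_resistance([1.0, 2.0])
r_parallel = parallel_resistance([4.0, 4.0])
flows = split_flow(10.0, [4.0, 4.0])
```

Two equal airways in parallel split the flow evenly and present a quarter of the single-airway resistance. A diagonal branch cannot be reduced with these two rules alone, which is consistent with the abstract's condition that its air volume and flow direction be determined in advance.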