• Title/Summary/Keyword: Data processing


Korean Semantic Role Labeling Based on Suffix Structure Analysis and Machine Learning (접사 구조 분석과 기계 학습에 기반한 한국어 의미 역 결정)

  • Seok, Miran;Kim, Yu-Seop
    • KIPS Transactions on Software and Data Engineering / v.5 no.11 / pp.555-562 / 2016
  • Semantic Role Labeling (SRL) determines the semantic relation between a predicate and its arguments in a sentence. Korean SRL has been difficult because the structure of the language differs from that of English, which makes it hard to apply the approaches developed so far. As a result, previously proposed methods have not performed as well for Korean as for English and Chinese. To address these problems, we focus on suffix information analysis, such as the analysis of josa (case suffixes) and eomi (verbal endings). Korean, like Japanese, is an agglutinative language with a well-defined suffix structure in its words. Agglutinative languages can have free word order because of this developed suffix structure. Arguments with a single morpheme are then labeled using statistics. In addition, machine learning algorithms such as Support Vector Machines (SVM) and Conditional Random Fields (CRF) are used to model the SRL problem for arguments that are not labeled in the suffix analysis phase. The proposed method is intended to reduce the range of argument instances to which machine learning approaches, which can produce uncertain and inaccurate role labels, must be applied. In experiments, we use 15,224 arguments and obtain an F1-score of approximately 83.24%, about 4.85 percentage points higher than the state-of-the-art Korean SRL research.
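
The two-stage idea in this abstract (suffix rules first, a learned model only for what the rules leave unlabeled) can be illustrated with a minimal Python sketch. The josa-to-role table, the role names, and the `fallback` callable are all hypothetical; the abstract does not give the actual rules, and `fallback` merely stands in for an SVM or CRF model.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical josa-to-role table (an assumption for illustration only;
# the paper's real suffix rules are not given in the abstract).
JOSA_ROLE = {
    "이": "ARG0", "가": "ARG0",   # nominative case markers
    "을": "ARG1", "를": "ARG1",   # accusative case markers
    "에서": "ARGM-LOC",           # locative case marker
}


def label_by_suffix(josa: Optional[str]) -> Optional[str]:
    """Return a role if the case suffix is in the table, otherwise None."""
    if josa is None:
        return None
    return JOSA_ROLE.get(josa)


def label_arguments(
    arguments: List[Tuple[str, Optional[str]]],
    fallback: Callable[[str, Optional[str]], str],
) -> List[str]:
    """Label (word, josa) pairs: suffix rules first, then the fallback model."""
    labels = []
    for word, josa in arguments:
        role = label_by_suffix(josa)
        if role is None:
            # Only arguments the suffix phase could not label reach the model.
            role = fallback(word, josa)
        labels.append(role)
    return labels
```

In this sketch the learned model is called only for arguments without a usable suffix, which mirrors the abstract's goal of narrowing the set of instances handled by machine learning.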

Direct Pass-Through based GPU Virtualization for Biologic Applications (바이오 응용을 위한 직접 통로 기반의 GPU 가상화)

  • Choi, Dong Hoon;Jo, Heeseung;Lee, Myungho
    • KIPS Transactions on Software and Data Engineering / v.2 no.2 / pp.113-118 / 2013
  • Current GPU virtualization techniques incur large overheads when executing application programs, mainly due to fine-grained time-sharing scheduling of the GPU among multiple Virtual Machines (VMs). In addition, current techniques lack portability because they include the APIs for GPU computation in the VM monitor. In this paper, we propose a low-overhead, high-performance GPU virtualization approach for a heterogeneous HPC system based on the open-source Xen. The proposed techniques are tailored to bio applications. In our virtualization framework, once a VM is assigned a GPU, it occupies that GPU exclusively instead of time-sharing it. This improves both application performance and GPU utilization. Our techniques also allow direct pass-through to the GPU by using the IOMMU virtualization features embedded in the hardware, which provides high portability. Experimental studies using microbiology genome analysis applications show that the proposed direct pass-through techniques significantly reduce overheads compared with the previous Domain0-based approaches. Furthermore, our approach closely matches, or even improves on, the application performance of the bare machine.

The Assessment Guideline of the Simplified Test Maturity Model (TMM) for An Assessor (심사원을 위한 경량화 테스트 성숙도 모델을 위한 평가 가이드 연구)

  • Jang, Woo Sung;Kim, Ki Du;Son, Hyun Seung;Park, Bo Kyung;Kim, R. Young Chul
    • KIPS Transactions on Software and Data Engineering / v.6 no.8 / pp.379-384 / 2017
  • In the real software business environment, many small and medium-sized companies need to validate software quality across diverse uses of software. Software quality means both the quality of production and the quality of the process. We focus on improving the process quality of a test organization rather than that of a whole organization. However, even the original Test Maturity Model (TMM) is not well suited to domestic venture and small and medium-sized companies. To solve this problem, we propose a simplified test maturity model for these companies. We redefine this simplified model based on the original TMM and the Test Process Improvement Next (TPI Next) model. The previous models provide only a definition of each maturity level and the goals and activities for each level; they offer neither an assessment guideline nor a formal assessment procedure. For this reason, it is difficult for an assessor to assess a test organization. This paper proposes an assessment guideline for the simplified TMM and defines a procedure that includes activities and byproducts. With this assessment guideline, an assessor can formally assess the test organizations of small and medium-sized companies, and with the self-assessment guideline, the companies can prepare properly before their organizations are assessed.

Failure Analysis of Aircraft Software Test Cases from a Perspective of Requirements Traceability (요구사항 추적성 관점에서 항공기 탑재 소프트웨어 시험 사례 실패 분석)

  • Kim, Sung-Sub;Cho, Hee-Tae;Lee, Seonah
    • KIPS Transactions on Software and Data Engineering / v.9 no.11 / pp.357-366 / 2020
  • As the proportion and complexity of software embedded in aircraft increase, risk factors such as mission failure, function failure, and performance failure due to software errors also increase. In mission-critical software systems such as aircraft software, managing requirements traceability is essential for maintaining the systems with minimal time and cost. However, development companies do not accurately comply with the guidelines for managing requirements traceability, for reasons such as development cost and schedule. It is therefore not easy to establish and maintain requirements traceability systematically. In this paper, we analyze actual test cases of aviation software systems from the viewpoint of requirements traceability to learn whether test cases fail because systematic traceability management activities are absent. We also examine the risks associated with the failure cases according to their type and severity. An analysis of a total of 7 aircraft-mounted software systems showed that the failure cases fall into three types: omission of requirements, lack of connection between requirements and test procedures, and omission of test procedures. There were 18 failure cases in total, 6 of each type. The numbers of high, middle, and low risks were 1, 13, and 4, respectively, so middle risks were the most numerous.

Analysis of PRC regeneration algorithm performance in dynamic environment by using Multi-DGPS Signal (다중 DGPS 신호를 이용한 동적 환경에서의 PRC 재생성 알고리즘 성능분석)

  • Song Bok-Sub;Oh Kyung-Ryoon;Kim Jeong-Ho
    • The KIPS Transactions:PartA / v.13A no.4 s.101 / pp.335-342 / 2006
  • In this paper, a PRC linear interpolation algorithm is analyzed, verified, and applied so that the unknown location of a user can be identified using PRC information from multiple DGPS reference stations. The PRC information of each GPS satellite does not vary rapidly, which makes it possible to assume that it varies linearly. A PRC regeneration algorithm based on linear interpolation can therefore be applied to improve the accuracy of user positioning by using the PRC information obtained from multiple DGPS reference stations. The desired PRC is formed as a linear combination of the PRC values of a satellite, taken from the signals of the multiple DGPS reference stations, and the known positions of those stations. The RTK-GPS result was used as the reference. To test the performance of the linearly interpolated PRC regeneration algorithm, a multi-channel DGPS beacon receiver was built to determine a user's position more precisely by using PRC data from maritime DGPS reference stations in RTCM format. At the end of this paper, a quantitative analysis of the performance of the developed navigation algorithm is presented.
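
One way to read the linear-variation assumption in this abstract is as a plane fitted through the PRC (pseudorange correction) values of three reference stations and evaluated at the user's position. The Python sketch below follows that reading only; the function name, the local planar coordinate frame, and the use of exactly three stations are assumptions, not details taken from the paper.

```python
def interpolate_prc(stations, user_xy):
    """Fit prc = a + b*x + c*y through three stations and evaluate at user_xy.

    stations: three ((x, y), prc) pairs for ONE satellite, with positions in a
              local planar frame (an assumption made for this sketch).
    user_xy:  approximate (x, y) position of the user in the same frame.
    """
    (x1, y1), p1 = stations[0]
    (x2, y2), p2 = stations[1]
    (x3, y3), p3 = stations[2]

    # Solve the 2x2 system for the plane gradients (b, c) by Cramer's rule.
    det = (x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)
    if det == 0:
        raise ValueError("stations are collinear; the plane is not unique")
    b = ((p2 - p1) * (y3 - y1) - (p3 - p1) * (y2 - y1)) / det
    c = ((x2 - x1) * (p3 - p1) - (x3 - x1) * (p2 - p1)) / det

    ux, uy = user_xy
    return p1 + b * (ux - x1) + c * (uy - y1)
```

Under this model the regenerated PRC at a station's own position equals that station's PRC, and the value changes linearly between stations. The correction would be computed separately for each satellite.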

Page Logging System for Web Mining Systems (웹마이닝 시스템을 위한 페이지 로깅 시스템)

  • Yun, Seon-Hui;O, Hae-Seok
    • The KIPS Transactions:PartC / v.8C no.6 / pp.847-854 / 2001
  • The Web continues to grow at a fast rate in both the large-scale volume of traffic and the size and complexity of Web sites. Along with this growth, the complexity of tasks such as Web site design, Web server design, and simply navigating through a Web site has increased. An important input to these design tasks is the analysis of how a Web site is being used. This paper proposes a Page Logging System (PLS) that reliably identifies the user sessions required in a Web mining system. PLS consists of a Page Logger that acquires all the page accesses of the user, a Log Processor that produces user sessions from these data, and statements that incorporate a call to the Page Logger applet. The proposed PLS shortens several preprocessing tasks that must be performed in Web mining systems and that consume a lot of time and effort. In particular, it reduces the complexity of the transaction identification phase by directly acquiring the amount of time a user stays on a page. PLS also solves the problems that local cache hits and proxy IPs create when identifying user sessions from a Web server log.
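
The role of the Log Processor can be illustrated with a short Python sketch that groups page accesses into user sessions and attaches a dwell time to each page. This is not the paper's implementation: the event format, the function name, and the 30-minute inactivity cutoff are assumptions, and here the dwell time is derived from consecutive timestamps, whereas the abstract describes PLS acquiring it directly.

```python
from collections import defaultdict

SESSION_TIMEOUT = 30 * 60  # seconds; an assumed cutoff, not taken from the paper


def build_sessions(events, timeout=SESSION_TIMEOUT):
    """Group (user, page, timestamp) events into per-user sessions.

    Each session is a list of (page, dwell_seconds). The dwell time is None
    for the last page of a session because no later access bounds it.
    """
    by_user = defaultdict(list)
    for user, page, ts in sorted(events, key=lambda e: e[2]):
        by_user[user].append((page, ts))

    sessions = defaultdict(list)
    for user, visits in by_user.items():
        current = []
        for i, (page, ts) in enumerate(visits):
            nxt = visits[i + 1][1] if i + 1 < len(visits) else None
            if nxt is not None and nxt - ts <= timeout:
                current.append((page, nxt - ts))
            else:
                # A long gap (or the end of the log) closes the session.
                current.append((page, None))
                sessions[user].append(current)
                current = []
    return dict(sessions)
```

Because the events in this sketch are keyed by a user identifier recorded on the client side rather than by IP address, two users behind the same proxy would still be kept apart, which is the kind of problem the abstract says server logs cause.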


A Semi-Noniterative VQ Design Algorithm for Text Dependent Speaker Recognition (문맥종속 화자인식을 위한 준비반복 벡터 양자기 설계 알고리즘)

  • Lim, Dong-Chul;Lee, Haing-Sei
    • The KIPS Transactions:PartB / v.10B no.1 / pp.67-72 / 2003
  • In this paper, we study the enhancement of VQ (Vector Quantization) design for text-dependent speaker recognition. Concretely, we present a non-iterative method for building a vector quantization codebook; because the method does not rely on iterative learning, the computational complexity is drastically reduced. The proposed semi-noniterative VQ design method contrasts with the existing design method, which uses an iterative learning algorithm for every training speaker. The characteristics of the semi-noniterative VQ design are as follows. First, the proposed method performs iterative learning only for the reference speaker, whereas the existing method performs iterative learning for every speaker. Second, the quantization region of a non-reference speaker is identical to the quantization region of the reference speaker, and the quantization point of the non-reference speaker is the optimal point for the statistical distribution of that speaker. In the numerical experiment, we use 12th-order mel-cepstrum feature vectors of 20 speakers and compare the proposed method with the existing method while changing the codebook size from 2 to 32. The recognition rate of the proposed method is 100% for a suitable codebook size and adequate training data, which equals the recognition rate of the existing method. The proposed semi-noniterative VQ design method is therefore a new alternative that reduces computational complexity while maintaining the recognition rate.
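
The second characteristic described above can be sketched in Python as a single pass over a non-reference speaker's feature vectors. The sketch assumes that the quantization regions are the nearest-neighbor regions of the reference codebook and that the "optimal point" of a region is the centroid of the speaker's vectors falling in it; both are interpretations of the abstract, and the function names are hypothetical.

```python
def nearest(codebook, v):
    """Index of the codeword closest to v in squared Euclidean distance."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(codebook[k], v)),
    )


def speaker_codebook(ref_codebook, vectors):
    """One-pass codebook for a non-reference speaker.

    Regions are those of ref_codebook (trained iteratively elsewhere, for the
    reference speaker only). Each new codeword is the centroid of the speaker's
    vectors in that region. Empty regions keep the reference codeword, which is
    an assumption of this sketch.
    """
    dim = len(ref_codebook[0])
    sums = [[0.0] * dim for _ in ref_codebook]
    counts = [0] * len(ref_codebook)
    for v in vectors:
        k = nearest(ref_codebook, v)
        counts[k] += 1
        for d in range(dim):
            sums[k][d] += v[d]
    return [
        [s / counts[k] for s in sums[k]] if counts[k] else list(ref_codebook[k])
        for k in range(len(ref_codebook))
    ]
```

No loop over training epochs appears for the non-reference speaker: each vector is visited once, which is where the reduction in computation would come from under these assumptions.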

An Analysis of decision Factor on Drive Distance for University Golf Player's Object Execution Using Late Hitting Method (대학 골프선수들의 의도적 지연히팅 시 비거리 결정인자 분석)

  • So, Jea-Moo;Lim, Young-Tae;Kim, Yong-Seok;Cho, Bum-Wook
    • Korean Journal of Applied Biomechanics / v.15 no.3 / pp.71-78 / 2005
  • The purpose of this research was to analyze the factors that determine drive distance when outstanding college golfers deliberately use late hitting, and to verify the difference in distance between a regular swing and a swing intended to increase distance. The research also analyzes the correlation between drive distance and the kinematic variables club head velocity, ball velocity, launch angle, back spin, and meet ratio. To this end, 25 outstanding college players with an average experience of 6 years and an average handicap of 5 were studied. A comparative analysis of the two swings, the regular swing and the swing intended to increase distance, was conducted using a driver. For both types of swing, an analysis system comprising Bridgestone's Science Eye analysis software and peripherals was used to define the relationships among club head velocity, ball velocity, launch angle, back spin, and meet ratio. To process the data on the factors that determine distance, differences in distance by type of swing were tested with an independent t-test using the SPSS 12.0 statistics program. In addition, correlation analysis was used to determine the level of correlation between the variables that contribute to increased distance, and trend analysis was used to examine how drive distance tends to increase as each variable increases. The key results of this experiment are as follows: 1. Deliberate late hitting by skilled players aiming for greater drive distance increased the distance (p<.05). 2. Drive distance was positively correlated with ball velocity, club head velocity, and meet ratio, and negatively correlated with back spin and launch angle. Ball velocity and club head velocity were very highly correlated with drive distance (p<.01), and back spin was negatively correlated with distance (p<.01). 3. Among the measured variables, increasing club head velocity contributed most to increasing distance; ball velocity and meet ratio also contributed, whereas increasing launch angle and back spin had a negative effect.

The Effects on The Glass Processing by Alumina Addition in Soda Lime Glass (소다석회유리에서 Alumina첨가제에 따른 제병 공정의 영향)

  • Choi, Young-June;Kim, Jong-Ock;Kim, Taik-Nam
    • The Journal of Engineering Research / v.4 no.1 / pp.69-85 / 2002
  • The chemical composition of bottle glass consists of Na2O-CaO-SiO2. However, cullet is normally used in order to decrease the melting temperature. This raises bottle productivity and decreases cost. The addition of plate glass decreases the Al2O3 content and influences the stone phenomenon and devitrification in bottle glass. Thus, feldspar is added in order to increase the Al2O3 content when plate cullet is added to the melt. Tridymite crystals, which appear as white crystals, were observed at Al2O3 contents over 7.5%. It is supposed that wollastonite would occur at Al2O3 contents above 7.5%. This fact is consistent with the literature.


Comparative Analysis of Economic Efficiency by Major Sericultural Farming Areas in Korea (잠업단지의 경제효율에 관한 비교분석)

  • 이질현;김문협;강석권
    • Journal of Sericultural and Entomological Science / v.14 no.2 / pp.95-103 / 1972
  • The major purpose of this study is to collect information on economic efficiency in order to solve the problems faced by farmers and areas, and to provide scientific facts to farmers and related institutions for the further development of the sericultural sector in Korea. To obtain this information, 12 sample areas among 23 major sericultural farming areas, and 30 farm units in each area, were selected and analyzed. The field survey was carried out by members of the study team and graduate students in the Department of Sericultural Science using a prepared questionnaire. Cross-section and regression analysis methods were employed to process the data. The major findings are as follows. 1. Sericultural earnings per Tanbo average 22,752 won in newly cultivated areas and 29,403 won in ordinary ones. Earnings differ greatly by area: 46,968 won in the Kumo mountain area compared with 16,798 won in the Yeoju and Yichun areas. The general trend is that small-scale farming units achieve higher earnings and operate their farms efficiently. 2. Cocoon production expenses per Tanbo are 16,737 won in newly cultivated areas and 19,802 won in ordinary areas. Farming expenses also differ greatly: 27,389 won in the Sudang area compared with 11,689 won in the Emjin area. 3. Sericultural income per Tanbo is 10,664 won in ordinary areas and 6,898 won in newly cultivated areas. Farmers in the Kumo mountain area have the highest income, 21,164 won, and those in the Sudang area the lowest, 1,296 won. It can be generalized that farmers with holdings of about 30-50 a earn higher incomes. 4. Land, labor, and capital productivities, estimated by fitting Cobb-Douglas functions, are higher in ordinary areas than in newly cultivated areas; labor productivity in particular is higher in ordinary areas. 5. The Changsung, Kwangna, Yunsun, and Kumo mountain areas are technically and economically efficient. The Sudang and Mujinchang areas are technically successful but economically inefficient, and the Emjin and Honam areas are technically inefficient but economically efficient. Yeoju-Yichun, Chunwon, and West Kyongnam are technically and economically inefficient. Technical and economic improvement programs should be implemented for these areas. 6. The estimated Internal Rate of Return (IRR) on capital investment in the Chongwon area is 23.5 percent. This is economically feasible given the 20 percent opportunity cost of capital in our economy.
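
The feasibility test in finding 6 compares an internal rate of return with the opportunity cost of capital. The Python sketch below shows how an IRR can be found by bisection on the net present value. The function names and any cash flows passed to them are hypothetical, since the abstract does not report the underlying investment figures.

```python
def npv(rate, cash_flows):
    """Net present value of yearly cash flows; cash_flows[0] occurs at year 0."""
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows))


def irr(cash_flows, lo=0.0, hi=1.0, tol=1e-9):
    """Internal rate of return by bisection.

    Assumes the NPV changes sign exactly once between lo and hi, which holds
    for a single initial outlay followed by positive returns.
    """
    f_lo = npv(lo, cash_flows)
    if f_lo * npv(hi, cash_flows) > 0:
        raise ValueError("NPV does not change sign on [lo, hi]")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if npv(mid, cash_flows) * f_lo > 0:
            lo = mid   # the root lies in the upper half
        else:
            hi = mid   # the root lies in the lower half (or at mid)
    return (lo + hi) / 2


def is_feasible(cash_flows, opportunity_cost=0.20):
    """An investment passes when its IRR exceeds the opportunity cost of capital."""
    return irr(cash_flows) > opportunity_cost
```

With the abstract's figures, an IRR of 23.5 percent against a 20 percent opportunity cost passes this test; the default of 0.20 in `is_feasible` simply reuses that reported opportunity cost.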
