• Title/Summary/Keyword: pdf

Search Result 626, Processing Time 0.028 seconds

Organizing an in-class hackathon to correct PDF-to-text conversion errors of Genomics & Informatics 1.0

  • Kim, Sunho;Kim, Royoung;Nam, Hee-Jo;Kim, Ryeo-Gyeong;Ko, Enjin;Kim, Han-Su;Shin, Jihye;Cho, Daeun;Jin, Yurhee;Bae, Soyeon;Jo, Ye Won;Jeong, San Ah;Kim, Yena;Ahn, Seoyeon;Jang, Bomi;Seong, Jiheyon;Lee, Yujin;Seo, Si Eun;Kim, Yujin;Kim, Ha-Jeong;Kim, Hyeji;Sung, Hye-Lynn;Lho, Hyoyoung;Koo, Jaywon;Chu, Jion;Lim, Juwon;Kim, Youngju;Lee, Kyungyeon;Lim, Yuri;Kim, Meongeun;Hwang, Seonjeong;Han, Shinhye;Bae, Sohyeun;Kim, Sua;Yoo, Suhyeon;Seo, Yeonjeong;Shin, Yerim;Kim, Yonsoo;Ko, You-Jung;Baek, Jihee;Hyun, Hyejin;Choi, Hyemin;Oh, Ji-Hye;Kim, Da-Young;Park, Hyun-Seok
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.33.1-33.7
    • /
    • 2020
  • This paper describes a community effort to improve earlier versions of the full-text corpus of Genomics & Informatics by semi-automatically detecting and correcting PDF-to-text conversion errors and optical character recognition errors during the first hackathon of Genomics & Informatics Annotation Hackathon (GIAH) event. Extracting text from multi-column biomedical documents such as Genomics & Informatics is known to be notoriously difficult. The hackathon was piloted as part of a coding competition of the ELTEC College of Engineering at Ewha Womans University in order to enable researchers and students to create or annotate their own versions of the Genomics & Informatics corpus, to gain and create knowledge about corpus linguistics, and simultaneously to acquire tangible and transferable skills. The proposed projects during the hackathon harness an internal database containing different versions of the corpus and annotations.

Current Trends for National Bibliography through Analyzing the Status of Representative National Bibliographies (주요국 국가서지 현황조사를 통한 국가서지의 최신 경향 분석)

  • Lee, Mihwa;Lee, Ji-Won
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.32 no.1
    • /
    • pp.35-57
    • /
    • 2021
  • This paper is to grasp the current trends of national bibliographies through analyzing representative national bibliographies using literature review, analysis of national bibliographies' web pages and survey. First, in order to conform to the definition of a national bibliography as a record of a national publication, it attempts to include a variety of materials from print to electronic resources, but in reality it cannot contain all the materials, so there are exceptions. It is impossible to create a general selection guide for national bibliography coverage, and a plan that reflects the national characteristics and prepares a valid and comprehensive coverage based on analysis is needed. Second, cooperation with publishers and libraries is being made to efficiently generate national bibliography. For the efficiency of national bibliography generation, changes should be sought such as the standardization and consistency, the collection level metadata description for digital resources, and the creation of national bibliography using linked data. Third, national bibliography is published through the national bibliographic online search system, linked data search, MARC download using PDF, OAI-PMH, SRU, Z39.50, and mass download in RDF/XML format, and is integrated with the online public access catalog or also built separately. Above all, national bibliographies and online public access catalogs need to be built in a way of data reuse through an integrated library system. Fourth, as a differentiated function for national bibliography, various services such as user tagging and national bibliographic statistics are provided along with various browsing functions. In addition, services of analysis of national bibliographic big data, links to electronic publications, and mass download of linked data should be provided, and it is necessary to identify users' needs and provide open services that reflect them in order to develop differentiated services. Through the current trends and considerations of the national bibliographies analyzed in this study, it will be possible to explore changes in national and international national bibliography.

Effect of Time-dependent Diffusion and Exterior Conditions on Service Life Considering Deterministic and Probabilistic Method (결정론 및 확률론적 방법에 따라 시간의존성 염화물 확산계수 및 외부 영향인자가 내구수명에 미치는 영향)

  • Kwon, Seung-Jun
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.20 no.6
    • /
    • pp.65-72
    • /
    • 2016
  • Service life evaluation for RC Structures exposed to chloride attack is very important, however the previous two methods(deterministic and probabilistic method) show a big difference. The paper presents a service life simulation using deterministic and probabilistic method with time-dependent diffusion coefficient. Three different cases are considered for diffusion coefficient, concrete cover depth, and surface chloride content respectively, and then the PDF(probability of durability failure) and the related service life are obtained. Through adopting time-dependent diffusion, the discrepancy between the two methods can be reduced, which yields reasonable service life. When diffusion coefficient increases from $2.5{\times}10^{-12}m^2/sec$ to $7.5{\times}10^{-12}m^2/sec$, the service life decreases to 25.5~35.6% level, and cover depth does from 75 mm to 125 mm, it increases to 267~311% level as well. In the case of surface chloride content from $5.0kg/m^3$ to $15.0kg/m^3$, it changes to 40.9~54.5%. The effect of cover depth is higher than the others by 8~10 times and also implies it is a key parameter to service life extension.

Sports Celebrities as a Determinant of Sport Media Distribution Contents: Focusing on Tacit Premise of Agenda Setting Theory (스포츠미디어의 유통 콘텐츠 결정요인으로서 스포츠 스타: 의제설정 이론의 암묵적 전제를 중심으로)

  • YOO, Sang-Keon;KIM, Yong-Eun;SEO, Won-Jae
    • Journal of Distribution Science
    • /
    • v.17 no.10
    • /
    • pp.83-91
    • /
    • 2019
  • Purpose - Media is a significant distributional channel in sport. In terms of determining the influencer in building sport media contents, recent sport media studies have employed agenda-setting theory, assuming media itself as the agenda provider. In a real-world situation, however, sports stars have been deemed key factor determining distribution contents in sport. The starting point of this study is the "tacit premise" of agenda-setting theory. Given the agenda-setting theory, the current study attempted to explore the function of sport stars as an agenda provider, which is a key determinant of sport distribution. Research design, data, and methodology - This study has reviewed articles of Yuna Kim, Sang-hwa Lee, and Hyun-jin Ryu from daily newspapers including as dong-a ilbo and joongang ilbo (2013 to 2017). The study collected data, portable document format (PDF), from the online archive of dong-a ilbo and joongang ilbo. We coded the length of the article, the frequency, the size of the picture, and the structural form of the article. Inter-coder reliability was compared with data previously investigated by the researcher. Inter-coder reliabilities for study 1 and 2 was .89 and .85. To examine hypotheses, descriptive analysis, correlations, and cross-tap analysis were performed. Results - The results partially supported the hypotheses proposing the significant role of sports stars as the agenda setters in distributing sport media contents. In specific, the study found that the number of articles about sports stars prevailed the number of articles about regular athletes. Besides, studies found that the use of photos was more frequent in articles of sports starts than that of regular athletes. In sports newspaper articles, featured story articles were used more than straight-articles for news relating to sports stars. Also, sports newspaper of sports stars contained more information associated within an event rather than outside of an event. Conclusions - In sports journalism, this study challenges the current theory that the media affects the composition and the content of sports coverages. As the principle of the agenda-setting of sports media, the influence of sports stars must be continuously studied along with a follow-up study.

Simulation of Low-Grazing-Angle Coherent Sea Clutter (Low Grazing Angle에서의 코히어런트 해상 클러터 시뮬레이션)

  • Choi, Sang-Hyun;Song, Ji-Min;Jeon, Hyeon-Mu;Chung, Yong-Seek;Kim, Jong-Mann;Hong, Seong-Won;Yang, Hoon-Gee
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.29 no.8
    • /
    • pp.615-623
    • /
    • 2018
  • The probability density function(PDF) for the amplitude of the reflectivity of low-grazing-angle sea clutter has generally been modeled by a compound-Gaussian distribution, rather than by the Rayleigh distribution, owing to the intensity variation of each clutter patch over time. The texture component forming the reflectivity has been simulated by combining Gamma distribution and memory-less nonlinear transformation(MNLT). On the other hand, there is no typical method available that can be used to simulate the speckle component. We first review Watt's method, wherein the speckle is simulated starting from the Doppler spectrum of the received echoes that is modeled as having a Gaussian shape. Then, we introduce a newly proposed method. The proposed method simulates the speckle by manipulating a clutter covariance matrix through the Cholesky decomposition after minimizing the effect of adjacent clutter patches using an equalizer. The feasibility of the proposed method is validated through simulation, wherein the results from two methods are compared in terms of the Doppler spectrum and the correlation function.

Optimal Design of Water Distribution System considering the Uncertainties on the Demands and Roughness Coefficients (수요와 조도계수의 불확실성을 고려한 상수도관망의 최적설계)

  • Jung, Dong-Hwi;Chung, Gun-Hui;Kim, Joong-Hoon
    • Journal of the Korean Society of Hazard Mitigation
    • /
    • v.10 no.1
    • /
    • pp.73-80
    • /
    • 2010
  • The optimal design of water distribution system have started with the least cost design of single objective function using fixed hydraulic variables, eg. fixed water demand and pipe roughness. However, more adequate design is accomplished with considering uncertainties laid on water distribution system such as uncertain future water demands, resulting in successful estimation of real network's behaviors. So, many researchers have suggested a variety of approaches to consider uncertainties in water distribution system using uncertainties quantification methods and the optimal design of multi-objective function is also studied. This paper suggests the new approach of a multi-objective optimization seeking the minimum cost and maximum robustness of the network based on two uncertain variables, nodal demands and pipe roughness uncertainties. Total design procedure consists of two folds: least cost design and final optimal design under uncertainties. The uncertainties of demands and roughness are considered with Latin Hypercube sampling technique with beta probability density functions and multi-objective genetic algorithms (MOGA) is used for the optimization process. The suggested approach is tested in a case study of real network named the New York Tunnels and the applicability of new approach is checked. As the computation time passes, we can check that initial populations, one solution of solutions of multi-objective genetic algorithm, spread to lower right section on the solution space and yield Pareto Optimum solutions building Pareto Front.

A Study on Limesurvey in the Form of Open Source Online Survey System for Curriculum Organizing (학교 교육과정 편성을 위한 오픈 소스 온라인 설문조사 시스템 Limesurvey 활용 방안)

  • Han, Ki-Sun;Chun, Seok-Ju
    • 한국정보교육학회:학술대회논문집
    • /
    • 2011.01a
    • /
    • pp.91-101
    • /
    • 2011
  • The purpose of this paper is to quickly identify school parents, teachers, students, community needs and opinions for curriculum organizing and the implementation of an online survey system for operating educational activities. Online survey system should be implemented based on Limesurvey to reduce costs and administrative costs. Limesurvery is available without the development of the separate program and offers the form of web-based template system, complete design, layout. Also, Limesurvey offers basic statistical analysis of survey data. Limesurvey can be executed by installing the program on a web hosting, typing database information. Limesurvey can be made a graph of the statistical results. Besides, Limesurvery can be stored in the form of HTML, Word, Excel, CSV Files and can be stured as basic datas for SPSS or PASW, R data, other statistical processing programs. If we could be operate Limesurvey in the form of open source-based survey program in elementary school, we could be reduced teacher's unnecessary work for statistics and overcame the problem of offline survey system.

  • PDF

Selecting Climate Change Scenarios Reflecting Uncertainties (불확실성을 고려한 기후변화 시나리오의 선정)

  • Lee, Jae-Kyoung;Kim, Young-Oh
    • Atmosphere
    • /
    • v.22 no.2
    • /
    • pp.149-161
    • /
    • 2012
  • Going by the research results of the past, of all the uncertainties resulting from the research on climate change, the uncertainty caused by the climate change scenario has the highest degree of uncertainty. Therefore, depending upon what kind of climate change scenario one adopts, the projection of the water resources in the future will differ significantly. As a matter of principle, it is highly recommended to utilize all the GCM scenarios offered by the IPCC. However, this could be considered to be an impractical alternative if a decision has to be made at an action officer's level. Hence, as an alternative, it is deemed necessary to select several scenarios so as to express the possible number of cases to the maximum extent possible. The objective standards in selecting the climate change scenarios have not been properly established and the scenarios have been selected, either at random or subject to the researcher's discretion. In this research, a new scenario selection process, in which it is possible to have the effect of having utilized all the possible scenarios, with using only a few principal scenarios and maintaining some of the uncertainties, has been suggested. In this research, the use of cluster analysis and the selection of a representative scenario in each cluster have efficiently reduced the number of climate change scenarios. In the cluster analysis method, the K-means clustering method, which takes advantage of the statistical features of scenarios has been employed; in the selection of a representative scenario in each cluster, the selection method was analyzed and reviewed and the PDF method was used to select the best scenarios with the closest simulation accuracy and the principal scenarios that is suggested by this research. In the selection of the best scenarios, it has been shown that the GCM scenario which demonstrated high level of simulation accuracy in the past need not necessarily demonstrate the similarly high level of simulation accuracy in the future and various GCM scenarios were selected for the principal scenarios. Secondly, the "Maximum entropy" which can quantify the uncertainties of the climate change scenario has been used to both quantify and compare the uncertainties associated with all the scenarios, best scenarios and the principal scenarios. Comparison has shown that the principal scenarios do maintain and are able to better explain the uncertainties of all the scenarios than the best scenarios. Therefore, through the scenario selection process, it has been proven that the principal scenarios have the effect of having utilized all the scenarios and retaining the uncertainties associated with the climate change to the maximum extent possible, while reducing the number of scenarios at the same time. Lastly, the climate change scenario most suitable for the climate on the Korean peninsula has been suggested. Through the scenario selection process, of all the scenarios found in the 4th IPCC report, principal climate change scenarios, which are suitable for the Korean peninsula and maintain most of the uncertainties, have been suggested. Therefore, it is assessed that the use of the scenario most suitable for the future projection of water resources on the Korean peninsula will be able to provide the projection of the water resources management that maintains more than 70~80% level of uncertainties of all the scenarios.

Numerical simulation of gasification of coal-water slurry for production of synthesis gas in a two stage entrained gasifier (2단 분류층 가스화기에서 합성가스 생성을 위한 석탄 슬러리 가스화에 대한 수치 해석적 연구)

  • Seo, Dong-Kyun;Lee, Sun-Ki;Song, Soon-Ho;Hwang, Jung-Ho
    • 한국신재생에너지학회:학술대회논문집
    • /
    • 2007.11a
    • /
    • pp.417-423
    • /
    • 2007
  • Oxy-gasification or oxygen-blown gasification, enables a clean and efficient use of coal and opens a promising way to CO2 capture. The coal gasification process of a slurry feed type, entrained-flow coal gasifier was numerically predicted in this paper. The purposes of this study are to develop an evaluation technique for design and performance optimization of coal gasifiers using a numerical simulation technique, and to confirm the validity of the model. By dividing the complicated coal gasification process into several simplified stages such as slurry evaporation, coal devolatilization, mixture fraction model and two-phase reactions coupled with turbulent flow and two-phase heat transfer, a comprehensive numerical model was constructed to simulate the coal gasification process. The influence of turbulence on the gas properties was taken into account by the PDF (Probability Density Function) model. A numerical simulation with the coal gasification model is performed on the Conoco-Philips type gasifier for IGCC plant. Gas temperature distribution and product gas composition are also presented. Numerical computations were performed to assess the effect of variation in oxygen to coal ratio and steam to coal ratio on reactive flow field. The concentration of major products, CO and H2 were calculated with varying oxygen to coal ratio (0.2-1.5) and steam to coal ratio(0.3-0.7). To verify the validity of predictions, predicted values of CO and H2 concentrations at the exit of the gasifier were compared with previous work of the same geometry and operating points. Predictions showed that the CO and H2 concentration increased gradually to its maximum value with increasing oxygen-coal and hydrogen-coal ratio and decreased. When the oxygen-coal ratio was between 0.8 and 1.2, and the steam-coal ratio was between 0.4 and 0.5, high values of CO and H2 were obtained. This study also deals with the comparison of CFD (Computational Flow Dynamics) and STATNJAN results which consider the objective gasifier as chemical equilibrium to know the effect of flow on objective gasifier compared to equilibrium. This study makes objective gasifier divided into a few ranges to study the evolution of the gasification locally. By this method, we can find that there are characteristics in the each scope divided.

  • PDF

A Study on the Improvement of Excavation and Research Process - With a Focus on Building a Silla Ancient Tombs Database - (문화재 발굴 조사·연구 과정의 개선 방안 연구 - 신라 고분 데이터베이스 구축을 중심으로 -)

  • Jung, Ikjae
    • Korean Journal of Heritage: History & Science
    • /
    • v.53 no.3
    • /
    • pp.4-23
    • /
    • 2020
  • In this article, the excavation and research of cultural assets were set as a process and the improvement measures were considered. To this end, we examined the process of excavating cultural assets to diagnose problems, suggested changes in the format of reports and the establishment of a database, and drew up improvement models for Silla's ancient tombs and research. The problems of the current process of excavating cultural assets are as follows. First, investigation and research fail to integrate and merely comprise 'examination as an administrative procedure' or 'investigation for the sake of investigation', which ultimately hamper research and achievement. Second, there are differences in the composition or description of the report by surveyors or excavation agencies, which make it difficult to integrate data at a higher level. Third, the current form of reporting remains in analog format such as books and PDFs, which not only reduces continuity and efficiency to the research phase, but also lags behind the rapidly changing times. We believe that the improvement of these problems should be achieved by computerizing reports, converting them into digital formats, and establishing them in a database. First, regarding the transition to report format, it was pointed out that the form of excavation data, the final stage of the excavation process, remains analog and the improvement model was presented from the perspective of linking it to excavation and research, and the justification was emphasized through comparison with other cases. Second, the database reviewed the build model for Silla tombs. To this end, the purpose and expected effects, targets, progress, attributes, categories, and interfaces were examined.