• Title/Summary/Keyword: R Language

Search Result 510, Processing Time 0.025 seconds

A Web Application for Open Data Visualization Using R (R 이용 오픈데이터 시각화 웹 응용)

  • Kim, Kwang-Seob;Lee, Ki-Won
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.17 no.2
    • /
    • pp.72-81
    • /
    • 2014
  • As big data are one of main issues in the recent days, the interests on their technologies are also increasing. Among several technological bases, this study focuses on data visualization and R based on open source. In general, the term of data visualization can be summarized as the web technologies for constructing, manipulating and displaying various types of graphic objects in the interactive mode. R is an operating environment or a language for statistical data analysis from basic to advanced level. In this study, a web application with these technological aspects and components is newly implemented and exemplified with data visualization for geo-based open data provided by public organizations or government agencies. This application model does not need users' data building or proprietary software installation. Futhermore it is designed for users in the geo-spatial application field with less experiences and little knowledges about R. The results of data visualization by this application can support decision making process of web users accessible to this service. It is expected that the more practical and various applications with R-based geo-statistical analysis functions and complex operations linked to big data contribute to expanding the scope and the range of the geo-spatial application.

Similarity calculation between national R&D reports using co-occurrence (문서의 공기관계를 이용하여 국가 R&D 보고서간 유사도 계산)

  • Kim, Nam-Hun;Joo, Jong-Min;Park, Hyuk-Ro;Yang, Hyung-Jeong;Choi, Kwang-Nam
    • 한국어정보학회:학술대회논문집
    • /
    • 2016.10a
    • /
    • pp.201-204
    • /
    • 2016
  • 본 논문에서는 문서의 공기관계를 통해 추출된 문서의 특징을 이용하여 유사 보고서를 판별하는 시스템을 제안한다. 국가 R&D 보고서의 XML형식 파일에서 텍스트를 추출 후, 문장 단위로 나누어 각 문장의 공기관계를 추출한다. 그 후 공기관계의 노드와 엣지를 문서에 추가하고, 노드로 사용된 단어만 남기고 나머지 단어는 제외한다. 그리고 이것을 문서의 특징으로 삼고 유사도 계산을 한다. 이 때, 유사도 계산은 코사인 유사도를 사용한다. 실험결과, 국가 R&D문서 유사도 계산에서 제안된 방법이 기존의 방법보다 높은 분류율을 보여주었다.

  • PDF

Parallel Computing Environment for R with on Supercomputer Systems (빅데이터 분석을 위한 슈퍼컴퓨터 환경에서 R의 병렬처리)

  • Lee, Sang Yeol;Won, Joong Ho
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.39 no.4
    • /
    • pp.19-31
    • /
    • 2014
  • We study parallel processing techniques for the R programming language of high performance computing technology. In this study, we used massively parallel computing system which has 25,408 cpu cores. We conducted a performance evaluation of a distributed memory system using MPI and of a the shared memory system using OpenMP. Our findings are summarized as follows. First, For some particular algorithms, parallel processing is about 150 times faster than serial processing in R. Second, the distributed memory system gets faster as the number of nodes increases while shared memory system is limited in the improvement of performance, due to the limit of the number of cpus in a single system.

Interactive Statistics Laboratory using R and Sage (R을 활용한 '대화형 통계학 입문 실습실' 개발과 활용)

  • Lee, Sang-Gu;Lee, Geung-Hee;Choi, Yong-Seok;Lee, Jae Hwa;Lee, Jenny Jyoung
    • Communications of Mathematical Education
    • /
    • v.29 no.4
    • /
    • pp.573-588
    • /
    • 2015
  • In this paper, we introduce development process and application of a simple and effective model of a statistics laboratory using open source software R, one of leading language and environment for statistical computing and graphics. This model consists of HTML files, including Sage cells, video lectures and enough internet resources. Users do not have to install statistical softwares to run their code. Clicking 'evaluate' button in the web page displays the result that is calculated through cloud-computing environment. Hence, with any type of mobile equipment and internet, learners can freely practice statistical concepts and theorems via various examples with sample R (or Sage) codes which were given, while instructors can easily design and modify it for his/her lectures, only gathering many existing resources and editing HTML file. This will be a resonable model of laboratory for studying statistics. This model with bunch of provided materials will reduce the time and effort needed for R-beginners to be acquainted with and understand R language and also stimulate beginners' interest in statistics. We introduce this interactive statistical laboratory as an useful model for beginners to learn basic statistical concepts and R.

Building Sentence Meaning Identification Dataset Based on Social Problem-Solving R&D Reports (사회문제 해결 연구보고서 기반 문장 의미 식별 데이터셋 구축)

  • Hyeonho Shin;Seonki Jeong;Hong-Woo Chun;Lee-Nam Kwon;Jae-Min Lee;Kanghee Park;Sung-Pil Choi
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.4
    • /
    • pp.159-172
    • /
    • 2023
  • In general, social problem-solving research aims to create important social value by offering meaningful answers to various social pending issues using scientific technologies. Not surprisingly, however, although numerous and extensive research attempts have been made to alleviate the social problems and issues in nation-wide, we still have many important social challenges and works to be done. In order to facilitate the entire process of the social problem-solving research and maximize its efficacy, it is vital to clearly identify and grasp the important and pressing problems to be focused upon. It is understandable for the problem discovery step to be drastically improved if current social issues can be automatically identified from existing R&D resources such as technical reports and articles. This paper introduces a comprehensive dataset which is essential to build a machine learning model for automatically detecting the social problems and solutions in various national research reports. Initially, we collected a total of 700 research reports regarding social problems and issues. Through intensive annotation process, we built totally 24,022 sentences each of which possesses its own category or label closely related to social problem-solving such as problems, purposes, solutions, effects and so on. Furthermore, we implemented four sentence classification models based on various neural language models and conducted a series of performance experiments using our dataset. As a result of the experiment, the model fine-tuned to the KLUE-BERT pre-trained language model showed the best performance with an accuracy of 75.853% and an F1 score of 63.503%.

Web-Publikation in der deutschen Linguistik (독어학 분야의 웹 출판)

  • Chung Mun Yong
    • Koreanishche Zeitschrift fur Deutsche Sprachwissenschaft
    • /
    • v.3
    • /
    • pp.327-346
    • /
    • 2001
  • Das Ziel dieser Arbeit liegt darin, den Bestand der wissenschaftlichen Web-Publikationen in der deutschen Linguistik darzustellen. Das Internet bietet heute $f\"{u}r$ die Forschung bereits zwei der wichtigsten produktiven $M\"{o}glichkeiten;n\"{a}mlich$ Information und Kommunikation. Akademische Kreise haben diverse Homepages entwickelt. Der schnelle Zugang zu aktuellen bibliographischen Daten und Forschungsergebnissen hat $f\"{u}r$ koreanische Germanisten einen besonders hohen Stellenwert. Wissenschaftliches Publizieren in Form von Fachzeitschriften ist ein gutes Modell $daf\"{u}r$. Fachzeitschriften erscheinen weltweit und relativ schnell, erreichen aber nur geringe Auflagen. Der Leserkreis ist fast identisch mit der Gruppe der potentiellen Autoren und Herausgeber. Ein Vorteil des elektronischen Publizierens ist die M\"{o}glichkeit$ multimeiale Dokumente und $weiterf\"{u}hrende$ Hyperlinks zu integrieren. Aber die $Qualit\"{a}t\;der\;Aufs\"{a}tze$ kann man kaum objektiv ermitteln und nur schwer beurteilen. Elektronische Zeitschriften $k\"{o}nnen$ sich in der Wissenschaft nur dann etablieren, wenn es gelingt, als wissenschaftliche Arbeiten von den wissenschaftlichen Kreisen oder von der Univerwaltung anerkannt zu werden. Folgende on-line wissenschaftliche Fachzeitschriften werden hier dargestellt; Linguistik online(ISSN 1615-3014), The Web Journal of Modern Language Linguistics(ISSN 1461-4499), PhiN(ISSN 1433-7177), Zeitschrift $f\"{u}r$ interkulturellen Fremdsprachenunterricht(ISSN: 1205-6545), und Language Learning & Technology(ISSN 1094-3501). 1)http://viadrina.euv-frankfurt-o.de/$\~wjoumal/deutsch/$ 2)http://wjmll.ncl.ac.uk/ 3)http://www.fu-berlin.de/phin/ 4)http://www.ualberta.ca/$\~german/ejoumal/$ 5)http://llt.msu.edu/ In der folgenden Homepage kann man auch eine Quellensammlung zu 'Dissertationen Online' finden. 6) http://www.educat.hu-berlin.de/$diss\_online/biblio.html$ Eine individuelle und institutionelle Offenheit und eine $n\"{u}chteme$ Anwendung der Materialien sind bei der Herstellung und Nutzung von Forschungsergebnissen erforderlich.

  • PDF

Distribution of /ju/ After Coronal Sonorant Consonants in British English (영국영어에서 치경공명자음 뒤의 /ju/ 분포)

  • Hwangbo, Young-shik
    • Journal of English Language & Literature
    • /
    • v.56 no.5
    • /
    • pp.851-870
    • /
    • 2010
  • The purpose of this paper is to investigate the distribution of /ju/ in British English, especially after the coronal sonorants /n, l, /r/. The sequence /ju/ is related with vowels such as /u/, /ʊ/, and /ʊ/, and has occasioned a variety of conflicting analyses or suggestions. One of those is in which context /j/ is deleted if we suppose that the underlying form is /ju/. The context differs according to the dialect we deal with. In British English, it is known that /j/ is deleted always after /r/, and usually after /l/ when it occurs in an unstressed word-medial syllable. To check this well-known fact I searched OED Online (the 2nd Edition, 1989) for those words which contain /n, l, r/ + /ju, jʊ, u, ʊ, (j)u, (j)ʊ/ in their pronunciations, using the search engine provided by OED Online. After removing some unnecessary words, I classified the collected words into several groups according to the preceding sonorant consonants, the positions, and the presence (or absence) of the stress, of the syllable where /ju/ occurs. The results are as follows: 1) the deletion of /j/ depends on the sonorant consonant which /ju/ follows, the position where it occurs, and the presence of the stress which /ju/ bears; 2) though the influence of the sonorant consonants is strong, the position and stress also have non-trivial effect on the deletion of /j/, that is, the word-initial syllable and the stressed syllable prefer the deletion of /j/, and word-medial and unstressed syllable usually retain /j/; 3) the stress and position factors play their own roles even in the context where the effect of /n, l, r/ is dominant.

Psychoeducational Profile-Revised, Korean Wechsler Preschool and Primary Scale of Intelligence, Fourth Edition, and the Vineland Adaptive Behavior Scale, Second Edition: Comparison of Utility for Developmental Disabilities in Preschool Children

  • Sumi Ryu;Taeyeop Lee;Yunshin Lim;Haejin Kim;Go-eun Yu;Seonok Kim;Hyo-Won Kim
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.34 no.4
    • /
    • pp.258-267
    • /
    • 2023
  • Objectives: This study aimed to compare the utility of the Psychoeducational Profile-Revised (PEP-R), Korean Wechsler Preschool and Primary Scale of Intelligence, Fourth Edition (K-WPPSI-IV), and Vineland Adaptive Behavior Scale, Second Edition (VABS-II) for evaluating developmental disabilities (DD) in preschool children. Additionally, we examined the correlations between the PEP-R, K-WPPSI-IV, and VABS-II. Methods: A total of 164 children aged 37-84 months were assessed. Children's development was evaluated using the PEP-R, K-WPPSI-IV, VABS-II, Preschool Receptive-Expressive Language Scale, and Korean Childhood Autism Rating Scale, Second Edition. Results: Of the 164 children, 103 had typical development (TD) and 61 had DD. The mean of the PEP-R Developmental Quotient (DQ), K-WPPSI-IV Full-Scale Intelligence Quotient (FSIQ), and VABS-II Adaptive Behavior Composite (ABC) scores were significantly higher in the TD group than in the DD group (p<0.001). The estimated area under the curve of the PEP-R DQ, K-WPPSI-IV FSIQ, and VABS-II ABC scores was 0.953 (95% confidence interval [CI]=0.915-0.992), 0.955 (95% CI=0.914-0.996), and 0.961 (95% CI=0.932-0.991), respectively, which did not indicate a statistically significant difference. The PEP-R DQ scores were positively correlated with the K-WPPSI-IV FSIQ (r=0.90, p<0.001) and VABS-II ABC scores (r=0.84, p<0.001). A strong correlation was observed between the K-WPPSI-IV FSIQ and VABS-II ABC scores (r=0.89, p<0.001). Conclusion: This study found that the PEP-R, K-WPPSI-IV, and VABS-II effectively distinguished DD from TD in preschool children, and no significant differences in utility were observed between them.

Uncertain Knowledge Processing for Oriental Medicine Diagnostic Model (한의 진단 모델의 추론 과정에서 발생하는 불확실한 진단 지식의 처리)

  • Shin, Yang-Kyu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.8 no.1
    • /
    • pp.1-7
    • /
    • 1997
  • The inference process for medical expert system is mostly formed by diagnostic knowledge on the if-then rule base. Oriental medicine diagnostic knowledge, however, may involve uncertain knowledge caused by ambiguous concept. In this paper, we analyze an oriental medicine diagnostic process by a rule-based inference system, and propose a method for representing and processing uncertain oriental medicine diagnostic knowledge using CLP( R ) which is a kind of constraint satisfaction program.

  • PDF

A Study of SPRT and EXSPRT-R Appling Foreign Language Test (SPRT와 EXSPRT-R 검증법의 언어능력 시험적용에 대한 연구)

  • Kim, Myung-Gwan;Kim, Ji-Han
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2005.05a
    • /
    • pp.989-992
    • /
    • 2005
  • CAT(Computer Adaptive Testing : 컴퓨터 기반 적응적 검사)는 기존의 종이 시험지에서 이루어지던 시험과 달리 수험자에게 적절한 맞춤식 출제로 보다 정확한 수험자의 능력 판단 및 빠른 수험진행을 가능케 하였다. 기존의 CAT는 많은 인원과 문제가 있어야만 그 결과에 신뢰성이 있다고 알려져 있다. CAT의 대표적인 알고리즘인 SPRT와 EXSPRT-R을 이용하여 10명의 적은 인원으로 JLPT 4급 기출문제를 적용한 실험을 하였다. SPRT 에서는 인원수와, 문제 난이도를 무시한 결과로 인하여 만족 할만한 결과를 얻지 못하였으나, EXSPRT-R의 경우에는 적은 인원에서도 충분히 CAT를 이용할 수 있음을 발견할 수 있었다.

  • PDF