• Title/Summary/Keyword: python language

Search Result 140, Processing Time 0.021 seconds

Proposal For Improving Data Processing Performance Using Python (파이썬 활용한 데이터 처리 성능 향상방법 제안)

  • Kim, Hyo-Kwan;Hwang, Won-Yong
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.4
    • /
    • pp.306-311
    • /
    • 2020
  • This paper deals with how to improve the performance of Python language with various libraries when developing a model using big data. The Python language uses the Pandas library for processing spreadsheet-format data such as Excel. In processing data, Python operates on an in-memory basis. There is no performance issue when processing small scale of data. However, performance issues occur when processing large scale of data. Therefore, this paper introduces a method for distributed processing of execution tasks in a single cluster and multiple clusters by using a Dask library that can be used with Pandas when processing data. The experiment compares the speed of processing a simple exponential model using only Pandas on the same specification hardware and the speed of processing using a dask together. This paper presents a method to develop a model by distributing a large scale of data by CPU cores in terms of performance while maintaining that python's advantage of using various libraries is easy.

Formal Semantics Based on Action Equation 2.0 for Python (작용식 2.0 기반 파이썬에 대한 형식 의미론)

  • Han, Jung Lan
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.10 no.6
    • /
    • pp.163-172
    • /
    • 2021
  • To specify a formal semantics for a programming language is to do a significant part for design, standardization and translation of it. The Python is popular and powerful, it is necessary to do research for a formal semantics to specify a static and dynamic semantics for Python clearly in order to design a similar language and do an efficient translation. This paper presents the Action Equation 2.0 that specifies a formal semantics for Python to change and update Action Equation. To measure the execution time for Python programs, we implemented the semantic structure specified in Action Equation 2.0 in Java, and prove through simulation that Action Equation 2.0 is a real semantic structure that can be implemented. The specified Action Equation 2.0 is compared to other descriptions, in terms of readability, modularity, extensibility, and flexibility and then we verified that Action Equation 2.0 is superior to other formal semantics.

Performance Comparison of Python and Scala APIs in Spark Distributed Cluster Computing System (Spark 기반에서 Python과 Scala API의 성능 비교 분석)

  • Ji, Keung-yeup;Kwon, Youngmi
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.2
    • /
    • pp.241-246
    • /
    • 2020
  • Hadoop is a framework to process large data sets in a distributed way across clusters of nodes. It has been a popular platform to process big data, but in recent years, other platforms became competitive ones depending on the characteristics of the application. Spark is one of distributed platforms to enable real-time data processing and improve overall processing performance over Hadoop by introducing in-memory processing instead of disk I/O. Whereas Hadoop is designed to work on Java and data analysis is processed using Java API, Spark provides a variety of APIs with Scala, Python, Java and R. In this paper, the goal is to find out whether the APIs of different programming languages af ect the performances in Spark. We chose two popular APIs: Python and Scala. Python is easy to learn and is used in AI domain in a wide range. Scala is a programming language with advantages of parallelism. Our experiment shows much faster processing with Scala API than Python API. For the performance issues on AI-based analysis, further study is needed.

Development of Python Education Program for Block Coding Learners (블록코딩 선행학습자를 위한 Python 교육 프로그램 개발)

  • Kim, Taeryeong;Han, Sungwan
    • Journal of The Korean Association of Information Education
    • /
    • v.22 no.1
    • /
    • pp.53-60
    • /
    • 2018
  • In this study we have developed a Python education program that can be applied to students who have studied block-based coding. We have developed a Python education program based on the extracted the learners' level of block-based coding by analyzing the programs and the textbooks. We extracted the grammar of the block-based coding and constructed the curriculum. Then, the Python education program was composed by 16 hours. After reviewing the appropriateness of the education program through expert validation, it was concluded that the developed Python education program is suitable for applying to learners of block-based coding. We expect that proposed program will be effectively applied as basic resources to learn script coding in class.

Coding Helper for Python Beginners based on the Large Language Model(LLM) (대규모 언어 모델(LLM) 기반의 파이썬 입문자를 위한 코딩 도우미)

  • Se-Hoon Lee;Jeong-Bin Choi;Yong-Tae Baek;Sun-Ho Yoon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2023.07a
    • /
    • pp.389-390
    • /
    • 2023
  • 본 논문에서는 파이썬 코딩 플랫폼에서의 LLM(Large Language Models)을 로직 및 문법 에러 확인, 디버깅 도구로 활용할 수 있는 시스템을 제안한다. 이 시스템은 사용자가 코딩 플랫폼에서 작성한 파이썬 코드와 함께 발생한 에러 문구 및 프롬프트를 LLM 모델에 입력함으로써 로직(문법) 에러를 식별하고 디버깅에 활용할 수 있다. 특히, 입문자를 고려해 프롬프트를 제한하여 사용의 편의성을 높인다. 이를 통해 파이썬 코딩 교육에서 입문자들의 학습 과정을 원활하게 진행할 수 있으며, 파이썬 코딩에 대한 진입 장벽을 낮출 수 있다.

  • PDF

Using Python Programming Language for Teaching Industrial Engineering Subjects: A Case Study on Engineering Economy (산업공학 전공 교과목 강의를 위한 파이썬 프로그래밍 활용: 경제성공학 교육 사례 연구)

  • Cho, Yongkyu
    • Journal of Practical Engineering Education
    • /
    • v.14 no.2
    • /
    • pp.245-258
    • /
    • 2022
  • Computational thinking with programming skills has been widely emphasized for future industrial engineering researchers and practitioners in Industry 4.0. However, industrial engineering students still have limited opportunities to improve their computational thinking abilities during university coursework. In this regard, this research study proposes to use Python programming language for teaching classical Industrial Engineering subjects. For a specific case study, we designed and instructed an Engineering Economy lecture which cultivates the concept and techniques of economic analysis for engineering students. During the class, we introduced the usage of several Python libraries that include numpy-financial for basic financial functions, numpy and scipy for simple numerical computation and analysis, and matplotlib for data visualization. Anonymous class evaluation survey showed the effectiveness of the proposed teaching method in terms of both educational satisfaction and contents delivery. Finally, we found additional needs for providing lectures that adopt the similar teaching style to the proposed method.

A Scraping Method of In-Frame Web Sources Using Python (파이썬을 이용한 프레임내 웹 페이지 스크래핑 기법)

  • Yun, Sujin;Seung, Li;Woo, Young Woon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.271-274
    • /
    • 2019
  • In this paper, we proposed a detailed address acquisition scheme for automatically collecting data of a web page in a frame that is difficult to access by a general web access method. Using the Python language and the Beautiful Soup library, which can utilize the proposed address resolution technique and the HTML selector, we were able to automatically collect all the bulletin board text data written in several pages. By using the proposed method, we can collect large amount of data automatically by Python web scraping program for web pages of any form of address, and we expect that it can be used for big data analysis.

  • PDF

Introduction to numba library in Python for efficient statistical computing (효율적인 통계 계산을 위한 파이썬 numba 라이브러리의 소개)

  • Cho, Younsang;Yu, Donghyeon;Son, Won;Park, Seoncheol
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.6
    • /
    • pp.665-682
    • /
    • 2020
  • This paper introduces numba library in Python, which improves computational efficiency of the provided implemented code written by naive Python language by applying just-in-time (JIT) compilation. To apply just-in-time compilation, the numba only needs to use a decorator on a target Python function. We provide implementation examples with numba for the permutation test and the parameter estimation for Gaussian mixture distribution. We also numerically show the efficiency of numba by comparing the total computation times of the implementation using naive python and the implementation using numba for each application.

Python Basic Programming Curriculum for Non-majors and Development Analysis of Evaluation Problems (비전공자를 위한 파이썬 기초 프로그래밍 커리큘럼과 평가문제 개발분석)

  • Hur, Kyeong
    • Journal of Practical Engineering Education
    • /
    • v.14 no.1
    • /
    • pp.75-83
    • /
    • 2022
  • Most of the courses that teach the Python programming language are liberal arts courses that all students in general universities must complete. Through this, non-major students who have learned the basic programming process based on computational thinking are strengthening their convergence capabilities to apply SW in various major fields. In the previous research results, various evaluation methods for understanding the concept of computational thinking and writing code were suggested. However, there are no examples of evaluation problems, so it is difficult to apply them in actual course operation. Accordingly, in this paper, a Python basic programming curriculum that can be applied as a liberal arts subject for non-majors is proposed according to the ADDIE model. In addition, the case of evaluation problems for each Python element according to the proposed detailed curriculum was divided into 1st and 2nd phases and suggested. Finally, the validity of the proposed evaluation problem was analyzed based on the evaluation scores of non-major students calculated in the course to which this evaluation problem case was applied. It was confirmed that the proposed evaluation problem case was applied as a real-time online non-face-to-face evaluation method to effectively evaluate the programming competency of non-major students.

Language-based Classification of Words using Deep Learning (딥러닝을 이용한 언어별 단어 분류 기법)

  • Zacharia, Nyambegera Duke;Dahouda, Mwamba Kasongo;Joe, Inwhee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.411-414
    • /
    • 2021
  • One of the elements of technology that has become extremely critical within the field of education today is Deep learning. It has been especially used in the area of natural language processing, with some word-representation vectors playing a critical role. However, some of the low-resource languages, such as Swahili, which is spoken in East and Central Africa, do not fall into this category. Natural Language Processing is a field of artificial intelligence where systems and computational algorithms are built that can automatically understand, analyze, manipulate, and potentially generate human language. After coming to discover that some African languages fail to have a proper representation within language processing, even going so far as to describe them as lower resource languages because of inadequate data for NLP, we decided to study the Swahili language. As it stands currently, language modeling using neural networks requires adequate data to guarantee quality word representation, which is important for natural language processing (NLP) tasks. Most African languages have no data for such processing. The main aim of this project is to recognize and focus on the classification of words in English, Swahili, and Korean with a particular emphasis on the low-resource Swahili language. Finally, we are going to create our own dataset and reprocess the data using Python Script, formulate the syllabic alphabet, and finally develop an English, Swahili, and Korean word analogy dataset.