• Title/Summary/Keyword: Source language

Search Result 487, Processing Time 0.023 seconds

C-language Learning Contents Supporting Web-based Compiling and Running (웹기반 컴파일과 실행을 지원하는 C언어 교육콘텐츠 개발)

  • Kim, Seong-Hyun;Kim, Young-Kuk
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.796-800
    • /
    • 2006
  • In this paper, we developed an e-loaming contents for C programming language using Linux and open source software, not using commercial integrated development tool like Microsoft's Visual Studio. In most programming language courses, students study or practice the programming language by editing source code compiling and running the executable code by commercial software like Visual Studio which installed on each PC. This way of learning has some difficulties in total cost of purchasing software and using other PCs which donot have proper software installed. To overcome this situation and enable loaming anywhere, with any device, at anytime, we propose a way of utilizing Linux and open source software in Web-based learning environment. In this environment students can input their source code on the form of broweser and get the result instantly from the server.

  • PDF

Cross-Language Clone Detection based on Common Token (공통 토큰에 기반한 서로 다른 언어의 유사성 검사)

  • Hong, Sung-Moon;Kim, Hyunha;Lee, Jaehyung;Park, Sungwoo;Mo, Ji-Hwan;Doh, Kyung-Goo
    • Journal of Software Assessment and Valuation
    • /
    • v.14 no.2
    • /
    • pp.35-44
    • /
    • 2018
  • Tools for detecting cross-language clones usually compare abstract-syntax-tree representations of source code, which lacks scalability. In order to compare large source code to a practical level, we need a similarity checking technique that works on a token level basis. In this paper, we define common tokens that represent all tokens commonly used in programming languages of different paradigms. Each source code of different language is then transformed into the list of common tokens that are compared. Experimental results using exEyes show that our proposed method using common tokens is effective in detecting cross-language clones.

A Survey on Open Source based Large Language Models (오픈 소스 기반의 거대 언어 모델 연구 동향: 서베이)

  • Ha-Young Joo;Hyeontaek Oh;Jinhong Yang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.4
    • /
    • pp.193-202
    • /
    • 2023
  • In recent years, the outstanding performance of large language models (LLMs) trained on extensive datasets has become a hot topic. Since studies on LLMs are available on open-source approaches, the ecosystem is expanding rapidly. Models that are task-specific, lightweight, and high-performing are being actively disseminated using additional training techniques using pre-trained LLMs as foundation models. On the other hand, the performance of LLMs for Korean is subpar because English comprises a significant proportion of the training dataset of existing LLMs. Therefore, research is being carried out on Korean-specific LLMs that allow for further learning with Korean language data. This paper identifies trends of open source based LLMs and introduces research on Korean specific large language models; moreover, the applications and limitations of large language models are described.

The Reactivating of Allan Colquhoun's Architectural Theory - 'Figure', 'Form' and 'Image' - (앨런 코쿤(Allan Colquhoun)의 건축이론을 재활성화하기 위한 시론 - '형상(Figure)', '형태(Form)', 그리고 '이미지(Image)'-)

  • LEE, Dong-Eon
    • Journal of architectural history
    • /
    • v.7 no.1 s.14
    • /
    • pp.81-92
    • /
    • 1998
  • According to Post-modernists including deconstructivists, as Modernism is changed into Post-modernism, the paradigm is shifted from consciousness to language. The paradigm of consciousness corresponds to representational language, and the paradigm of language to self-referential one. In post-modern age most of architects are wandering what kind of language architecture is. Some theorists contend that architecture is representational, and others that it is self-referential. Allan Colquhoun, who is known as one of the best architectural theorists inUnited States, accepts both the former and the latter, but fails to reveal the meaning and the limitation, of the two languages. Although he believes that the representational language of architecture ('figure') is the source of self-referential language of architecture('form'), he never clearly answers what kind of language architecture. In order to overcome the limitation and the meaning of Colquhoun's figure and form, and synthesize the two language, this essay appropriates Martin Heidegger's some concepts, 'ready-to-hand,' 'present-at-hand' and 'being-in-the-world' to make a theoretical framework for 'image' which prevails over and synthesizes 'form' and 'figure.' Since Image is based upon both 'being-in-the world' and 'ready-to-hand,' it is the source of 'form' and 'figure.' When 'image' is fragmented, the former and the latter emerge. Image is therefore both the former and the latter because it represents and self-refers a world as a reality.

  • PDF

Extension of Code Refactoring Technique to Support Energy Efficiency and Language Conversion of Embedded Software (임베디드 소프트웨어의 에너지 효율성과 언어 변환 지원을 위한 코드 리팩토링 기법 확장)

  • Nam, Seungwoo;Hong, Jang-Eui
    • Journal of Convergence for Information Technology
    • /
    • v.8 no.2
    • /
    • pp.91-103
    • /
    • 2018
  • Refactoring is an engineering technique for securing the quality of existing legacy code, improving the internal structure without changing the functionality of the software. Along with the reuse of open source software, reuse of source code through programming language conversion is increasingly required due to technical or market requirements. In this situation, the refactoring technique including language conversion as well as energy efficiency is considered to be an important means for improving the productivity and the quality of embedded software development. This paper proposes a code refactoring technique that converts the grammar and structure of a programming language into those of a different language through comparison and mapping, in addition to the existing energy efficient refactoring technique. The use of the proposed refactoring technique can expect to improve the competitiveness of the product through rapid software development and quality improvement by coping with the environment change of the software development language and enhancing the reuse of the existing code.

Linguistics in Postmodern Science Fiction: Delany's Babel 17 and Stephenson's Snow Crash

  • Kim, Il-Gu
    • English Language & Literature Teaching
    • /
    • v.12 no.2
    • /
    • pp.41-59
    • /
    • 2006
  • As the late partner to science fiction, various experimental languages such as animal language, telepathic language, newly invented language, alien language often appear as "unexpected and frightened situations" in SF. Like generative semanticists, some SF writers daringly delve into the sacred mystery of semantics in language whereas others avoid the dream of a universal language by holding themselves to manageable data. Samuel Delany's description of the ideal telepathic universal language in Babel 17 shows us humans' dream to be like God by showing to us the new process of communication in the factual interplanetary environment. Similar to the mystery of alien language in SF, the baby's babbling reveals how language is both simple and complicated. Children's language shows us the changing process of a soul revealed by language use and it is no wonder that many languages of AIs in SF often borrow their source from children's language acquisition processes. In short, science fiction as the repository of tropes illuminates other literary language studies and other literary genres. Especially in terms of the futuristic study of linguistics, the relationship between science fiction and linguistics is much closer than we thought.

  • PDF

Translation Disambiguation Based on 'Word-to-Sense and Sense-to-Word' Relationship (`단어-의미 의미-단어` 관계에 기반한 번역어 선택)

  • Lee Hyun-Ah
    • The KIPS Transactions:PartB
    • /
    • v.13B no.1 s.104
    • /
    • pp.71-76
    • /
    • 2006
  • To obtain a correctly translated sentence in a machine translation system, we must select target words that not only reflect an appropriate meaning in a source sentence but also make a fluent sentence in a target language. This paper points out that a source language word has various senses and each sense can be mapped into multiple target words, and proposes a new translation disambiguation method based on this 'word-to-sense and sense-to-word' relationship. In my method target words are chosen through disambiguation of a source word sense and selection of a target word. Most of translation disambiguation methods are based on a 'word-to-word' relationship that means they translate a source word directly into a target wort so they require complicate knowledge sources that directly link a source words to target words, which are hard to obtain like bilingual aligned corpora. By combining two sub-problems for each language, knowledge for translation disambiguation can be automatically extracted from knowledge sources for each language that are easy to obtain. In addition, disambiguation results satisfy both fidelity and intelligibility because selected target words have correct meaning and generate naturally composed target sentences.

Multi-source information integration framework using self-supervised learning-based language model (자기 지도 학습 기반의 언어 모델을 활용한 다출처 정보 통합 프레임워크)

  • Kim, Hanmin;Lee, Jeongbin;Park, Gyudong;Sohn, Mye
    • Journal of Internet Computing and Services
    • /
    • v.22 no.6
    • /
    • pp.141-150
    • /
    • 2021
  • Based on Artificial Intelligence technology, AI-enabled warfare is expected to become the main issue in the future warfare. Natural language processing technology is a core technology of AI technology, and it can significantly contribute to reducing the information burden of underrstanidng reports, information objects and intelligences written in natural language by commanders and staff. In this paper, we propose a Language model-based Multi-source Information Integration (LAMII) framework to reduce the information overload of commanders and support rapid decision-making. The proposed LAMII framework consists of the key steps of representation learning based on language models in self-supervsied way and document integration using autoencoders. In the first step, representation learning that can identify the similar relationship between two heterogeneous sentences is performed using the self-supervised learning technique. In the second step, using the learned model, documents that implies similar contents or topics from multiple sources are found and integrated. At this time, the autoencoder is used to measure the information redundancy of the sentences in order to remove the duplicate sentences. In order to prove the superiority of this paper, we conducted comparison experiments using the language models and the benchmark sets used to evaluate their performance. As a result of the experiment, it was demonstrated that the proposed LAMII framework can effectively predict the similar relationship between heterogeneous sentence compared to other language models.

Benchmarking of BioPerl, Perl, BioJava, Java, BioPython, and Python for Primitive Bioinformatics Tasks and Choosing a Suitable Language

  • Ryu, Tae-Wan
    • International Journal of Contents
    • /
    • v.5 no.2
    • /
    • pp.6-15
    • /
    • 2009
  • Recently many different programming languages have emerged for the development of bioinformatics applications. In addition to the traditional languages, languages from open source projects such as BioPerl, BioPython, and BioJava have become popular because they provide special tools for biological data processing and are easy to use. However, it is not well-studied which of these programming languages will be most suitable for a given bioinformatics task and which factors should be considered in choosing a language for a project. Like many other application projects, bioinformatics projects also require various types of tasks. Accordingly, it will be a challenge to characterize all the aspects of a project in order to choose a language. However, most projects require some common and primitive tasks such as file I/O, text processing, and basic computation for counting, translation, statistics, etc. This paper presents the benchmarking results of six popular languages, Perl, BioPerl, Python, BioPython, Java, and BioJava, for several common and simple bioinformatics tasks. The experimental results of each language are compared through quantitative evaluation metrics such as execution time, memory usage, and size of the source code. Other qualitative factors, including writeability, readability, portability, scalability, and maintainability, that affect the success of a project are also discussed. The results of this research can be useful for developers in choosing an appropriate language for the development of bioinformatics applications.

Language Anxieties Second Language Learning

  • Park, Seon-Ho
    • Korean Journal of English Language and Linguistics
    • /
    • v.2 no.3
    • /
    • pp.373-401
    • /
    • 2002
  • It is often observed that Korean migrant students overseas experience various kinds of anxieties learning English as a second language although they are in an English-speaking country like New Zealand. The context of learning English as a second language is explored by examining language anxieties experienced by recent Korean migrant students in New Zealand. 177 students were surveyed using questionnaires asking their anxieties over various contexts of English learning processes. The three stages of language anxiety of Input, Processing, and Output showed that there were some degrees of anxiety among the students at each stage depending on their ages, age at migration, and duration of residence, in particular. Students tended to experience more language anxiety in school than outside the school. It was also clear that students were experiencing more anxieties with English than with Korean in the four language skills of speaking, listening, reading and writing. Writing was commonly found in English and Korean to be the most frequent source of anxiety among the four language skills. Some implications from the results are suggested for parents, teachers, and students.

  • PDF