• Title/Summary/Keyword: visual language

Search Result 712, Processing Time 0.03 seconds

Implementation of SMIL Editor for Multimedia Broadcasting (멀티미디어 방송을 위한 SMIL 편집 시스템 구현)

  • 장대영;김창수;정회경
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.3
    • /
    • pp.622-629
    • /
    • 2004
  • Recently, as digital broadcasting and internet are spreaded out of the world, we can easily use informations with less restrictions of time and space. According to the current trends, concerns for the ways of representing multimedia data has been rapidly increased, and users demand the services with integrated document that takes not only simple text and image but also time varying audio-visual data. Therefore, in 1998, W3C presented an international standard, SMIL in order to solve multimedia object representation and synchronization problems. By using SMIL, various multimedia elements can be integrated as a multimedia document with proper view in a space and time. Using this SMIL document, we can create new internet radio broadcasting service that delivers not only audio data but also various text, image and video. In this paper, we describe on a SMIL document editor for the common users to be able to represent time varying multimedia data with special layout and synchronization of time and space.

Active Vision from Image-Text Multimodal System Learning (능동 시각을 이용한 이미지-텍스트 다중 모달 체계 학습)

  • Kim, Jin-Hwa;Zhang, Byoung-Tak
    • Journal of KIISE
    • /
    • v.43 no.7
    • /
    • pp.795-800
    • /
    • 2016
  • In image classification, recent CNNs compete with human performance. However, there are limitations in more general recognition. Herein we deal with indoor images that contain too much information to be directly processed and require information reduction before recognition. To reduce the amount of data processing, typically variational inference or variational Bayesian methods are suggested for object detection. However, these methods suffer from the difficulty of marginalizing over the given space. In this study, we propose an image-text integrated recognition system using active vision based on Spatial Transformer Networks. The system attempts to efficiently sample a partial region of a given image for a given language information. Our experimental results demonstrate a significant improvement over traditional approaches. We also discuss the results of qualitative analysis of sampled images, model characteristics, and its limitations.

Effects of EAI and VAS on perceptual judgement and confidence rating by listeners for voice disorders (청지각적 평가 방식에 따른 음성장애 심한 정도 판단과 자가 신뢰도에 대한 차이)

  • Lee, Ok-Bun;Kim, Sun-Hee;Jeong, Hanjin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.5
    • /
    • pp.3046-3050
    • /
    • 2014
  • The purpose of the present study was to evaluate the effect of 7-point interval scale(EAI) and visual analogue scale(VAS) on perceptual judgement and the reliability of severity on voice problems by dysphonic speakers. 30 undergraduate students studying communication disorder were enrolled in the perceptual evaluation. Those listeners judged overall voice severity within the anchored(condition 1) and non-anchored scales(condition 2) for vowel prolongation and reading tasks by 25 speakers with voice disorder. The results of this study showed that the scores by VAS was significantly higher than EAI in both condition 1 and condition 2 for vowel prolongation and reading task. However, the scores by EAI method was higher than by VAS method on voice severity of vowel prolongation (condition 1) and reading task(condition 2). These results suggest auditory-perceptual scaling procedures must be more studied in the aspects of clinical application of voice disorder.

Analysis of Artistic Symbol Expression of Movie Contents Focused on the film "Roma(2018)" (영화콘텐츠의 예술적 상징표현 분석연구 영화 "로마(2018)"을 중심으로)

  • Lee, Tae-Hoon
    • Journal of Digital Convergence
    • /
    • v.17 no.11
    • /
    • pp.475-482
    • /
    • 2019
  • Analyzing the inner meaning and expression of philosophy by analyzing the composition, symbolic expression, and style of the film with high artistic perfection, which contains the spirit of the times, considers human beings through society and history, and raises awareness of life and the present generation. It will be a very meaningful and valuable study in film as art. The movie 'Rome' was cut into the rest of the public's mind by being tempered, hidden and omitted, and the color was black and white. Many aesthetic attempts can be found through symbolic images expressing the ironic message of maid's daily life as a race, capital, socially oppressed history. It can be seen that he expresses his own authorism visual language by drawing symbolic expressions through many contrasts and symbolic expressions through objects. The analysis of commercial films containing these artistic values is expected to help in the future production as a measure of the progress of art films and precedents of authorism expression techniques.

An elasto-plastic damage constitutive model for jointed rock mass with an application

  • Wang, Hanpeng;Li, Yong;Li, Shucai;Zhang, Qingsong;Liu, Jian
    • Geomechanics and Engineering
    • /
    • v.11 no.1
    • /
    • pp.77-94
    • /
    • 2016
  • A forked tunnel, as a special complicated underground structure, is composed of big-arch tunnel, multi-arch tunnel, neighborhood tunnels and separate tunnels according to the different distances between two separate tunnels. Due to the complicated process of design and construction, surrounding jointed rock mass stability of the big-arch tunnel which belongs to the forked tunnel during excavation is a hot issue that needs special attentions. In this paper, an elasto-plastic damage constitutive model for jointed rock mass is proposed based on the coupling method considering elasto-plastic and damage theories, and the irreversible thermodynamics theory. Based on this elasto-plastic damage constitutive model, a three dimensional elasto-plastic damage finite element code (D-FEM) is implemented using Visual Fortran language, which can numerically simulate the whole excavation process of underground project and perform the structural stability of the surrounding rock mass. Comparing with a popular commercial computer code, three dimensional fast Lagrangian analysis of continua (FLAC3D), this D-FEM has advantages in terms of rapid computing process, element grouping function and providing more material models. After that, FLAC3D and D-FEM are simultaneously used to perform the structural stability analysis of the surrounding rock mass in the forked tunnel considering three different computing schemes. The final numerical results behave almost consistent using both FLAC3D and D-FEM. But from the point of numerically obtained damage softening areas, the numerical results obtained by D-FEM more closely approach the practical behaviors of in-situ surrounding rock mass.

A Technology Landscape of Artificial Intelligence: Technological Structure and Firms' Competitive Advantages (인공지능 기술 랜드스케이프 : 기술 구조와 기업별 경쟁우위)

  • Lee, Wangjae;Lee, Hakyeon
    • Journal of Korea Technology Innovation Society
    • /
    • v.22 no.3
    • /
    • pp.340-361
    • /
    • 2019
  • This study analyzes the technological structure of artificial intelligence (AI) and technological capabilities of AI companies based on patent information. 2589 AI patents registered in USPTO from 2007 to 2017 were collected and analyzed by the Latent Dirichlet Allocation (LDA) to derive 20 AI technology topics. Analysis of technology development trends by AI technology reveals that visual understanding, data analysis, motion control, and machine learning are growing, while language understanding and speech technology are sluggish. In addition, we also investigated leading companies in each sub-field of AI as well as core competencies of global IT companies. The findings of this study are expected to be fruitfully used for formulation and implementation of technology strategy of AI companies.

Study on the distribution law and influencing factors of pressure field distribution before exploitation in heavy oilfield

  • Zhang, Xing;Jiang, Ting T.;Zhang, Jian H.;Li, Bo;Li, Yu B.;Zhang, Chun Y.;Xu, Bing B.;Qi, Peng
    • Geomechanics and Engineering
    • /
    • v.18 no.2
    • /
    • pp.205-213
    • /
    • 2019
  • A calculation model of reservoir pressure field distribution around multiple production wells in a heavy oil reservoir is established, which can overcome the unreasonable uniform-pressure value calculated by the traditional mathematical model in the multiwell mining areas. A calculating program is developed based on the deduced equations by using Visual Basic computer language. Based on the proposed mathematical model, the effects of drainage rate and formation permeability on the distribution of reservoir pressure are studied. Results show that the reservoir pressure drops most at the wellbore. The farther the distance away from the borehole, the sparser the isobaric lines distribute. Increasing drainage rate results in decreasing reservoir pressure and bottom-hole pressure, especially the latter. The permeability has a significant effect on bottom hole pressure. The study provides a reference basis for studying the dynamic pressure field distribution before thermal recovery technology in heavy oilfield and optimizing construction parameters.

Development of Automatic Peach Grading System using NIR Spectroscopy

  • Lee, Kang-J.;Choi, Kyu H.;Choi, Dong S.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1267-1267
    • /
    • 2001
  • The existing fruit sorter has the method of tilting tray and extracting fruits by the action of solenoid or springs. In peaches, the most sort processing is supported by man because the sorter make fatal damage to peaches. In order to sustain commodity and quality of peach non-destructive, non-contact and real time based sorter was needed. This study was performed to develop peach sorter using near-infrared spectroscopy in real time and nondestructively. The prototype was developed to decrease internal and external damage of peach caused by the sorter, which had a way of extracting tray with it. To decrease positioning error of measuring sugar contents in peaches, fiber optic with two direction diverged was developed and attached to the prototype. The program for sorting and operating the prototype was developed using visual basic 6.0 language to measure several quality index such as chlorophyll, some defect, sugar contents. The all sorting result was saved to return farmers for being index of good quality production. Using the prototype, program and MLR(multiple linear regression) model, it was possible to estimate sugar content of peaches with the determination coefficient of 0.71 and SEC of 0.42bx using 16 wavelengths. The developed MLR model had determination coefficient of 0.69, and SEP of 0.49bx, it was better result than single point measurement of 1999's. The peach sweetness grading system based on NIR reflectance method, which consists of photodiode-array sensor, quartz-halogen lamp and fiber optic diverged two bundles for transmitting the light and detecting the reflected light, was developed and evaluated. It was possible to predict the soluble solid contents of peaches in real time and nondestructively using the system which had the accuracy of 91 percentage and the capacity of 7,200 peaches per an hour for grading 2 classes by sugar contents. Draining is one of important factors for production peaches having good qualities. The reason why one farm's product belows others could be estimated for bad draining, over-much nitrogen fertilizer, soil characteristics, etc. After this, the report saved by the peach grading system will have to be good materials to farmers for production high quality peaches. They could share the result or compare with others and diagnose their cultural practice.

  • PDF

"Say Hello to Vietnam!": A Multimodal Analysis of British Travel Blogs

  • Thuy T.H. Tran
    • SUVANNABHUMI
    • /
    • v.15 no.2
    • /
    • pp.91-129
    • /
    • 2023
  • This paper reports the findings of a multimodal study conducted on 10 travel blog posts about Vietnam by seven British professional travel bloggers. The study takes a sociolinguistic view to tourism by seeing travel blogs as a source for linguistic and other semiotic materials while considering language as situated practice for the social construction of fundamental categories such as "human," "society," and "nation." It borrows concepts from Halliday's Systemic Functional Linguistics for interpersonal metafunction to develop an analytical framework to study how the co-occurrence of text and still images in these travel blog posts formulated the portrayal of Vietnam as a tourism destination and indicated the main sociolinguistic features of the blogs. The analysis of appreciation values and interactive qualities encoded in evaluative adjectives and still images show that Vietnam is generally portrayed as a country of identity and diversity. It provides tourists with positive experiences in terms of places of interest, food and local lifestyles and is cost-competitive. Strangerhood and authenticity are two outstanding sociolinguistic features exhibited in these travel blog posts. The findings of this study also underline the co-contribution of the linguistic sign, in this case evaluative adjectives, and the visual sign, in this case still images, as interpersonal meaning-making resources. To portray Vietnam, still images served as integral elements to evidence the credibility of verbal narrations. To unveil sociolinguistic characteristics of travel blogs, still images supported the linguistic realizations of authenticity and strangerhood on the posts, and in some case delivered an even stronger message than words. Not only does the study present a source of feedback from international travelers to tourism practice in Vietnam, but it also provides insights into multimodal analysis of tourism discourse which remains an under-researched area in Vietnam.

Artificial intelligence application UX/UI study for language learning of children with articulation disorder (조음장애 아동의 언어학습을 위한 인공지능 애플리케이션 UX/UI 연구)

  • Yang, Eun-mi;Park, Dea-woo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.174-176
    • /
    • 2022
  • In this paper, we present a mobile application for 'personalized customized learning' for children with articulation disorders using an artificial intelligence (AI) algorithm. A dataset (Data Set) to analyze, judge, and predict the learner's articulation situation and degree. In particular, we designed a prototype model by looking at how AI can be improved and advanced compared to existing applications from the UX/UI (GUI) aspect. So far, the focus has been on visual experience, but now it is an important time to process data and provide a UX/UI (GUI) experience to users. The UX/UI (GUI) of the proposed mobile application was to be provided according to the learner's articulation level and situation by using CRNN (Convolution Recurrent Neural Network) of DeepLearning and Auto Encoder GPT-3 (Generative Pretrained Transformer). The use of artificial intelligence algorithms will provide a learning environment with a high degree of perfection to children with articulation disorders, thereby enhancing the learning effect. I hope that you do not have any fear or discomfort in conversation by improving the perfection of articulation with 'personalized and customized learning'.

  • PDF