• Title/Summary/Keyword: Google voice search (구글 음성 검색)

Analysis of Mobile Search Functions of Korean Search Portals (검색 포털들의 모바일 검색 기능 분석)

  • Park, So-Yeon
    • Journal of the Korean Society for Information Management / v.29 no.1 / pp.175-190 / 2012
  • This study investigates the current status of the mobile search functions of Korean search portals, namely Google Korea, Naver, Nate, Daum, and Yahoo Korea. It focuses on search functions unique to mobile, such as voice search, music search, code search, and visual/object search. In particular, the study analyzes the characteristics of these search functions and evaluates their performance in terms of recognition accuracy and speed. The results show that both Naver and Daum support a variety of mobile search functions, whereas Google supports only voice search. Nate and Yahoo do not offer any unique function. The results can inform the portals' development of effective mobile search functionality.

The Need to Develop an Automatic VTS Recording Program Using Speech Recognition Technology (음성인식기술을 활용한 VTS 자동 기록 프로그램 개발의 필요성)

  • Park, Min-Gyeong;Kim, Myeong-Su;Lee, Sang-Rok;Heo, Yeong-Gwan
    • Proceedings of the Korean Institute of Navigation and Port Research Conference / 2015.07a / pp.314-315 / 2015
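  • In keeping with the recent rapid advances in speech recognition technology and its broad use across many fields, we sought to apply it to the vessel traffic service (VTS), where most traffic control is carried out by voice. We intend to develop an automatic VTS recording program so that control communications, which are useful not only in ship accidents but also in other areas such as vessel-related corruption cases and information disclosure requests, can be recorded more objectively and accurately.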

A Basic Performance Evaluation of the Speech Recognition APP of Standard Language and Dialect using Google, Naver, and Daum KAKAO APIs (구글, 네이버, 다음 카카오 API 활용앱의 표준어 및 방언 음성인식 기초 성능평가)

  • Roh, Hee-Kyung;Lee, Kang-Hee
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.7 no.12 / pp.819-829 / 2017
  • In this paper, we first describe the current state of speech recognition technology and identify its basic techniques and algorithms, and then explain the code flow of the APIs needed for speech recognition. We use the application programming interfaces (APIs) of Google, Naver, and Daum Kakao, the speech recognition API providers with the best-known search engines, to create a voice recognition app in Android Studio. We then run a speech recognition experiment on standard language and dialect speech from speakers grouped by gender, age, and region, and organize the recognition rates into a table. Experiments were conducted for Gyeongsang-do, Chungcheong-do, and Jeolla-do, provinces where the dialects are strong. Comparative experiments were also conducted with standard-language speech. The accuracy of each recognized sentence is checked in terms of word spacing, final consonants, postpositions, and words, and the errors of each type are counted. Based on the resulting recognition rates, we aim to present the advantages of each API and to establish a basic framework for using them most efficiently.

Overview of Voice Interface Technology and Service Trends in the Smartphone Environment (음성인터페이스 기술 개요 및 스마트폰 환경에서의 서비스 동향)

  • Lee, Yun-Geun
    • Information and Communications Magazine / v.29 no.4 / pp.3-9 / 2012
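  • This article examines speech recognition technology, which has recently come into use as a convenient user interface on smartphones and similar devices. Speech recognition, the technology by which a computer understands human speech, has more than 50 years of research and development history. It has advanced steadily through continued technical development and commercialization attempts, and interest in the field has risen rapidly with the recent spread of smartphones. Because it is a language-related technology, speech recognition has particular characteristics in both technical and market terms, and R&D strategy should take these fully into account. Global IT leaders such as Google, Apple, and Microsoft are now investing heavily in speech recognition; in particular, they have launched speech recognition services for the smartphone environment, such as voice search, automatic interpretation, and AI personal assistants, and have entered full-scale competition to take the lead in technology and the market. The article looks at these services in detail and discusses their implications and the status of the response in Korea.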

Multi-purpose smart mirror including CCTV function (CCTV 기능을 포함한 다용도 스마트 미러)

  • Lee, Tea-Nam
    • Proceedings of the Korea Information Processing Society Conference / 2022.11a / pp.863-865 / 2022
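  • This project displays basic everyday information on a smart mirror, including the time, weather, fine-dust concentration, calendar, and news, and additionally uses Google Assistant to provide voice-driven functions such as YouTube playback and internet search. Using a human-detection sensor, it runs in power-saving mode while no movement is detected and switches to normal mode when movement is detected. Finally, it includes a CCTV function that streams the CCTV video in real time through a web application and records the video when a human face is detected.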

Design and Implementation of a Navigation System for Visually Impaired Persons (시각장애인을 위한 네비게이션 시스템 설계 및 구현)

  • Jang, Su-Min;Hwang, Dong-Gyo;Kang, Soo;Kim, Eun-Ju;Park, Jun-Ho;Jang, Ki-Hun;Yoo, Jae-Soo
    • The Journal of the Korea Contents Association / v.12 no.1 / pp.38-47 / 2012
  • To extend the activity range of visually impaired persons, we design and implement a navigation system that supports road information services and points of interest. The proposed navigation system consists of route creation modules and storage modules for visually impaired persons. In particular, the main interface of the navigation system is implemented using a TTS (Text-to-Speech) program for audio output and a braille module that outputs braille for touch. We also use the Google Maps APIs, which can provide up-to-date map information for the navigation system.

Personalized Smart Mirror using Voice Recognition (음성인식을 이용한 개인맞춤형 스마트 미러)

  • Dae-Cheol, Kang;Jong-Seok, Lim;Gil-Ho, Lee;Beom-Hee, Lee;Hyoung-Keun, Park
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.17 no.6 / pp.1121-1128 / 2022
  • Information about the present invention is made available for business use. You are helping to use the LCD, you can't use the LCD screen. During software configuration, Raspbian was used to provide the system environment. We made our way through the menu and made our financial through play. It provides various information such as weather, weather, apps, streamer music, and web browser search function, and it can be charged. Currently, the 'Google Assistant' will be provided through the GUI within a predetermined time.

A study of Artificial Intelligence (AI) Speaker's Development Process in Terms of Social Constructivism: Focused on the Products and Periodic Co-revolution Process (인공지능(AI) 스피커에 대한 사회구성 차원의 발달과정 연구: 제품과 시기별 공진화 과정을 중심으로)

  • Cha, Hyeon-ju;Kweon, Sang-hee
    • Journal of Internet Computing and Services / v.22 no.1 / pp.109-135 / 2021
  • This study classified the development process of artificial intelligence (AI) speakers by analyzing the text of news about AI speakers in traditional news reports, and identified the characteristics of each product by period. The theoretical background of the analysis consists of news frames and topic frames. The analysis methods were topic modeling with the LDA method and semantic network analysis, and the research method was content analysis. First, 2,710 news articles related to AI speakers from 2014 to 2019 were collected; second, topic frames were analyzed using the NodeXL algorithm. The first result is that the trend of topic frames differed by AI speaker provider type, according to the characteristics of the four types of operators (communication service provider, online platform, OS provider, and IT device manufacturer). Specifically, online platform operators (Google, Naver, Amazon, Kakao) appeared with a frame that uses AI speakers as 'search or input devices'. Telecommunications operators (SKT, KT), on the other hand, showed prominent frames for IPTV, the parent company's flagship business, and for an 'auxiliary device' of the telecommunications business. Furthermore, the frame of "personalization of products and voice service" was prominent for OS operators (MS, Apple), and the frame for IT device manufacturers (Samsung) was "Internet of Things (IoT) Integrated Intelligence System". The second result is that the trend of topic frames by AI speaker development period (by year) centered on AI technology in the first phase (2014-2016), related to the social interaction between AI technology and users in the second phase (2017-2018), and shifted from AI technology-centered to user-centered in the third phase (2019). The QAP analysis showed that news frames by business operator and development period in AI speaker development are socially constructed by determinants of media discourse. One implication of this study is that the evolution of AI speakers reflects the characteristics of the parent company and a process of co-evolution driven by interactions with users, by business operator and development period. The results are also important indicators for predicting the future prospects of AI speakers and for presenting directions accordingly.