• Title/Abstract/Keywords: Korean human dataset

161 search results, processing time 0.022 s

Two-Stream Convolutional Neural Network for Video Action Recognition

  • Qiao, Han;Liu, Shuang;Xu, Qingzhen;Liu, Shouqiang;Yang, Wanggan
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 15, No. 10 / pp.3668-3684 / 2021
  • Video action recognition is widely used in video surveillance, behavior detection, human-computer interaction, medically assisted diagnosis, and motion analysis. However, it can be disturbed by many factors, such as background and illumination. A two-stream convolutional neural network trains spatial and temporal models on the video separately and fuses them at the output end. The multi-segment two-stream model extracts temporal and spatial features from the video, fuses them, and then determines the action category. This paper adopts Google's Xception model with transfer learning, using Xception weights pre-trained on ImageNet as the initialization. This largely overcomes the model underfitting caused by insufficient video behavior data, effectively reduces the influence of disturbing factors in the video, improves accuracy, and shortens training time. In addition, to compensate for the shortage of data, the Kinetics-400 dataset was used for pre-training, which further improved the accuracy of the model. The expected goals of this applied research were achieved, and the design of the original two-stream model was improved accordingly.
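The output-end fusion described above can be sketched in a few lines of NumPy. The equal weighting of the two streams is an assumption for illustration; the paper does not specify its fusion weights.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def fuse_two_stream(spatial_logits, temporal_logits, w_spatial=0.5):
    """Weighted late fusion of the spatial and temporal stream predictions."""
    p_spatial = softmax(spatial_logits)
    p_temporal = softmax(temporal_logits)
    return w_spatial * p_spatial + (1.0 - w_spatial) * p_temporal

# Toy example with 3 action classes: the temporal stream is confident in class 2.
spatial = np.array([2.0, 1.0, 0.5])
temporal = np.array([0.1, 0.2, 3.0])
fused = fuse_two_stream(spatial, temporal)
predicted_class = int(np.argmax(fused))   # class chosen after fusion
```

In a full pipeline, `spatial_logits` would come from an Xception branch applied to RGB frames and `temporal_logits` from one applied to stacked optical flow, with both averaged over the video's segments.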

Human-AI 협력 프로세스 기반의 증거기반 국가혁신 모니터링 연구: 해양수산부 사례 (A Study on Human-AI Collaboration Process to Support Evidence-Based National Innovation Monitoring: Case Study on Ministry of Oceans and Fisheries)

  • 임정선;배성훈;류길호;김상국
    • 산업경영시스템학회지 / Vol. 46, No. 2 / pp.22-31 / 2023
  • Governments around the world are enacting laws mandating explainable traceability when using AI(Artificial Intelligence) to solve real-world problems. HAI(Human-Centric Artificial Intelligence) is an approach that induces human decision-making through Human-AI collaboration. This research presents a case study that implements the Human-AI collaboration to achieve explainable traceability in governmental data analysis. The Human-AI collaboration explored in this study performs AI inferences for generating labels, followed by AI interpretation to make results more explainable and traceable. The study utilized an example dataset from the Ministry of Oceans and Fisheries to reproduce the Human-AI collaboration process used in actual policy-making, in which the Ministry of Science and ICT utilized R&D PIE(R&D Platform for Investment and Evaluation) to build a government investment portfolio.

Human Action Recognition via Depth Maps Body Parts of Action

  • Farooq, Adnan;Farooq, Faisal;Le, Anh Vu
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 12, No. 5 / pp.2327-2347 / 2018
  • Human actions can be recognized from depth sequences. In the proposed algorithm, we first construct depth motion maps (DMMs) by projecting each depth frame onto three orthogonal Cartesian planes and accumulating the motion energy for each view. The body part of the action (BPoA) is located using a bounding box with an optimal window size based on the maximum spatial and temporal changes in each DMM. A feature vector is then constructed from the BPoA for each view. We employ an ensemble-based learning approach called Rotation Forest to recognize the different actions. Experimental results show that the proposed method significantly outperforms state-of-the-art methods on the Microsoft Research (MSR) Action3D and MSR DailyActivity3D datasets.
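A minimal NumPy sketch of the DMM construction step follows. The max-depth projections used here for the side and top views are a common simplification, not necessarily the paper's exact projection; the bounding-box (BPoA) step is omitted.

```python
import numpy as np

def depth_motion_maps(depth_video):
    """Build depth motion maps from a (T, H, W) depth sequence.

    Each frame is projected onto three orthogonal planes (front, side, top),
    and the absolute frame-to-frame motion energy is accumulated per view.
    """
    front = depth_video                 # (T, H, W): the frame itself
    side = depth_video.max(axis=2)      # (T, H): collapse along width
    top = depth_video.max(axis=1)       # (T, W): collapse along height

    dmms = []
    for proj in (front, side, top):
        motion = np.abs(np.diff(proj, axis=0))  # energy between consecutive frames
        dmms.append(motion.sum(axis=0))         # accumulate over time
    return dmms
```

A static scene yields all-zero maps, so large DMM values mark the regions where the body actually moved, which is what the BPoA bounding box then crops.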

HSFE Network and Fusion Model based Dynamic Hand Gesture Recognition

  • Tai, Do Nhu;Na, In Seop;Kim, Soo Hyung
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 14, No. 9 / pp.3924-3940 / 2020
  • Dynamic hand gesture recognition (d-HGR) plays an important role in human-computer interaction (HCI) systems. With the growth of hand-pose estimation and 3D depth sensors, depth and hand-skeleton datasets have spurred much research on depth-based and 3D hand-skeleton approaches. The problem remains challenging, however, due to low resolution, high complexity, and self-occlusion. In this paper, we propose a hand-shape feature extraction (HSFE) network to produce robust hand-shape features. We build LSTM-based hand-shape and hand-skeleton models to exploit the temporal information in hand-shape and motion changes. Fusing the two models yields the best accuracy on the dynamic hand gesture (DHG) dataset.

Motion classification using distributional features of 3D skeleton data

  • Woohyun Kim;Daeun Kim;Kyoung Shin Park;Sungim Lee
    • Communications for Statistical Applications and Methods / Vol. 30, No. 6 / pp.551-560 / 2023
  • Recently, there has been significant research into the recognition of human activities using three-dimensional sequential skeleton data captured by the Kinect depth sensor. Many of these studies employ deep learning models. This study introduces a novel feature selection method for such data and analyzes it using machine learning models. Due to the high-dimensional nature of the original Kinect data, effective feature extraction is required to address the classification challenge. We propose using the first four moments as predictors to represent the distribution of each joint sequence and evaluate their effectiveness on two datasets: the exergame dataset, consisting of three activities, and the MSR daily activity dataset, composed of ten activities. The results show that the accuracy of our approach outperforms existing methods on average across different classifiers.
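The moment-based features described above are straightforward to compute. A sketch under the assumption that each joint coordinate's time series is summarized independently (the paper's exact normalization of skewness and kurtosis may differ):

```python
import numpy as np

def moment_features(joint_seq):
    """Summarize a joint-coordinate sequence by its first four moments.

    joint_seq: array of shape (T, D), T frames and D joint coordinates.
    Returns a (4*D,) vector: per-coordinate mean, variance, skewness,
    and kurtosis over time.
    """
    mu = joint_seq.mean(axis=0)
    centered = joint_seq - mu
    var = centered.var(axis=0)
    std = np.sqrt(var) + 1e-12          # guard against constant coordinates
    skew = (centered ** 3).mean(axis=0) / std ** 3
    kurt = (centered ** 4).mean(axis=0) / std ** 4
    return np.concatenate([mu, var, skew, kurt])
```

The resulting fixed-length vector can be fed to any standard classifier, which is what makes the approach attractive compared with sequence models that must handle variable-length skeleton streams.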

Impact of Human Mobility on Social Networks

  • Wang, Dashun;Song, Chaoming
    • Journal of Communications and Networks / Vol. 17, No. 2 / pp.100-109 / 2015
  • Mobile phone carriers face challenges along three synergistic dimensions: wireless, social, and mobile. Despite significant advances in our understanding of social networks and of human mobility, knowledge about the interplay between the two layers remains limited, partly due to the difficulty of obtaining large-scale datasets that offer both social and mobility information across a substantial population over an extended period of time. In this paper, we take advantage of a massive, longitudinal mobile phone dataset that contains human mobility and social network information simultaneously, allowing us to explore the impact of human mobility patterns on the underlying social network. We find that human mobility plays an important role in shaping both local and global structural properties of the social network. In contrast to the lack of characteristic scale in social networks and human movements, we discover a characteristic distance in physical space, between 10 and 20 km, that affects both local clustering and modular structure in the social network. We also find a surprising distinction in trajectory overlap that segments social ties into two categories. Our results are of fundamental relevance to quantitative studies of human behavior and could serve as a basis for theoretical models of human behavior and for new applications built on social and mobile technologies.

Weather Recognition Based on 3C-CNN

  • Tan, Ling;Xuan, Dawei;Xia, Jingming;Wang, Chao
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 14, No. 8 / pp.3567-3582 / 2020
  • Human activities are often affected by weather conditions. Automatic weather recognition is useful for traffic alerting, driving assistance, and intelligent transportation. With the rise of deep learning and AI, deep convolutional neural networks (CNNs) are used to identify weather conditions. In this paper, a three-channel convolutional neural network (3C-CNN) model is proposed on the basis of ResNet50. The model extracts global weather features from the whole image through the ResNet50 branch, and extracts sky and ground features from the top and bottom regions through two CNN5 branches. The global and local features are then merged by a Concat operation. Finally, the weather image is classified by a Softmax classifier and the identification result is output. In addition, a medium-scale dataset of 6,185 outdoor weather images, named WeatherDataset-6, is established. 3C-CNN is trained and tested on both the Two-class Weather Images dataset and WeatherDataset-6. The experimental results show that 3C-CNN achieves the best results on both datasets, with average recognition accuracies of 94.35% and 95.81%, respectively, outperforming classic convolutional neural networks such as AlexNet, VGG16, and ResNet50. With further improvement, the method is expected to work well for images taken at night as well.
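The three-branch layout and Concat step can be sketched as follows. The top/bottom halves standing in for sky and ground, and the mean-pooling stand-ins for the CNN branches, are illustrative assumptions; the paper's branches are ResNet50 and CNN5 networks.

```python
import numpy as np

def three_channel_features(image, global_branch, local_branch):
    """Mimic the 3C-CNN layout: one branch sees the whole image, two
    local branches see the top (sky) and bottom (ground) halves, and
    the three feature vectors are concatenated (the Concat step)."""
    h = image.shape[0]
    sky, ground = image[: h // 2], image[h // 2 :]
    return np.concatenate([
        global_branch(image),
        local_branch(sky),
        local_branch(ground),
    ])

def pool(img):
    """Channel-wise average pooling, standing in for a CNN branch."""
    return img.mean(axis=(0, 1))

img = np.random.rand(224, 224, 3)
feat = three_channel_features(img, pool, pool)   # 3 channels per branch -> (9,)
```

Splitting the image this way bakes in the prior that weather cues differ between sky (cloud cover, haze) and ground (wet roads, snow), which is what motivates the two local branches.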

Human Activities Recognition Based on Skeleton Information via Sparse Representation

  • Liu, Suolan;Kong, Lizhi;Wang, Hongyuan
    • Journal of Computing Science and Engineering / Vol. 12, No. 1 / pp.1-11 / 2018
  • Human activity recognition is a challenging task due to the complexity of human movements and the variety with which different subjects perform the same action. This paper presents a recognition algorithm that uses skeleton information generated from depth maps. A feature vector is produced by concatenating motion features with a temporal-constraint feature. An improved fast classifier based on sparse representation is proposed by reducing the dictionary scale. The developed method is shown to be effective by recognizing different activities on the UTD-MHAD dataset. Comparison results indicate superior performance of our method over several existing methods.

실생활 음향 데이터 기반 이중 CNN 구조를 특징으로 하는 음향 이벤트 인식 알고리즘 (Dual CNN Structured Sound Event Detection Algorithm Based on Real Life Acoustic Dataset)

  • 서상원;임우택;정영호;이태진;김휘용
    • 방송공학회논문지 / Vol. 23, No. 6 / pp.855-865 / 2018
  • Sound event detection is the task of recognizing multiple acoustic events occurring in an environment and determining the onset and offset of each, modeling the auditory perception characteristics of humans. DCASE, a research community on acoustic scene and event recognition, runs challenges to encourage participation and stimulate research on acoustic recognition. However, the datasets provided by the DCASE challenges are small compared with ImageNet, the representative dataset in image recognition, and few other acoustic datasets are publicly available, which makes algorithm development difficult. In this work, to develop sound event detection technology, we defined and collected events that can occur indoors and outdoors, securing a larger-scale dataset. To improve recognition performance, we also developed a dual-CNN algorithm that adds an auxiliary neural network judging whether a sound event is present, and compared its performance against the DCASE 2016 and 2017 challenge baseline systems.
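One simple reading of the dual-CNN idea is to let the auxiliary network's event-presence posterior gate the main network's per-frame event scores. This is an illustrative interpretation, not the paper's exact formulation:

```python
import numpy as np

def gate_event_scores(event_scores, presence_prob, threshold=0.5):
    """Suppress frame-level event scores when the auxiliary network
    judges that no event is active.

    event_scores: (T, C) per-frame, per-event posteriors from the main CNN.
    presence_prob: (T,) per-frame event-presence posterior from the aux CNN.
    Returns a (T, C) boolean array of event-activity decisions.
    """
    gated = event_scores * presence_prob[:, None]   # scale by presence belief
    return gated >= threshold
```

The gating reduces false positives in silent or background-only frames, which is the kind of error a detection-only network tends to make on small training sets.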

딥러닝을 이용한 언어별 단어 분류 기법 (Language-based Classification of Words using Deep Learning)

  • 듀크;다후다;조인휘
    • 한국정보처리학회:학술대회논문집 / KIPS 2021 Spring Conference / pp.411-414 / 2021
  • Deep learning has become a critical technology in education today, and it is especially used in natural language processing (NLP), where word-representation vectors play a central role. However, low-resource languages such as Swahili, spoken in East and Central Africa, are poorly served. NLP is a field of artificial intelligence that builds systems and computational algorithms to automatically understand, analyze, manipulate, and potentially generate human language. Having found that some African languages lack proper representation in language processing, and are described as low-resource languages because of inadequate data for NLP, we decided to study Swahili. Language modeling with neural networks requires adequate data to guarantee quality word representations, which are important for NLP tasks, and most African languages have no data for such processing. The main aim of this project is the classification of words in English, Swahili, and Korean, with particular emphasis on the low-resource Swahili language. Finally, we will create our own dataset, preprocess the data using Python scripts, formulate the syllabic alphabet, and develop an English-Swahili-Korean word analogy dataset.
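A trivial baseline for the three-way word classification described above is a rule-based tagger: Hangul code points identify Korean directly, while Latin-script words need learned cues to separate Swahili from English. The sketch below uses a tiny, purely illustrative Swahili function-word list for that second step; a real system, as the abstract proposes, would learn the distinction from a dataset.

```python
def classify_word(word):
    """Toy language tagger for Korean, Swahili, and English words.

    Hangul syllables occupy U+AC00..U+D7A3; the Swahili hint list below
    is a hypothetical toy example, not a real lexicon.
    """
    swahili_hints = {"na", "ya", "wa", "kwa", "ni", "za"}
    if any("\uac00" <= ch <= "\ud7a3" for ch in word):
        return "korean"
    if word.lower() in swahili_hints:
        return "swahili"
    return "english"
```

Such a baseline also shows why the project focuses on data: script detection solves the Korean case for free, but English and Swahili share an alphabet, so distinguishing them hinges entirely on having enough labeled Swahili text.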