• Title/Summary/Keyword: Feature Maps

Search Result 284, Processing Time 0.026 seconds

Neural network with occlusion-resistant and reduced parameters in stereo images (스테레오 영상에서 폐색에 강인하고 축소된 파라미터를 갖는 신경망)

  • Kwang-Yeob Lee;Young-Min Jeon;Jun-Mo Jeong
    • Journal of IKEEE
    • /
    • v.28 no.1
    • /
    • pp.65-71
    • /
    • 2024
  • This paper proposes a neural network that can reduce the number of parameters while reducing matching errors in occluded regions to increase the accuracy of depth maps in stereo matching. Stereo matching-based object recognition is utilized in many fields to more accurately recognize situations using images. When there are many objects in a complex image, an occluded area is generated due to overlap between objects and occlusion by background, thereby lowering the accuracy of the depth map. To solve this problem, existing research methods that create context information and combine it with the cost volume or RoIselect in the occluded area increase the complexity of neural networks, making it difficult to learn and expensive to implement. In this paper, we create a depthwise seperable neural network that enhances regional feature extraction before cost volume generation, reducing the number of parameters and proposing a neural network that is robust to occlusion errors. Compared to PSMNet, the proposed neural network reduced the number of parameters by 30%, improving 5.3% in color error and 3.6% in test loss.

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.1-19
    • /
    • 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.

A Deep Learning Based Approach to Recognizing Accompanying Status of Smartphone Users Using Multimodal Data (스마트폰 다종 데이터를 활용한 딥러닝 기반의 사용자 동행 상태 인식)

  • Kim, Kilho;Choi, Sangwoo;Chae, Moon-jung;Park, Heewoong;Lee, Jaehong;Park, Jonghun
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.163-177
    • /
    • 2019
  • As smartphones are getting widely used, human activity recognition (HAR) tasks for recognizing personal activities of smartphone users with multimodal data have been actively studied recently. The research area is expanding from the recognition of the simple body movement of an individual user to the recognition of low-level behavior and high-level behavior. However, HAR tasks for recognizing interaction behavior with other people, such as whether the user is accompanying or communicating with someone else, have gotten less attention so far. And previous research for recognizing interaction behavior has usually depended on audio, Bluetooth, and Wi-Fi sensors, which are vulnerable to privacy issues and require much time to collect enough data. Whereas physical sensors including accelerometer, magnetic field and gyroscope sensors are less vulnerable to privacy issues and can collect a large amount of data within a short time. In this paper, a method for detecting accompanying status based on deep learning model by only using multimodal physical sensor data, such as an accelerometer, magnetic field and gyroscope, was proposed. The accompanying status was defined as a redefinition of a part of the user interaction behavior, including whether the user is accompanying with an acquaintance at a close distance and the user is actively communicating with the acquaintance. A framework based on convolutional neural networks (CNN) and long short-term memory (LSTM) recurrent networks for classifying accompanying and conversation was proposed. First, a data preprocessing method which consists of time synchronization of multimodal data from different physical sensors, data normalization and sequence data generation was introduced. We applied the nearest interpolation to synchronize the time of collected data from different sensors. Normalization was performed for each x, y, z axis value of the sensor data, and the sequence data was generated according to the sliding window method. Then, the sequence data became the input for CNN, where feature maps representing local dependencies of the original sequence are extracted. The CNN consisted of 3 convolutional layers and did not have a pooling layer to maintain the temporal information of the sequence data. Next, LSTM recurrent networks received the feature maps, learned long-term dependencies from them and extracted features. The LSTM recurrent networks consisted of two layers, each with 128 cells. Finally, the extracted features were used for classification by softmax classifier. The loss function of the model was cross entropy function and the weights of the model were randomly initialized on a normal distribution with an average of 0 and a standard deviation of 0.1. The model was trained using adaptive moment estimation (ADAM) optimization algorithm and the mini batch size was set to 128. We applied dropout to input values of the LSTM recurrent networks to prevent overfitting. The initial learning rate was set to 0.001, and it decreased exponentially by 0.99 at the end of each epoch training. An Android smartphone application was developed and released to collect data. We collected smartphone data for a total of 18 subjects. Using the data, the model classified accompanying and conversation by 98.74% and 98.83% accuracy each. Both the F1 score and accuracy of the model were higher than the F1 score and accuracy of the majority vote classifier, support vector machine, and deep recurrent neural network. In the future research, we will focus on more rigorous multimodal sensor data synchronization methods that minimize the time stamp differences. In addition, we will further study transfer learning method that enables transfer of trained models tailored to the training data to the evaluation data that follows a different distribution. It is expected that a model capable of exhibiting robust recognition performance against changes in data that is not considered in the model learning stage will be obtained.

Comparison of Forest Carbon Stocks Estimation Methods Using Forest Type Map and Landsat TM Satellite Imagery (임상도와 Landsat TM 위성영상을 이용한 산림탄소저장량 추정 방법 비교 연구)

  • Kim, Kyoung-Min;Lee, Jung-Bin;Jung, Jaehoon
    • Korean Journal of Remote Sensing
    • /
    • v.31 no.5
    • /
    • pp.449-459
    • /
    • 2015
  • The conventional National Forest Inventory(NFI)-based forest carbon stock estimation method is suitable for national-scale estimation, but is not for regional-scale estimation due to the lack of NFI plots. In this study, for the purpose of regional-scale carbon stock estimation, we created grid-based forest carbon stock maps using spatial ancillary data and two types of up-scaling methods. Chungnam province was chosen to represent the study area and for which the $5^{th}$ NFI (2006~2009) data was collected. The first method (method 1) selects forest type map as ancillary data and uses regression model for forest carbon stock estimation, whereas the second method (method 2) uses satellite imagery and k-Nearest Neighbor(k-NN) algorithm. Additionally, in order to consider uncertainty effects, the final AGB carbon stock maps were generated by performing 200 iterative processes with Monte Carlo simulation. As a result, compared to the NFI-based estimation(21,136,911 tonC), the total carbon stock was over-estimated by method 1(22,948,151 tonC), but was under-estimated by method 2(19,750,315 tonC). In the paired T-test with 186 independent data, the average carbon stock estimation by the NFI-based method was statistically different from method2(p<0.01), but was not different from method1(p>0.01). In particular, by means of Monte Carlo simulation, it was found that the smoothing effect of k-NN algorithm and mis-registration error between NFI plots and satellite image can lead to large uncertainty in carbon stock estimation. Although method 1 was found suitable for carbon stock estimation of forest stands that feature heterogeneous trees in Korea, satellite-based method is still in demand to provide periodic estimates of un-investigated, large forest area. In these respects, future work will focus on spatial and temporal extent of study area and robust carbon stock estimation with various satellite images and estimation methods.

Comparison of Flood Inundation Models using Topographic Feature (지형요소를 이용한 홍수범람해석 모형의 비교)

  • Moon, Changgeon;Lee, Jungsik;Cho, Sunggeun;Shin, Shachul
    • Journal of the Korean GEO-environmental Society
    • /
    • v.15 no.1
    • /
    • pp.69-77
    • /
    • 2014
  • The objective of this study is to compare flood inundation models for small stream basin. HEC-RAS model was used for the analysis of one dimensional hydraulics and HEC-GeoRAS, Ras Mapper and RiverCAD models were applied for the flood inundation analysis in Gum Chung stream. Flood inundations are to simulate by flood inundation models using observed data and rainfall on each frequency and to compare with inundation area based on the flood plain maps. The results of this study are as follows; Area of flood inundations by HEC-GeoRAS model is similar to that of flood plain map and appears in order of RAS Mapper and RiverCAD model. Flood inundation area by RiverCAD model is to estimate lager than that of RAS Mapper and HEC-GeoRAS model in flood area on each frequency and the results show that they have a little difference in models of flood inundation analysis at small stream. Comparing the area of flood inundations by flood depth, the results of three models are relatively similar in flood depth as 2.0 m below, and RiverCAD model shows a significant difference in flood depth as 2.0 m or more.

Estuary Classification Based on the Characteristics of Geomorphological Features, Natural Habitat Distributions and Land Uses (하구의 지형적.자연서식지.이용개발특성에 따른 유형 분류)

  • Lee, Kang-Hyun;Rho, Baik-Ho;Choi, Hyun-Jeong;Lee, Chang-Hee
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.16 no.2
    • /
    • pp.53-69
    • /
    • 2011
  • Classification of estuaries based on their multi-component and multidisciplinary processes is important for the systematic management of estuaries. In this study, an integrated GlS-based analysis system including high resolution aerial photographies and topographic maps was used to classify 463 estuaries based on estuarine circulation pattern, geomorphological feature, natural habitat distribution and characteristics of land use. These estuaries were divided into two basic types, open and closed. Two hundred and thirty five systems were open estuaries comprising of forty one mountainous type (OM), eighty sevcn sandpit type (OS) and one hundred seven funnel type (OF). Each type of open estuary was further classified into three types based on habitat distribution and land use characteristics resulting in total ninc types of open estuaries. Two hundred and twenty eight estuaries were closed systems comprising of one hundred and forty four blocked type (CB directly) and eighty four lake type (CL, indirectly). CB type estuary was further classified into three types based on habitat distribution and land use characteristics. This estuarine classification scheme can be applied to provide a framework for effective management strategies of individual estuaries to estimate the management priority and strategy.

Generation of Multi-view Images Using Depth Map Decomposition and Edge Smoothing (깊이맵의 정보 분해와 경계 평탄 필터링을 이용한 다시점 영상 생성 방법)

  • Kim, Sung-Yeol;Lee, Sang-Beom;Kim, Yoo-Kyung;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.11 no.4 s.33
    • /
    • pp.471-482
    • /
    • 2006
  • In this paper, we propose a new scheme to generate multi-view images utilizing depth map decomposition and adaptive edge smoothing. After carrying out smooth filtering based on an adaptive window size to regions of edges in the depth map, we decompose the smoothed depth map into four types of images: regular mesh, object boundary, feature point, and number-of-layer images. Then, we generate 3-D scenes from the decomposed images using a 3-D mesh triangulation technique. Finally, we extract multi-view images from the reconstructed 3-D scenes by changing the position of a virtual camera in the 3-D space. Experimental results show that our scheme generates multi-view images successfully by minimizing a rubber-sheet problem using edge smoothing, and renders consecutive 3-D scenes in real time through information decomposition of depth maps. In addition, the proposed scheme can be used for 3-D applications that need the depth information, such as depth keying, since we can preserve the depth data unlike the previous unsymmetric filtering method.

A Study on the Block Planning Characteristics of the Tribute Granary Castle at Asan Cape Gongse in the Joseon Dynasty (조선시대 아산 공세곶창성의 배치 특성에 관한 연구)

  • Lee, Wang-Kee;Lee, Jeong-Soo;Lim, Cho-Long
    • Journal of architectural history
    • /
    • v.16 no.3
    • /
    • pp.75-94
    • /
    • 2007
  • There were many researches on marine transportation and granaries, most of which focused on the historical establishment and organization of the marine transportation. However, a few researches were conducted on the architectural aspects. Hence the purposes of this study are to investigate the following matters: first, documents and relics concerning the tribute granary castle at Cape Gongse in Asan, a typical granary during the Joseon Dynasty, were investigated to academically understand the castle's establishment and historical background; second, the dispositional characteristics of the granary and the castle, including its adjacent facilities, were investigated to review its archaeological value; finally, basic materials were provided for systematical preservation and management these relics. As for the research method, the author referred to and analyzed sundry records and old maps, and ascertained in detail historical evidence through residential testimonies and the on-the-spot surveys. In addition, the author investigated the dispositional characteristics of the tribute granary castle at Cape Gongse by analyzing its exact size and shape, based on the old documents and an actual survey of the castle remains. The characteristics of the tribute granary castle at Cape Gongse may be summarized as follows. First, tribute granary at cape Gongse is a only tribute granary which has a granary and castle. second, the tribute granary castle at Cape Gongse has a curvilinear shape, like a gourd dipper; a large circle surrounding the village and a small circle surrounding the area of Mt. Shinpoong both meet up with it. Third, the construction type of the tribute granary castle at Cape Gongse is in a style similar to a town castle or a battle camp castle located in the coastal regions. As for its locational conditions, however, the east gate, presumably an incoming and outgoing route to the granary for vessels, was a feature unique to the marine granary castle. Fourth, the tribute granary at Cape Gongse had a granary of eighty kan in 1523 and, in addition, there were also Bongsang-cheong, Sa-chang, Joseon-sobakcheo, Chimhae-dang, and more, not to mention many privates houses in the castle. The granary is located in the center of the tribute granary castle, where Gongse Nonghyub is currently located. The location of the government offices seemed to be on the northern ridge. Fifth, the tribute granary castle at Cape Gongse is a valuable relic that offers insight into marine transportation, tribute granaries, and tribute granary castles during the Joseon Dynasty. It has special archaeological value because it was one of only a few tribute granary castles that served to protect the tribute granaries.

  • PDF

Proton Magnetic Resonance Chemical Shift Imaging(1H-CSI)-directed Stereotactic Brain Biopsy (양성자 화학적 이동영상기법(1H-CSI)을 이용한 정위적 뇌생검)

  • Chang, Kyung-Sool;Son, Byung-Chul;Kim, Moon-Chan;Choi, Byung-Gil;Kim, Euy-Neying;Kim, Bum-Soo;Choe, Bo-Young;Baik, Hyun-Man;Hong, Yong-Kil;Kang, Joon-Ki
    • Journal of Korean Neurosurgical Society
    • /
    • v.29 no.12
    • /
    • pp.1606-1611
    • /
    • 2000
  • Objective : To obtain more reliable sample in stereotactic biopsy, authors adopted proton chemical shift imaging ($^1H$-CSI)-directed biopsy. Until now, proton single voxel spectroscopy($^1H$-SVS) technique has been reported as a technique using metabolic information in stereotactic biopsy. The authors performed $^1H$-CSI with a stereotactic headframe in place and evaluated the pathologic results obtained from local metabolic information through $^1H$-CSI. Methods : $^1H$ CSI-directed stereotactic biopsy was performed in four patients. $^1H$-CSI and conventional Gd-enhancement stereotactic MRI was done simultaneously after application of the stereotatic frame. After reconstruction of metabolic maps of NAA/Cr, Cho/Cr, and Lactate/Cr ratios, the focal areas of increased Cho/Cr ratios and decreased NAA/Cr ratios were selected for target sites in the MR images Results : There was no difficulty in performing $^1H$-CSI with the stereotactic headframe in place. In pathologic examinations, the samples taken in area of increased Cho/Cr ratios and decreased NAA/Cr ratios showed the features of increased cellularity, mitoses and cellular atypism, thus facilitated the diagnosis. The pathologic samples taken from the area of increased Lactate/Cr ratios showed prominent feature of necrosis. Conclusion : $^1H$-CSI was feasible with stereotactic head frame in place. The final pathologic results obtained in our samples were concordant with the local metabolic informations from $^1H$-CSI. Authors believe that $^1H$ CSI-directed stereotactic biopsy may provide us advantages in obtaining more reliable tissue specimen in stereotactic biopsy.

  • PDF

Mine Haulage System Design for Reopening of Yangyang Iron Mine using 3D Modelling (3차원 모델링을 이용한 재개광 양양철광의 운반시스템 설계)

  • Son, Youngjin;Kim, Jaedong
    • Tunnel and Underground Space
    • /
    • v.22 no.6
    • /
    • pp.412-428
    • /
    • 2012
  • To achieve mine development, a large amount of data concerned with the geological structure and the ore body had to be investigated and collected through geological survey, drilling and geophysical explorations. In most previous cases, however, the data were usually analyzed two dimensionally and those results showed some limits because of their 2D presentation. Those 2D maps such as geological plane sections or longitudinal sections cause lots of difficulties in understanding the complex geological structure or the feature of ore body in a spatial way. In this study, research area was set on the abandoned Yangyang iron mine in Korea and the Sugaeng ore body within the mine was selected as the research target to design a mine haulage system for reopening. A 3D mine model of this area was tried to be constructed using a 3D modelling software, GEMS. An accurate 3D model including the ore body, the geological structure, the old underground mine drifts and the new mine drifts was constructed under the purpose of reopening of the abandoned iron mine. Especially, mine design for trackless haulage system was conducted. New inclines and drifts were planned and modelled 3 dimensionally considering the utilization of old drifts and shaft. In addition to the 3D modelling, geostatistical technique was adopted to generate a spatial distribution of the ore grade and the rock physical properties. 3D model would be able to contribute in solving problems such as evaluating ore reserves, planning the mine development and additional explorations and changing the development plans, etc.