Collaborative filtering (CF) is a system that interprets the relationship between a user and a product and recommends the product to a specific user. The CF model is advantageous in that it can recommend products to users with only rating data without any additional information such as contents. However, there are many cases where a user does not give a rating even after consuming the product as well as consuming only a small portion of the total product. This means that the number of ratings observed is very small and the user rating matrix is very sparse. The sparsity of this rating data poses a problem in raising CF performance. In this paper, we concentrate on raising the performance of latent factor model (especially SVD). We propose a new model that includes product similarity information and co occurrence information in SVD. The similarity and concurrence information obtained from the rating data increased the expressiveness of the latent space in terms of latent factors. Thus, Recall increased by 16% and Precision and NDCG increased by 8% and 7%, respectively. The proposed method of the paper will show better performance than the existing method when combined with other recommender systems in the future.
Jung, Moon Young;Kim, In Kee;Sung, Won Mo;Kang, Jung Keuk
Economic and Environmental Geology
/
v.28
no.3
/
pp.199-211
/
1995
The deep sea camera system could render it possible to obtain the detailed information of the nodule distribution, but difficult to estimate nodule abundance quantitatively. In order to estimate nodule abundance quantitatively from deep seabed photographs, the nodule abundance equation was derived from the box core data obtained in KODOS area(long.: $154^{\circ}{\sim}151^{\circ}W$, lat.: $9^{\circ}{\sim}12^{\circ}N$) during two survey cruises carried out in 1989 and 1990. The regression equation derived by considering extent of burial of nodule to Handa's equation compensates for the abundance error attributable to partial burial of some nodules by sediments. An average long axis and average extent of burial of nodules in photographed area are determined according to the surface textures of nodules, and nodule coverage is calculated by the image analysis method. Average nodule abundance estimated from seabed photographs by using the equation is approximately 92% of the actual average abundance in KODOS area. The measured sampling points by box core or free fall grab are in general very sparse and hence nodule abundance distribution should be interpolated and extrapolated from measured data to uncharacterized areas. The another goal of this study is to depict continuous distribution of nodule abundance in KODOS area by using PC-version of geostatistical model in which several stages are systematically proceeded. Geostatistics was used to analyse spatial structure and distribution of regionalized variable(nodule abundance) within sets of real data. In order to investigate the spatial structure of nodule abundance in KODOS area, experimental variograms were calculated and fitted to a spherical models in isotropy and anisotropy, respectively. The spherical structure models were used to map out distribution of the nodule abundance for isotropic and anisotropic models by using the kriging method. The result from anisotropic model is much more reliable than one of isotropic model. Distribution map of nodule abundance produced by PC-version of geostatistical model indicates that approximately 40% of KODOS area is considered to be promising area(nodule abundance > $5kg/m^2$) for mining in case of anisotropy.
Purpose : Recently, the Recon Challenge at the 2009 ISMRM workshop on Data Sampling and Image Reconstruction at Sedona, Arizona was held to evaluate feasibility of highly accelerated acquisition of time resolved contrast enhanced MR angiography. This paper provides the step-by-step description of the winning results of k-t FOCUSS in this competition. Materials and Methods : In previous works, we proved that k-t FOCUSS algorithm successfully solves the compressed sensing problem even for less sparse cardiac cine applications. Therefore, using k-t FOCUSS, very accurate time resolved contrast enhanced MR angiography can be reconstructed. Accelerated radial trajectory data were synthetized from X-ray cerebral angiography images and provided by the organizing committee, and radiologists double blindly evaluated each reconstruction result with respect to the ground-truth data. Results : The reconstructed results at various acceleration factors demonstrate that each components of compressed sensing, such as sparsifying transform and incoherent sampling patterns, etc can have profound effects on the final reconstruction results. Conclusion : From reconstructed results, we see that the compressed sensing dynamic MR imaging algorithm, k-t FOCUSS enables high resolution time resolved contrast enhanced MR angiography.
Kim, Hyun-Joong;Kim, Woo-Hwan;Lee, Sang-Cheol;Im, Jong-Ho;Cho, Sang-Hee;Kim, Ah-Hyoun
Communications for Statistical Applications and Methods
/
v.15
no.5
/
pp.697-708
/
2008
Operational risk is defined as the risk of loss resulting from inadequate or failed internal processes, people and systems, or external events. The advanced measurement approach proposed by Basel committee uses loss distribution approach(LDA) which quantifies operational loss based on bank's own historical data and measurement system. LDA involves two distribution fittings(frequency and severity) and then generates aggregate loss distribution by employing mathematical convolution. An objective validation for the operational risk measurement is essential because the operational risk measurement allows flexibility and subjective judgement to calculate regulatory capital. However, the methodology to verify the soundness of the operational risk measurement was not fully developed because the internal operational loss data had been extremely sparse and the modeling of extreme tail was very difficult. In this paper, we propose a methodology for the validation of operational risk measurement based on bootstrap confidence intervals of operational VaR(value at risk). We derived two methods to generate confidence intervals of operational VaR.
Background : In spite of the worldwide relevance of obsessive-compulsive disorder Ed-highlight : Unclear. Perhaps consider changing word choice. (OCD), there are considerable differences in prevalence, sex ratio, comorbidity patterns, and sociodemographic correlates. Data on subclinical OCD have been sparse to date. Methods : Data stemmed from the Korea Epidemiologic Catchment Area (KECA) study which had been carried out from April to December 2001. Korean versions of DSM-IV adapted Composite International Diagnostic Interview were administered to a representative sample of 6275 persons aged 18-64 living in the community. DSM-IV based criteria for subclinical OCD were applied. Results : The lifetime prevalence rates for OCD and subclinical OCD were 0.8% and 6.6%, respectively. In both OCD and subclinical OCD, the rates for males and females were not statistically different. OCD was demonstrated to be associated with depressive disorder, bipolar disorder, social phobia, generalized anxiety disorder, and alcohol and nicotine dependence. Additionally, subclinical OCD was associated with posttraumatic stress and somatoform disorders. Comorbidity rates in subclinical OCD were lower than those in OCD. Conclusions : The lifetime prevalence rate for OCD was less than 1% in the Korean general population. Age distribution and comorbidity patterns suggest that subclinical OCD represents a broad and heterogeneous syndrome and not simply a milder form of OCD.
Korean Journal of Agricultural and Forest Meteorology
/
v.13
no.1
/
pp.35-40
/
2011
While high-definition precipitation maps with a 270 m spatial resolution are available for South Korea, there is little information on geospatial availability of precipitation water for the famine - plagued North Korea. The restricted data access and sparse observations prohibit application of the widely used PRISM (Parameter-elevation Regressions on Independent Slopes Model) to North Korea for fine-resolution mapping of precipitation. A hybrid method which complements the PRISM grid with a sub-grid scale elevation function is suggested to estimate precipitation for remote areas with little data such as North Korea. The fine scale elevation - precipitation regressions for four sloping aspects were derived from 546 observation points in South Korea. A 'virtual' elevation surface at a 270 m grid spacing was generated by inverse distance weighed averaging of the station elevations of 78 KMA (Korea Meteorological Administration) synoptic stations. A 'real' elevation surface made up from both 78 synoptic and 468 automated weather stations (AWS) was also generated and subtracted from the virtual surface to get elevation difference at each point. The same procedure was done for monthly precipitation to get the precipitation difference at each point. A regression analysis was applied to derive the aspect - specific coefficient of precipitation change with a unit increase in elevation. The elevation difference between 'virtual' and 'real' surface was calculated for each 270m grid points across North Korea and the regression coefficients were applied to obtain the precipitation corrections for the PRISM grid. The correction terms are now added to the PRISM generated low resolution (~2.4 km) precipitation map to produce the 270 m high resolution map compatible with those available for South Korea. According to the final product, the spatial average precipitation for entire territory of North Korea is 1,196 mm for a climatological normal year (1971-2000) with standard deviation of 298 mm.
Yoon, Dong Jin;Lee, Ju Hong;Choi, Bum Ghi;Song, Jae Won
Smart Media Journal
/
v.10
no.3
/
pp.39-47
/
2021
Enhanced index tracking is a problem of optimizing the objective function to generate returns above the index based on the index tracking that follows the market return. In order to avoid problems such as large transaction costs and illiquidity, we used a method of constructing a portfolio by selecting only some of the stocks included in the index. Commonly used enhanced index tracking methods tried to find the optimal portfolio with only one objective function in all tested periods, but it is almost impossible to find the ultimate strategy that always works well in the volatile financial market. In addition, it is important to improve generalization performance beyond optimizing the objective function for training data due to the nature of the financial market, where statistical characteristics change significantly over time, but existing methods have a limitation in that there is no direct discussion for this. In order to solve these problems, this paper proposes ensemble learning that composes a portfolio by combining several objective functions and a 3-stage portfolio selection algorithm that can select a portfolio by applying criteria other than the objective function to the training data. The proposed method in an experiment using the S&P500 index shows Sharpe ratio that is 27% higher than the index and the existing methods, showing that the 3-stage portfolio selection algorithm and ensemble learning are effective in selecting an enhanced index portfolio.
Background: Pseudorabies, also known as Aujeszky's disease, is caused by the pseudorabies virus (PRV) and has been recognized as a critical disease affecting the pig industry and a wide range of animals around the world, resulting in great economic losses each year. Shandong province, one of the most vital food animal-breeding regions in China, has a very dense pig population, within which pseudorabies infections were detected in recent years. The data, however, on PRV epidemiology and coinfection rates of PRV with other major swine diseases is sparse. Objectives: This study aimed to investigate the PRV epidemiology in Shandong and analyze the current control measures. Methods: In this study, a total number of 16,457 serum samples and 1,638 tissue samples, which were collected from 362 intensive pig farms (≥ 300 sows/farm) covered all cities in Shandong, were tested by performing enzyme-linked immunosorbent assay (ELISA) and polymerase chain reaction (PCR). Results: Overall, 52.7% and 91.5% of the serum samples were positive for PRV-gE and -gB, respectively, based on ELISA results. In addition, 15.7% of the tissue samples were PCR positive for PRV. The coinfection rates of PRV with porcine circovirus type 2 (PCV2), porcine reproductive and respiratory syndrome virus, and classical swine fever virus were measured; coinfection with PCV2 was 35.0%, higher than those of the other two viruses. Macroscopic and microscopic lesions were observed in various tissues during histopathological examination. Conclusions: The results demonstrate the PRV prevalence and its coinfection rates in Shandong province and indicate that pseudorabies is endemic in pig farms in this region. This study provides epidemiological data that can be useful in the prevention and control of pseudorabies in Shandong, China.
Recently, following the development of LIDAR technology which can detect distance from the object, the interest for LIDAR based 3D object detection network is getting higher. Previous networks generate inaccurate localization results due to spatial information loss during voxelization and downsampling. In this study, we propose an attention-based convergence method and a camera-LIDAR convergence system to acquire high-level features and high positional accuracy. First, by introducing the attention method into the Voxel-RCNN structure, which is a grid-based 3D object detection network, the multi-scale sparse 3D convolution feature is effectively fused to improve the performance of 3D object detection. Additionally, we propose the late-fusion mechanism for fusing outcomes in 3D object detection network and 2D object detection network to delete false positive. Comparative experiments with existing algorithms are performed using the KITTI data set, which is widely used in the field of autonomous driving. The proposed method showed performance improvement in both 2D object detection on BEV and 3D object detection. In particular, the precision was improved by about 0.54% for the car moderate class compared to Voxel-RCNN.
This study proposes a novel recommender system using the structural hole analysis to reflect qualitative and emotional information in recommendation process. Although collaborative filtering (CF) is known as the most popular recommendation algorithm, it has some limitations including scalability and sparsity problems. The scalability problem arises when the volume of users and items become quite large. It means that CF cannot scale up due to large computation time for finding neighbors from the user-item matrix as the number of users and items increases in real-world e-commerce sites. Sparsity is a common problem of most recommender systems due to the fact that users generally evaluate only a small portion of the whole items. In addition, the cold-start problem is the special case of the sparsity problem when users or items newly added to the system with no ratings at all. When the user's preference evaluation data is sparse, two users or items are unlikely to have common ratings, and finally, CF will predict ratings using a very limited number of similar users. Moreover, it may produces biased recommendations because similarity weights may be estimated using only a small portion of rating data. In this study, we suggest a novel limitation of the conventional CF. The limitation is that CF does not consider qualitative and emotional information about users in the recommendation process because it only utilizes user's preference scores of the user-item matrix. To address this novel limitation, this study proposes cluster-indexing CF model with the structural hole analysis for recommendations. In general, the structural hole means a location which connects two separate actors without any redundant connections in the network. The actor who occupies the structural hole can easily access to non-redundant, various and fresh information. Therefore, the actor who occupies the structural hole may be a important person in the focal network and he or she may be the representative person in the focal subgroup in the network. Thus, his or her characteristics may represent the general characteristics of the users in the focal subgroup. In this sense, we can distinguish friends and strangers of the focal user utilizing the structural hole analysis. This study uses the structural hole analysis to select structural holes in subgroups as an initial seeds for a cluster analysis. First, we gather data about users' preference ratings for items and their social network information. For gathering research data, we develop a data collection system. Then, we perform structural hole analysis and find structural holes of social network. Next, we use these structural holes as cluster centroids for the clustering algorithm. Finally, this study makes recommendations using CF within user's cluster, and compare the recommendation performances of comparative models. For implementing experiments of the proposed model, we composite the experimental results from two experiments. The first experiment is the structural hole analysis. For the first one, this study employs a software package for the analysis of social network data - UCINET version 6. The second one is for performing modified clustering, and CF using the result of the cluster analysis. We develop an experimental system using VBA (Visual Basic for Application) of Microsoft Excel 2007 for the second one. This study designs to analyzing clustering based on a novel similarity measure - Pearson correlation between user preference rating vectors for the modified clustering experiment. In addition, this study uses 'all-but-one' approach for the CF experiment. In order to validate the effectiveness of our proposed model, we apply three comparative types of CF models to the same dataset. The experimental results show that the proposed model outperforms the other comparative models. In especial, the proposed model significantly performs better than two comparative modes with the cluster analysis from the statistical significance test. However, the difference between the proposed model and the naive model does not have statistical significance.
본 웹사이트에 게시된 이메일 주소가 전자우편 수집 프로그램이나
그 밖의 기술적 장치를 이용하여 무단으로 수집되는 것을 거부하며,
이를 위반시 정보통신망법에 의해 형사 처벌됨을 유념하시기 바랍니다.
[게시일 2004년 10월 1일]
이용약관
제 1 장 총칙
제 1 조 (목적)
이 이용약관은 KoreaScience 홈페이지(이하 “당 사이트”)에서 제공하는 인터넷 서비스(이하 '서비스')의 가입조건 및 이용에 관한 제반 사항과 기타 필요한 사항을 구체적으로 규정함을 목적으로 합니다.
제 2 조 (용어의 정의)
① "이용자"라 함은 당 사이트에 접속하여 이 약관에 따라 당 사이트가 제공하는 서비스를 받는 회원 및 비회원을
말합니다.
② "회원"이라 함은 서비스를 이용하기 위하여 당 사이트에 개인정보를 제공하여 아이디(ID)와 비밀번호를 부여
받은 자를 말합니다.
③ "회원 아이디(ID)"라 함은 회원의 식별 및 서비스 이용을 위하여 자신이 선정한 문자 및 숫자의 조합을
말합니다.
④ "비밀번호(패스워드)"라 함은 회원이 자신의 비밀보호를 위하여 선정한 문자 및 숫자의 조합을 말합니다.
제 3 조 (이용약관의 효력 및 변경)
① 이 약관은 당 사이트에 게시하거나 기타의 방법으로 회원에게 공지함으로써 효력이 발생합니다.
② 당 사이트는 이 약관을 개정할 경우에 적용일자 및 개정사유를 명시하여 현행 약관과 함께 당 사이트의
초기화면에 그 적용일자 7일 이전부터 적용일자 전일까지 공지합니다. 다만, 회원에게 불리하게 약관내용을
변경하는 경우에는 최소한 30일 이상의 사전 유예기간을 두고 공지합니다. 이 경우 당 사이트는 개정 전
내용과 개정 후 내용을 명확하게 비교하여 이용자가 알기 쉽도록 표시합니다.
제 4 조(약관 외 준칙)
① 이 약관은 당 사이트가 제공하는 서비스에 관한 이용안내와 함께 적용됩니다.
② 이 약관에 명시되지 아니한 사항은 관계법령의 규정이 적용됩니다.
제 2 장 이용계약의 체결
제 5 조 (이용계약의 성립 등)
① 이용계약은 이용고객이 당 사이트가 정한 약관에 「동의합니다」를 선택하고, 당 사이트가 정한
온라인신청양식을 작성하여 서비스 이용을 신청한 후, 당 사이트가 이를 승낙함으로써 성립합니다.
② 제1항의 승낙은 당 사이트가 제공하는 과학기술정보검색, 맞춤정보, 서지정보 등 다른 서비스의 이용승낙을
포함합니다.
제 6 조 (회원가입)
서비스를 이용하고자 하는 고객은 당 사이트에서 정한 회원가입양식에 개인정보를 기재하여 가입을 하여야 합니다.
제 7 조 (개인정보의 보호 및 사용)
당 사이트는 관계법령이 정하는 바에 따라 회원 등록정보를 포함한 회원의 개인정보를 보호하기 위해 노력합니다. 회원 개인정보의 보호 및 사용에 대해서는 관련법령 및 당 사이트의 개인정보 보호정책이 적용됩니다.
제 8 조 (이용 신청의 승낙과 제한)
① 당 사이트는 제6조의 규정에 의한 이용신청고객에 대하여 서비스 이용을 승낙합니다.
② 당 사이트는 아래사항에 해당하는 경우에 대해서 승낙하지 아니 합니다.
- 이용계약 신청서의 내용을 허위로 기재한 경우
- 기타 규정한 제반사항을 위반하며 신청하는 경우
제 9 조 (회원 ID 부여 및 변경 등)
① 당 사이트는 이용고객에 대하여 약관에 정하는 바에 따라 자신이 선정한 회원 ID를 부여합니다.
② 회원 ID는 원칙적으로 변경이 불가하며 부득이한 사유로 인하여 변경 하고자 하는 경우에는 해당 ID를
해지하고 재가입해야 합니다.
③ 기타 회원 개인정보 관리 및 변경 등에 관한 사항은 서비스별 안내에 정하는 바에 의합니다.
제 3 장 계약 당사자의 의무
제 10 조 (KISTI의 의무)
① 당 사이트는 이용고객이 희망한 서비스 제공 개시일에 특별한 사정이 없는 한 서비스를 이용할 수 있도록
하여야 합니다.
② 당 사이트는 개인정보 보호를 위해 보안시스템을 구축하며 개인정보 보호정책을 공시하고 준수합니다.
③ 당 사이트는 회원으로부터 제기되는 의견이나 불만이 정당하다고 객관적으로 인정될 경우에는 적절한 절차를
거쳐 즉시 처리하여야 합니다. 다만, 즉시 처리가 곤란한 경우는 회원에게 그 사유와 처리일정을 통보하여야
합니다.
제 11 조 (회원의 의무)
① 이용자는 회원가입 신청 또는 회원정보 변경 시 실명으로 모든 사항을 사실에 근거하여 작성하여야 하며,
허위 또는 타인의 정보를 등록할 경우 일체의 권리를 주장할 수 없습니다.
② 당 사이트가 관계법령 및 개인정보 보호정책에 의거하여 그 책임을 지는 경우를 제외하고 회원에게 부여된
ID의 비밀번호 관리소홀, 부정사용에 의하여 발생하는 모든 결과에 대한 책임은 회원에게 있습니다.
③ 회원은 당 사이트 및 제 3자의 지적 재산권을 침해해서는 안 됩니다.
제 4 장 서비스의 이용
제 12 조 (서비스 이용 시간)
① 서비스 이용은 당 사이트의 업무상 또는 기술상 특별한 지장이 없는 한 연중무휴, 1일 24시간 운영을
원칙으로 합니다. 단, 당 사이트는 시스템 정기점검, 증설 및 교체를 위해 당 사이트가 정한 날이나 시간에
서비스를 일시 중단할 수 있으며, 예정되어 있는 작업으로 인한 서비스 일시중단은 당 사이트 홈페이지를
통해 사전에 공지합니다.
② 당 사이트는 서비스를 특정범위로 분할하여 각 범위별로 이용가능시간을 별도로 지정할 수 있습니다. 다만
이 경우 그 내용을 공지합니다.
제 13 조 (홈페이지 저작권)
① NDSL에서 제공하는 모든 저작물의 저작권은 원저작자에게 있으며, KISTI는 복제/배포/전송권을 확보하고
있습니다.
② NDSL에서 제공하는 콘텐츠를 상업적 및 기타 영리목적으로 복제/배포/전송할 경우 사전에 KISTI의 허락을
받아야 합니다.
③ NDSL에서 제공하는 콘텐츠를 보도, 비평, 교육, 연구 등을 위하여 정당한 범위 안에서 공정한 관행에
합치되게 인용할 수 있습니다.
④ NDSL에서 제공하는 콘텐츠를 무단 복제, 전송, 배포 기타 저작권법에 위반되는 방법으로 이용할 경우
저작권법 제136조에 따라 5년 이하의 징역 또는 5천만 원 이하의 벌금에 처해질 수 있습니다.
제 14 조 (유료서비스)
① 당 사이트 및 협력기관이 정한 유료서비스(원문복사 등)는 별도로 정해진 바에 따르며, 변경사항은 시행 전에
당 사이트 홈페이지를 통하여 회원에게 공지합니다.
② 유료서비스를 이용하려는 회원은 정해진 요금체계에 따라 요금을 납부해야 합니다.
제 5 장 계약 해지 및 이용 제한
제 15 조 (계약 해지)
회원이 이용계약을 해지하고자 하는 때에는 [가입해지] 메뉴를 이용해 직접 해지해야 합니다.
제 16 조 (서비스 이용제한)
① 당 사이트는 회원이 서비스 이용내용에 있어서 본 약관 제 11조 내용을 위반하거나, 다음 각 호에 해당하는
경우 서비스 이용을 제한할 수 있습니다.
- 2년 이상 서비스를 이용한 적이 없는 경우
- 기타 정상적인 서비스 운영에 방해가 될 경우
② 상기 이용제한 규정에 따라 서비스를 이용하는 회원에게 서비스 이용에 대하여 별도 공지 없이 서비스 이용의
일시정지, 이용계약 해지 할 수 있습니다.
제 17 조 (전자우편주소 수집 금지)
회원은 전자우편주소 추출기 등을 이용하여 전자우편주소를 수집 또는 제3자에게 제공할 수 없습니다.
제 6 장 손해배상 및 기타사항
제 18 조 (손해배상)
당 사이트는 무료로 제공되는 서비스와 관련하여 회원에게 어떠한 손해가 발생하더라도 당 사이트가 고의 또는 과실로 인한 손해발생을 제외하고는 이에 대하여 책임을 부담하지 아니합니다.
제 19 조 (관할 법원)
서비스 이용으로 발생한 분쟁에 대해 소송이 제기되는 경우 민사 소송법상의 관할 법원에 제기합니다.
[부 칙]
1. (시행일) 이 약관은 2016년 9월 5일부터 적용되며, 종전 약관은 본 약관으로 대체되며, 개정된 약관의 적용일 이전 가입자도 개정된 약관의 적용을 받습니다.