• Title/Summary/Keyword: Transaction-Based Data

Search Result 543, Processing Time 0.022 seconds

Enzyme Metabolite Analysis Using Data Mining (데이터 마이닝을 활용한 효소 대사물의 분석)

  • Ceong, Hyi-Thaek;Park, Chun-Goo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.11 no.10
    • /
    • pp.969-982
    • /
    • 2016
  • Recently, the researches to discovery drug candidates from natural herbs have received considerable attention. In human body, enzyme mostly metabolize the compounds of natural herbs. In this study, we analysis the enzyme interactions using assoication mining. We get this data from BRENDA(: BRaunschweig ENzyme DAtabase) system. Based on enzyme interaction model, we divide the metabolites into substrate metabolites, product metabolites, inhibitor metabolites, and activating metabolites. We then compose substrate metabolite transaction, product metabolite transaction with each metabolites and enzyme interaction transaction with all metabolites. Also we take account of organism for each transactions. We mine frequent metabolites and patterns from six transactions using association rule mining. And we analysis the relationship among metabolites. As a result, we identify the distributions and patterns of metabolites consist in enzyme interactions. We found that metabolites include in only substrate are identified and have very low supports. This results can be useful to develop the effective metabolism prediction model for compounds of natural herbs.

Market Microstructure Noise and Optimal Sampling Frequencies for the Realized Variances of Stock Prices of Four Leading Korean Companies (한국주요상장사 주가 실현변동성 추정시 시장미시구조 잡음과 최적 추출 빈도수)

  • Oh, Rosy;Shin, Dong-Wan
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.15-27
    • /
    • 2012
  • We have studied the realized variance(RV) of intra-day returns and market microstructure noise based on high-frequency stock transaction data for the four largest companies in terms of market capitalization in the KOSPI. First, non-negligible biases are observed for the RV and for the bias-corrected realized variance($RV_{AC_1}$) which is constructed by adjusting RV for the first order autocorrelation in intra-day returns. Bias is more obvious for the RV and the $RV_{AC_1}$ when intra-day returns are sampled more frequently than every 2 minutes. Transaction Time Sampling(TTS) is shown to be better than Calendar Time Sampling(CTS) in terms of biases of the RV and the $RV_{AC_1}$ for the 4 companies. The analysis reveals that market microstructure noise is temporally dependent. Second, by using the Noise-to-Signal Ratio(NSR), we estimate sampling frequencies that are optimal in terms of the Mean Square Errors(MSE) of the RV and the $RV_{AC_1}$. The optimal sampling frequencies are around 200 for RV and is around 5000 for the $RV_{AC_1}$ for all the four stock prices. For the 6 hour transaction period of the Korean stock trading, these correspond to about 2 minutes and 6 seconds.

A Study on Fraud Detection in the C2C Used Trade Market Using Doc2vec

  • Lim, Do Hyun;Ahn, Hyunchul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.3
    • /
    • pp.173-182
    • /
    • 2022
  • In this paper, we propose a machine learning model that can prevent fraudulent transactions in advance and interpret them using the XAI approach. For the experiment, we collected a real data set of 12,258 mobile phone sales posts from Joonggonara, a major domestic online C2C resale trading platform. Characteristics of the text corresponding to the post body were extracted using Doc2vec, dimensionality was reduced through PCA, and various derived variables were created based on previous research. To mitigate the data imbalance problem in the preprocessing stage, a complex sampling method that combines oversampling and undersampling was applied. Then, various machine learning models were built to detect fraudulent postings. As a result of the analysis, LightGBM showed the best performance compared to other machine learning models. And as a result of SHAP, if the price is unreasonably low compared to the market price and if there is no indication of the transaction area, there was a high probability that it was a fraudulent post. Also, high price, no safe transaction, the more the courier transaction, and the higher the ratio of 0 in the price also led to fraud.

A Study on the Data-Based Organizational Capabilities by Convergence Capabilities Level of Public Data (공공데이터 융합역량 수준에 따른 데이터 기반 조직 역량의 연구)

  • Jung, Byoungho;Joo, Hyungkun
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.18 no.4
    • /
    • pp.97-110
    • /
    • 2022
  • The purpose of this study is to analyze the level of public data convergence capabilities of administrative organizations and to explore important variables in data-based organizational capabilities. The theoretical background was summarized on public data and use activation, joint use, convergence, administrative organization, and convergence constraints. These contents were explained Public Data Act, the Electronic Government Act, and the Data-Based Administrative Act. The research model was set as the data-based organizational capabilities effect by a data-based administrative capability, public data operation capabilities, and public data operation constraints. It was also set whether there is a capabilities difference data-based on an organizational operation by the level of data convergence capabilities. This study analysis was conducted with hierarchical cluster analysis and multiple regression analysis. As the research result, First, hierarchical cluster analysis was classified into three groups. It was classified into a group that uses only public data and structured data, a group that uses public data on both structured and unstructured data, and a group that uses both public and private data. Second, the critical variables of data-based organizational operation capabilities were found in the data-based administrative planning and administrative technology, the supervisory organizations and technical systems by public data convergence, and the data sharing and market transaction constraints. Finally, the essential independent variables on data-based organizational competencies differ by group. This study contributed. As a theoretical implication, this research is updated on management information systems by explaining the Public Data Act, the Electronic Government Act, and the Data-Based Administrative Act. As a practical implication, the activity reinforcement of public data should be promoting the establishment of data standardization and search convenience and elimination of the lukewarm attitudes and Selfishness behavior for data sharing.

A Multimedia Recommender System Using User Playback Time (사용자의 재생 시간을 이용한 멀티미디어 추천 시스템)

  • Kwon, Hyeong-Joon;Chung, Dong-Keun;Hong, Kwang-Seok
    • Journal of Internet Computing and Services
    • /
    • v.10 no.1
    • /
    • pp.111-121
    • /
    • 2009
  • In this paper, we propose a multimedia recommender system using user's playback time. Proposed system collects multimedia content which is requested by user and its user‘s playback time, as web log data. The system predicts playback time.based preference level and related contents from collected transaction database by fuzzy association rule mining. Proposed method has a merit which sorts recommendation list according to preference without user’s custom preference data, and prevents a false preference. As an experimental result, we confirm that proposed system discovers useful rules and applies them to recommender system from a transaction which doesn‘t include custom preferences.

  • PDF

Does Loss-Leader Pricing Work in Online Shopping Malls?

  • Yeum Dai-Sung;Chae Myungsin;Kim Ji-Young
    • Management Science and Financial Engineering
    • /
    • v.11 no.3
    • /
    • pp.95-107
    • /
    • 2005
  • As online shopping malls have emerged as a substantial shopping channel, they have used various sales promotion strategies to acquire new customers. Most of these strategies have been applied by offline malls for years. One, loss-leader pricing, is a type of promotional pricing in which stores sell well known products below their marginal cost, in order to attract customers and induce them to purchase more goods through impulse buying. This strategy is based on the expectation that customers will factor transaction costs into their purchasing decisions. However, its application to online malls fails to recognize that transaction costs are lower online, and that customers will behave differently as a result. Our study predicts that loss-leader pricing will not work online because online malls entail lower searching and moving costs than offline malls The study examines the effectiveness of loss-leader pricing with empirical data from a survey as well as log data from a Korean online shopping mall. The results show that while loss-leader pricing does attract customers to online shopping malls, it encourages cherry-picking rather than impulse purchases of regular-price goods.

Performance Analysis of a PCI-Bus based RAID System (PCI-버스 기반 RAID 시스템의 버스 성능 분석)

  • 이찬수;성영락;오하령
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.7_8
    • /
    • pp.370-380
    • /
    • 2003
  • A large RAID system may consist of several PCI bus segments since a PCI bus segment can connect only a limited number of disks. In this paver, PCI bus transactions in a RAID system are classified in terms of the initiator and the target of the transaction. Also, the data transfer time of each transaction type is analyzed. By using the analysis results, read and write performance of two RAID system configurations are formulated. From simulation of the RAID system using the DEVS formalism, performance of the configurations are evaluated and compared with the analytical results while changing various system parameters.

Financial Footnote Analysis for Financial Ratio Predictions based on Text-Mining Techniques (재무제표 주석의 텍스트 분석 통한 재무 비율 예측 향상 연구)

  • Choe, Hyoung-Gyu;Lee, Sang-Yong Tom
    • Knowledge Management Research
    • /
    • v.21 no.2
    • /
    • pp.177-196
    • /
    • 2020
  • Since the adoption of K-IFRS(Korean International Financial Reporting Standards), the amount of financial footnotes has been increased. However, due to the stereotypical phrase and the lack of conciseness, deriving the core information from footnotes is not really easy yet. To propose a solution for this problem, this study tried financial footnote analysis for financial ratio predictions based on text-mining techniques. Using the financial statements data from 2013 to 2018, we tried to predict the earning per share (EPS) of the following quarter. We found that measured prediction errors were significantly reduced when text-mined footnotes data were jointly used. We believe this result came from the fact that discretionary financial figures, which were hardly predicted with quantitative financial data, were more correlated with footnotes texts.

The Development and Application of Office Price Index for Benchmark in Seoul using Repeat Sales Model (반복매매모형을 활용한 서울시 오피스 벤치마크 가격지수 개발 및 시험적 적용 연구)

  • Ryu, Kang Min;Song, Ki Wook
    • Land and Housing Review
    • /
    • v.11 no.2
    • /
    • pp.33-46
    • /
    • 2020
  • As the fastest growing office transaction volume in Korea, there's been a need for development of indicators to accurately diagnose the office capital market. The purpose of this paper is experimentally calculate to the office price index for effective benchmark indices in Seoul. The quantitative methodology used a Case-Shiller Repeat Sales Model (1991), based on actual multiple office transaction dataset with over minimum 1,653 ㎡ from Q3 1999 to 4Q 2019 in the case of 1,536 buildings within Seoul Metropolitan. In addition, the collected historical data and spatial statistical analysis tools were treated with the SAS 9.4 and ArcGIS 10.7 programs. The main empirical results of research are briefly summarized as follows; First, Seoul office price index was estimated to be 344.3 point (2001.1Q=100.0P) at the end of 2019, and has more than tripled over the past two decades. it means that the sales price of office per 3.3 ㎡ has consistently risen more than 12% every year since 2000, which is far above the indices for apartment housing index, announced by the MOLIT (2009). Second, between quarterly and annual office price index for the two-step estimation of the MIT Real Estate Research Center (MIT/CRE), T, L, AL variables have statistically significant coefficient (Beta) all of the mode l (p<0.01). Third, it was possible to produce a more stable office price index against the basic index by using the Moore-Penrose's pseoudo inverse technique at low transaction frequency. Fourth, as an lagging indicators, the office price index is closely related to key macroeconomic indicators, such as GDP(+), KOSPI(+), interest rates (5-year KTB, -). This facts indicate that long-term office investment tends to outperform other financial assets owing to high return and low risk pattern. In conclusion, these findings are practically meaningful to presenting an new office price index that increases accuracy and then attempting to preliminary applications for the case of Seoul. Moreover, it can provide sincerely useful benchmark about investing an office and predicting changes of the sales price among market participants (e.g. policy maker, investor, landlord, tenant, user) in the future.

Design of a Secure Payment Mechanism based on S/MIME (S/MIME을 적용한 안전한 지불 메커니즘 설계)

  • Chun, Cheul-Woo;Lee, Jong-Hu;Lee, Sang-Ho
    • Journal of KIISE:Information Networking
    • /
    • v.29 no.5
    • /
    • pp.482-494
    • /
    • 2002
  • In E-mail based accounting system, the remitter does not have need to find collector's account number. To transfer money to a collector's account, what remitter need is just a collector's E-mail address. But the current E-mail based accounting systems are built on SSL technology. Basically SSL provides some security services - confidentiality, user authentication and data integrity, but does not provide non-repudiation. So, in the current E-mail based accounting system, it is possible to deny transaction. And there is no receipt of transaction. In this paper, we design and implementation of a S/MIME applied Secure Payment Mechanism. In our system, every account information - account number, receiver name, amount of money, etc. - is included in a 'check' message. And this message is protected under the Secure Web-mail using S/MIME. In a view point of the convenience, users using our system do not have need to find collector's account number. And in a view point of the security, our system provides confidentiality, user authentication, data integrity and non-repudiation. Moreover our system provides a receipt.