• Title/Summary/Keyword: 벡터

Search Result 7,453, Processing Time 0.033 seconds

Automatic scoring of mathematics descriptive assessment using random forest algorithm (랜덤 포레스트 알고리즘을 활용한 수학 서술형 자동 채점)

  • Inyong Choi;Hwa Kyung Kim;In Woo Chung;Min Ho Song
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.165-186
    • /
    • 2024
  • Despite the growing attention on artificial intelligence-based automated scoring technology as a support method for the introduction of descriptive items in school environments and large-scale assessments, there is a noticeable lack of foundational research in mathematics compared to other subjects. This study developed an automated scoring model for two descriptive items in first-year middle school mathematics using the Random Forest algorithm, evaluated its performance, and explored ways to enhance this performance. The accuracy of the final models for the two items was found to be between 0.95 to 1.00 and 0.73 to 0.89, respectively, which is relatively high compared to automated scoring models in other subjects. We discovered that the strategic selection of the number of evaluation categories, taking into account the amount of data, is crucial for the effective development and performance of automated scoring models. Additionally, text preprocessing by mathematics education experts proved effective in improving both the performance and interpretability of the automated scoring model. Selecting a vectorization method that matches the characteristics of the items and data was identified as one way to enhance model performance. Furthermore, we confirmed that oversampling is a useful method to supplement performance in situations where practical limitations hinder balanced data collection. To enhance educational utility, further research is needed on how to utilize feature importance derived from the Random Forest-based automated scoring model to generate useful information for teaching and learning, such as feedback. This study is significant as foundational research in the field of mathematics descriptive automatic scoring, and there is a need for various subsequent studies through close collaboration between AI experts and math education experts.

A Study on the Extraction of Psychological Distance Embedded in Company's SNS Messages Using Machine Learning (머신 러닝을 활용한 회사 SNS 메시지에 내포된 심리적 거리 추출 연구)

  • Seongwon Lee;Jin Hyuk Kim
    • Information Systems Review
    • /
    • v.21 no.1
    • /
    • pp.23-38
    • /
    • 2019
  • The social network service (SNS) is one of the important marketing channels, so many companies actively exploit SNSs by posting SNS messages with appropriate content and style for their customers. In this paper, we focused on the psychological distances embedded in the SNS messages and developed a method to measure the psychological distance in SNS message by mixing a traditional content analysis, natural language processing (NLP), and machine learning. Through a traditional content analysis by human coding, the psychological distance was extracted from the SNS message, and these coding results were used for input data for NLP and machine learning. With NLP, word embedding was executed and Bag of Word was created. The Support Vector Machine, one of machine learning techniques was performed to train and test the psychological distance in SNS message. As a result, sensitivity and precision of SVM prediction were significantly low because of the extreme skewness of dataset. We improved the performance of SVM by balancing the ratio of data by upsampling technique and using data coded with the same value in first content analysis. All performance index was more than 70%, which showed that psychological distance can be measured well.

High-efficiency development of herbicide-resistant transgenic lilies via an Agrobacterium-mediated transformation system (고효율의 아그로박테리움 형질전환법을 이용한 제초제저항성 나리 식물체 개발)

  • Jong Bo Kim
    • Journal of Plant Biotechnology
    • /
    • v.50
    • /
    • pp.56-62
    • /
    • 2023
  • Transgenic lilies have been obtained using Agrobacterium tumefaciens (AGL1) with the plant scale explants, followed by DL-phosphinothricin (PPT) selection. In this study, scales of lily plants cv. "red flame" were transformed with the pCAMBIA3301 vector containing the gus gene as a reporter and the blpR gene as a selectable marker, as well as a gene of interest showing herbicide tolerance, both driven by the CaMV 35S promoter. Using a 20-minute infection time and a 5-day cultivation period, factors that optimized and demonstrated a high transformation efficiency were achieved. With these conditions, approximately 22-27% efficiency was observed for Agrobacterium-mediated transformation in lilies. After transformation with Agrobacterium, scales of lilies were transferred to MS medium without selective agents for 2 weeks. They were then placed on selection MS medium containing 5 mg/L PPT for a month of further selection and then cultured for another 4-8 weeks with a 4-week subculture regime on the same selection medium. PPT-resistant scales with shoots were successfully rooted and regenerated into plantlets after transferring into hormone-free MS medium. Also, most survived putatively transformed plantlets indicated the presence of the blpR gene by PCR analysis and showed a blue color indicating expression of the gus gene. In conclusion, when 100 scales of lily cv. "red flame" are transformed with Agrobacterium, approximately 22-27 transgenic plantlets can be produced following an optimized protocol. Therefore, this protocol can contribute to the lily breeding program in the future.

Identifying Personal Values Influencing the Lifestyle of Older Adults: Insights From Relative Importance Analysis Using Machine Learning (중고령 노인의 개인적 가치에 따른 라이프스타일 분류: 머신러닝을 활용한 상대적 중요도 분석 )

  • Lim, Seungju;Park, Ji-Hyuk
    • Therapeutic Science for Rehabilitation
    • /
    • v.13 no.2
    • /
    • pp.69-84
    • /
    • 2024
  • Objective : This study aimed to categorize the lifestyles of older adults into two types - healthy and unhealthy, and use machine learning to identify the personal values that influence these lifestyles. Methods : This cross-sectional study targeting middle-aged and older adults (55 years and above) living in local communities in South Korea. Data were collected from 300 participants through online surveys. Lifestyle types were dichotomized by the Yonsei Lifestyle Profile (YLP)-Active, Balanced, Connected, and Diverse (ABCD) responses using latent profile analysis. Personal value information was collected using YLP-Values (YLP-V) and analyzed using machine learning to identify the relative importance of personal values on lifestyle types. Results : The lifestyle of older adults was categorized into healthy (48.87%) and unhealthy (51.13%). These two types showed the most significant difference in social relationship characteristics. Among the machine learning models used in this study, the support vector machine showed the highest classification performance, achieving 96% accuracy and 95% area under the receiver operating characteristic (ROC) curve. The model indicated that individuals who prioritized a healthy diet, sought health information, and engaged in hobbies or cultural activities were more likely to have a healthy lifestyle. Conclusion : This study suggests the need to encourage the expansion of social networks among older adults. Furthermore, it highlights the necessity to comprehensively intervene in individuals' perceptions and values that primarily influence lifestyle adherence.

Expressions of Magnetic Field and Magnetic Gradient Tensor due to an Elliptical Disk (타원판에 의한 자력 및 자력 변화율 텐서 반응식)

  • Hyoungrea Rim
    • Geophysics and Geophysical Exploration
    • /
    • v.27 no.2
    • /
    • pp.108-118
    • /
    • 2024
  • In this study, expressions for the magnetic field and magnetic gradient tensor due to an elliptical disk were derived. Igneous intrusions and kimberlite structures often have elliptical cylinders with axial symmetry and elliptical cross sections. An elliptical cylinder with varying cross-sectional areas was approximated using stacks of elliptical disks. The magnetic fields of elliptical disks were derived using the Poisson relation, which includes the direction of magnetization in the gravity gradient tensor, as described in a previous study (Rim, 2024). The magnetic gradient tensor due to an elliptical disk is derived by differentiating the magnetic fields, which is equivalent to obtaining ten triple-derivative functions acquired by differentiating the gravitational potential of the elliptical disk three times in each axis direction. Because it is possible to exchange the order of differentiation, the magnetic gradient tensor is derived by differentiating the gravitational potential of the elliptical disk three times, which is then converted into a complex line integral along the closed boundary curve of the elliptical disk in the complex plane. The expressions for the magnetic field and magnetic gradient tensor derived from a complex line integral in complex plane are perfectly consistent with those of the circular disk derived from the Lipschitz-Hankel integral.

Federated learning-based client training acceleration method for personalized digital twins (개인화 디지털 트윈을 위한 연합학습 기반 클라이언트 훈련 가속 방식)

  • YoungHwan Jeong;Won-gi Choi;Hyoseon Kye;JeeHyeong Kim;Min-hwan Song;Sang-shin Lee
    • Journal of Internet Computing and Services
    • /
    • v.25 no.4
    • /
    • pp.23-37
    • /
    • 2024
  • Digital twin is an M&S (Modeling and Simulation) technology designed to solve or optimize problems in the real world by replicating physical objects in the real world as virtual objects in the digital world and predicting phenomena that may occur in the future through simulation. Digital twins have been elaborately designed and utilized based on data collected to achieve specific purposes in large-scale environments such as cities and industrial facilities. In order to apply this digital twin technology to real life and expand it into user-customized service technology, practical but sensitive issues such as personal information protection and personalization of simulations must be resolved. To solve this problem, this paper proposes a federated learning-based accelerated client training method (FACTS) for personalized digital twins. The basic approach is to use a cluster-driven federated learning training procedure to protect personal information while simultaneously selecting a training model similar to the user and training it adaptively. As a result of experiments under various statistically heterogeneous conditions, FACTS was found to be superior to the existing FL method in terms of training speed and resource efficiency.

Analysis of effect of global uncertainty on domestic uncertainty using connectedness index (연계성 지수를 이용한 대외 경제 불확실성이 국내 경제 불확실성에 미치는 영향 분석)

  • Sanguk Kwon;Sun Ho Hwang
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.4
    • /
    • pp.509-523
    • /
    • 2024
  • This study estimates connectedness index among the US, China, Europe, Japan, and South Korea using monthly economic policy uncertainty (EPU) data from January 2000 to December 2023. The connectedness index allows us to analyze the effect of global economic uncertainty on domestic economic uncertainty. The EPU is used as a proxy for economic uncertainty. Inter-country connectedness index is computed from variance decomposition. The findings from forecast error variance decomposition show that three-fourths of total uncertainty comes from economic uncertainty in the own country and one-fourth of total uncertainty comes from economic uncertainty in the others. The analysis on net pairwise connectedness reveals that, even though the extent of the effect of economic uncertainty in one country from economic uncertainty in another country varies over time, economic uncertainty in South Korea, a small-open economy, is mainly affected by economic uncertainty in the others. The reverse situation rarely happens except in the specific occurrence such as the collapse of the credit bubble in 2003 and the subsequent years, the inter-Korean summit and North Korea-the US summit in 2018, and the period from the first outbreak of COVID-19 on the implementation of the government's severe regulation against COVID-19.

Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

  • Thay, Setha;Ha, Inay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.1-20
    • /
    • 2013
  • Nowadays, social network is a huge communication platform for providing people to connect with one another and to bring users together to share common interests, experiences, and their daily activities. Users spend hours per day in maintaining personal information and interacting with other people via posting, commenting, messaging, games, social events, and applications. Due to the growth of user's distributed information in social network, there is a great potential to utilize the social data to enhance the quality of recommender system. There are some researches focusing on social network analysis that investigate how social network can be used in recommendation domain. Among these researches, we are interested in taking advantages of the interaction between a user and others in social network that can be determined and known as social relationship. Furthermore, mostly user's decisions before purchasing some products depend on suggestion of people who have either the same preferences or closer relationship. For this reason, we believe that user's relationship in social network can provide an effective way to increase the quality in prediction user's interests of recommender system. Therefore, social relationship between users encountered from social network is a common factor to improve the way of predicting user's preferences in the conventional approach. Recommender system is dramatically increasing in popularity and currently being used by many e-commerce sites such as Amazon.com, Last.fm, eBay.com, etc. Collaborative filtering (CF) method is one of the essential and powerful techniques in recommender system for suggesting the appropriate items to user by learning user's preferences. CF method focuses on user data and generates automatic prediction about user's interests by gathering information from users who share similar background and preferences. Specifically, the intension of CF method is to find users who have similar preferences and to suggest target user items that were mostly preferred by those nearest neighbor users. There are two basic units that need to be considered by CF method, the user and the item. Each user needs to provide his rating value on items i.e. movies, products, books, etc to indicate their interests on those items. In addition, CF uses the user-rating matrix to find a group of users who have similar rating with target user. Then, it predicts unknown rating value for items that target user has not rated. Currently, CF has been successfully implemented in both information filtering and e-commerce applications. However, it remains some important challenges such as cold start, data sparsity, and scalability reflected on quality and accuracy of prediction. In order to overcome these challenges, many researchers have proposed various kinds of CF method such as hybrid CF, trust-based CF, social network-based CF, etc. In the purpose of improving the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method which integrates traditional CF technique with social relationship between users discovered from user's behavior in social network i.e. Facebook. We identify user's relationship from behavior of user such as posts and comments interacted with friends in Facebook. We believe that social relationship implicitly inferred from user's behavior can be likely applied to compensate the limitation of conventional approach. Therefore, we extract posts and comments of each user by using Facebook Graph API and calculate feature score among each term to obtain feature vector for computing similarity of user. Then, we combine the result with similarity value computed using traditional CF technique. Finally, our system provides a list of recommended items according to neighbor users who have the biggest total similarity value to the target user. In order to verify and evaluate our proposed method we have performed an experiment on data collected from our Movies Rating System. Prediction accuracy evaluation is conducted to demonstrate how much our algorithm gives the correctness of recommendation to user in terms of MAE. Then, the evaluation of performance is made to show the effectiveness of our method in terms of precision, recall, and F1-measure. Evaluation on coverage is also included in our experiment to see the ability of generating recommendation. The experimental results show that our proposed method outperform and more accurate in suggesting items to users with better performance. The effectiveness of user's behavior in social network particularly shows the significant improvement by up to 6% on recommendation accuracy. Moreover, experiment of recommendation performance shows that incorporating social relationship observed from user's behavior into CF is beneficial and useful to generate recommendation with 7% improvement of performance compared with benchmark methods. Finally, we confirm that interaction between users in social network is able to enhance the accuracy and give better recommendation in conventional approach.

Index-based Searching on Timestamped Event Sequences (타임스탬프를 갖는 이벤트 시퀀스의 인덱스 기반 검색)

  • 박상현;원정임;윤지희;김상욱
    • Journal of KIISE:Databases
    • /
    • v.31 no.5
    • /
    • pp.468-478
    • /
    • 2004
  • It is essential in various application areas of data mining and bioinformatics to effectively retrieve the occurrences of interesting patterns from sequence databases. For example, let's consider a network event management system that records the types and timestamp values of events occurred in a specific network component(ex. router). The typical query to find out the temporal casual relationships among the network events is as fellows: 'Find all occurrences of CiscoDCDLinkUp that are fellowed by MLMStatusUP that are subsequently followed by TCPConnectionClose, under the constraint that the interval between the first two events is not larger than 20 seconds, and the interval between the first and third events is not larger than 40 secondsTCPConnectionClose. This paper proposes an indexing method that enables to efficiently answer such a query. Unlike the previous methods that rely on inefficient sequential scan methods or data structures not easily supported by DBMSs, the proposed method uses a multi-dimensional spatial index, which is proven to be efficient both in storage and search, to find the answers quickly without false dismissals. Given a sliding window W, the input to a multi-dimensional spatial index is a n-dimensional vector whose i-th element is the interval between the first event of W and the first occurrence of the event type Ei in W. Here, n is the number of event types that can be occurred in the system of interest. The problem of‘dimensionality curse’may happen when n is large. Therefore, we use the dimension selection or event type grouping to avoid this problem. The experimental results reveal that our proposed technique can be a few orders of magnitude faster than the sequential scan and ISO-Depth index methods.hods.

Study of Dynamic Variation Aspect in Lung Volume due to Respiration in Stereotactic Body Radiotherapy Using Abdominal Compressor (복부압박장치를 이용한 정위적방사선치료 시 호흡에 따른 폐암 용적의 동적변이 양상에 대한 연구)

  • Park, Kwang Soon;Kim, Joo Ho;Park, Hyo Kook;Beak, Jong Geal;Lee, Sang Kyoo;Yoon, Jong Won;Cho, Jeong Hee
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.25 no.2
    • /
    • pp.159-165
    • /
    • 2013
  • Purpose: Abdominal compressor is used to control breathing in stereotactic body radiotherapy for lung tumors frequently. We evaluated the dynamic variation aspect of internal tumor volume by breathing. Materials and Methods: We reviewed 20 lung cancer patients (7 upper lung patients, 4 middle lung patients, 9 lower lung patients) who received stereotactic body radiotherapy using abdominal compressor between April 2012 to April 2013. Coordinate shift values were obtained by using four-dimensional cone-beam CT (4D-CBCT) to investigate treatment set-up error and moving tumor position error. To investigate how much difference of each part, we compared 95% confidence interval, maximum values and minimum values of three-dimensional vector value and analyzed conformity degree through the Pearson square correlation coefficient. Results: 95% confidence interval of three-dimensional vector value of each part is 1.8~2.9 mm in upper lobe, 2.3~5.4 mm in middle lobe and 2.2~4.0 mm in lower lobe. Conformity degree was the result that respectively is LR direction 0.75, SI direction 0.68 and AP direction 0.63 in upper lobe, LR direction 0.82, SI direction 0.51 and AP direction 0.92 in middle lobe and LR direction 0.63, SI direction 0.50 and AP direction 0.34 in lower lobe. Conclusion: We showed difference by each site in lung tumor due to respiration by using abdominal compressor. Therefore, we must correct treatment set-up error as well as moving tumor position error by breathing. It is also considered to be useful that it is the use of 4D-CBCT when correcting the error due to various dynamic variation.

  • PDF