The Imputation Many Missing Value in Time Series Data Use Multivariate Relationships

พยุง มีสัจ; กรสิริณัฐ โรจนวรรณ์

doi:10.14456/jist.2018.11

PDF

เผยแพร่แล้ว: มิ.ย. 23, 2018

DOI: https://doi.org/10.14456/jist.2018.11

คำสำคัญ:

การเติมค่าสูญหาย ข้อมูลอนุกรมเวลา ข้อมูลหลายตัวแปร โครงข่ายประสาทเทียม การทําเหมืองข้อมูล เครื่องจักรเรียนรู้

พยุง มีสัจ

ภาควิชาการจัดการเทคโนโลยีสารสนเทศ คณะเทคโนโลยีสารสนเทศ มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ กรุงเทพฯ

กรสิริณัฐ โรจนวรรณ์

ภาควิชาเทคโนโลยีสารสนเทศ คณะเทคโนโลยีสารสนเทศมหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ กรุงเทพฯ

บทคัดย่อ

- ข้อมูลอนุกรมเวลามีความสำคัญในงานต่างๆ มากมายหลายประเภท ซึ่งมีประโยชน์ในการพยากรณ์แนวโน้มเพื่อประกอบการตัดสินในธุรกิจ ปัญหาของการเก็บข้อมูลที่ไม่ครบถ้วน ข้อมูลที่สูญหายจำนวนมาก จึงไม่สามารถนำไปใช้ในการวิเคราะห์แนวโน้วได้อย่างมีประสิทธิภาพ บทความวิจัยนี้นำเสนอวิธีการเติมค่าสูญหายในข้อมูลอนุกรมเวลาจำนวนมากใช้ความสัมพันธ์หลายตัวแปร โดยใช้ข้อมูลที่มีอยู่สร้างตัวแบบการเติมค่าสูญหาย การวิจัยเน้นที่การค้นหารูปแบบ ข้อมูลที่เหมาะสมสำหรับใช้สอนตัวแบบการเติมค่าสูญหาย ด้วยการเปรียบเทียบเทคนิคการแทนค่า จำนวน 4 เทคนิค ได้แก่ ค่าเฉลี่ยแถว (Row Average) เพื่อนบ้านใกล้เคียง (K-Nearest Neighbor: KNN) ระบบคลุมเครือ (Fuzzy Logic Systems) โครงข่ายประสาทเทียม (Artificial Neural Network) ผลวิจัยพบว่า โครงข่ายประสาทเทียมให้ผลการทำนาย ในชุดทดสอบได้ดีที่สุด และเมื่อทำไปใช้ในการแทนที่ค่าสูญหายให้ผลลัพธ์คล้ายค่าจริง ซึ่งการใช้ข้อมูลที่มีอยู่บางส่วน จากหลายตัวแปรสามารถนำไปใช้สำหรับการสร้างตัวแบบแทนค่าสูญหายได้อย่างมีประสิทธิภาพ ทั้งนี้ข้อมูลตัวแปรหลาย ตัวแปรจะมีผล โดยตรงต่อการจัดรูปแบบข้อมูลและตัวแบบ หรือไม่นั้นควรทำการวิเคราะห์ทำความเข้าใจในข้อมูลอย่างดี ก่อนการนำไปสร้างตัวแบบการแทนค่าสูญหาย

รูปแบบการอ้างอิง

[1]

มีสัจ พ. และ โรจนวรรณ์ ก., “การแทนค่าสูญหายจำนวนมากในข้อมูลอนุกรมเวลา ใช้ความสัมพันธ์หลายตัวแปร”, JIST, ปี 8, ฉบับที่ 1, น. 16–25, มิ.ย. 2018.

ฉบับ

ปีที่ 8 ฉบับที่ 1 (2018): Journal of Information Science and Technology (JIST) [Jan. 2018 - Jun. 2018]

ประเภทบทความ

บทความวิจัย Soft Computing:

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

I/we certify that I/we have participated sufficiently in the intellectual content, conception and design of this work or the analysis and interpretation of the data (when applicable), as well as the writing of the manuscript, to take public responsibility for it and have agreed to have my/our name listed as a contributor. I/we believe the manuscript represents valid work. Neither this manuscript nor one with substantially similar content under my/our authorship has been published or is being considered for publication elsewhere, except as described in the covering letter. I/we certify that all the data collected during the study is presented in this manuscript and no data from the study has been or will be published separately. I/we attest that, if requested by the editors, I/we will provide the data/information or will cooperate fully in obtaining and providing the data/information on which the manuscript is based, for examination by the editors or their assignees. Financial interests, direct or indirect, that exist or may be perceived to exist for individual contributors in connection with the content of this paper have been disclosed in the cover letter. Sources of outside support of the project are named in the cover letter.
I/We hereby transfer(s), assign(s), or otherwise convey(s) all copyright ownership, including any and all rights incidental thereto, exclusively to the Journal, in the event that such work is published by the Journal. The Journal shall own the work, including 1) copyright; 2) the right to grant permission to republish the article in whole or in part, with or without fee; 3) the right to produce preprints or reprints and translate into languages other than English for sale or free distribution; and 4) the right to republish the work in a collection of articles in any other mechanical or electronic format.
We give the rights to the corresponding author to make necessary changes as per the request of the journal, do the rest of the correspondence on our behalf and he/she will act as the guarantor for the manuscript on our behalf.
All persons who have made substantial contributions to the work reported in the manuscript, but who are not contributors, are named in the Acknowledgment and have given me/us their written permission to be named. If I/we do not include an Acknowledgment that means I/we have not received substantial contributions from non-contributors and no contributor has been omitted.

เอกสารอ้างอิง

1. H. Song, C. Miao, W. Roel, Z. Shen, and F. Catthoor, “Implementation of Fuzzy Cognitive Maps Based on Fuzzy Neural Network and Application in Prediction of Time Series,” IEEE Trans. Fuzzy Syst., vol. 18, no. 2, pp. 233–250, Apr. 2010.

2. X. Bai, F. Zhang, J. Hou, F. Xia, A. Tolba, and E. Elashkar, “Implicit Multi-Feature Learning for Dynamic Time Series Prediction of the Impact of Institutions,” IEEE Access, vol. 5, pp. 16372–16382, 2017.

3. I. Pratama, A. E. Permanasari, I. Ardiyanto, and R. Indrayani, “A review of missing values handling methods on time-series data,” in 2016 International Conference on Information Technology Systems and Innovation (ICITSI), pp. 1–6, 2016.

4. Y. S. Afrianti, “Imputation Algorithm Based on Copula for Missing,”, pp. 252–257, 2014.

5. J. D. Velasquez, “Adaptive Multidimensional Neuro-Fuzzy Inference System for Time Series Prediction,” IEEE Lat. Am. Trans., vol. 13, no. 8, pp. 2694–2699, Aug. 2015.

6. W. Insuwan, U. Suksawatchon, and J. Suksawatchon, “Improving missing values imputation in collaborative filtering with user-preference genre and singular value decomposition,” in 2014 6th International Conference on Knowledge and Smart Technology (KST), pp. 87–92, 2014.

7. G. Chang, Y. Zhang, and D. Yao, “Missing data imputation for traffic flow based on improved local least squares,” Tsinghua Sci. Technol., vol. 17, no. 3, pp. 304–309, Jun. 2012.

8. Y. Li, A. Ngom, and L. Rueda, “Missing value imputation methods for gene-sample-time microarray data analysis,” in 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, pp. 1–7, 2010.

9. P. Keerin, W. Kurutach, and T. Boongoen, “Cluster-based KNN missing value imputation for DNA microarray data,” in 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 445–450, 2012.

10. H. Ichihashi, K. Honda, A. Notsu, and T. Yagi, “Fuzzy c-Means Classifier with Deterministic Initialization and Missing Value Imputation,” in 2007 IEEE Symposium on Foundations of Computational Intelligence, pp. 214–221, 2007.

11. พยุง มีสัจ, ระบบฟัซซีและโครงข่ายประสาทเทียม, มหาวิทยาลัยเทคโนโลยีพระจอมเกล้าพระนครเหนือ. 2555.

12. N. A. Setiawan, P. A. Venkatachalam, and A. F. M. Hani, “Missing Attribute Value Prediction Based on Artificial Neural Network and Rough Set Theory,” presented at the 2008 International Conference on BioMedical Engineering and Informatics, vol. 1, pp. 306–310, 2008.

13. R. and R. D. Little, “Statistical Analysis with Missing Data,” Wiley, New York., p. 381, 1987.

Article Sidebar

Main Article Content

บทคัดย่อ

Article Details

เอกสารอ้างอิง