AI System Design for Robotic Hand to Play the Piano


Wuttichon Aukkhosuwan
Wannarat Suntiamorntut

Abstract

Robotics and Artificial Intelligence (AI) have been introduced as key drivers of Industry 4.0. Many industries, such as manufacturing, agriculture, logistics, and supply chain, have adopted robotics and AI to enhance productivity and reduce costs. Applying AI to creative work is very challenging, especially in music. This paper presents a system that enables a robotic arm to play piano notes with minimal errors. We drew on Optical Music Recognition (OMR), Automatic Music Transcription (AMT), and Music Source Separation (MSS), and addressed the robot arm's cycle-time limitations in building this system. The robotic arm used for testing was built from LEGO. The system performs four functions. The first plays piano notes from sheet music using Sheet Vision and Tesseract-OCR; note-reading accuracy is 21.36%, and note-reading accuracy including note duration is 13.46%. The second plays the piano part separated from a music file using Spleeter and Onsets and Frames; piano note accuracy is 58.32%, note onset-time error is ±0.17, and note duration error is ±0.65. The third plays piano notes transcribed from real-time piano sound using Onsets and Frames in real-time mode. The fourth plays piano notes derived from brain waves by matching the brain-wave frequency to the frequency of a piano note. The design and experimental results are explained in this paper.
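The fourth function maps a measured frequency (here, a brain-wave frequency) to the nearest piano note. A minimal sketch of such a frequency-to-key mapping under twelve-tone equal temperament is shown below; the helper names are illustrative and not taken from the authors' implementation:

```python
import math

# Twelve-tone equal temperament: piano key n (1..88) has frequency
# f(n) = 440 * 2^((n - 49) / 12), with key 49 = A4 = 440 Hz.
NOTE_NAMES = ["A", "A#", "B", "C", "C#", "D", "D#", "E", "F", "F#", "G", "G#"]

def freq_to_key(freq_hz: float) -> int:
    """Return the piano key number (1-88) nearest to freq_hz."""
    n = round(12 * math.log2(freq_hz / 440.0)) + 49
    return min(max(n, 1), 88)  # clamp to the 88-key range

def key_to_name(n: int) -> str:
    """Return a note name such as 'A4' or 'C#5' for key n."""
    name = NOTE_NAMES[(n - 1) % 12]
    octave = (n + 8) // 12  # key 1 (A0) -> octave 0, key 49 (A4) -> octave 4
    return f"{name}{octave}"
```

Raw EEG frequencies lie well below the piano range, so a practical system would first scale or transpose them into the 88-key band before applying a mapping like this.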

Article Details

Section
Research Articles

References

Weinberg, G.; Raman, A.; Mallikarjuna, T. Interactive jamming with Shimon: A social robotic musician. In: 2009 4th ACM/IEEE International Conference on Human-Robot Interaction (HRI). 2009, 233-234. https://doi.org/10.1145/1514095.1514152.

Lin, JC.; Huang, HH.; Li, YF.; Tai, JC.; Liu, LW. Electronic piano playing robot. In: 2010 International Symposium on Computer, Communication, Control and Automation (3CA). 2010, 2, 353-356. https://doi.org/10.1109/3CA.2010.5533457.

Zhang, D.; Jianhe, L.; Beizhi, L.; Lau, D.; Cameron, C. Design and analysis of a piano playing robot. In: 2009 International Conference on Information and Automation. 2009, 757-761. https://doi.org/10.1109/ICINFA.2009.5205022.

Zhang, A.; Malhotra, M.; Matsuoka, Y. Musical piano performance by the ACT Hand. In: 2011 IEEE International Conference on Robotics and Automation. 2011, 3536-3541. https://doi.org/10.1109/ICRA.2011.5980342.

Li, YF.; Chuang, LL. Controller design for music playing robot — Applied to the anthropomorphic piano robot. In: 2013 IEEE 10th International Conference on Power Electronics and Drive Systems (PEDS). 2013, 968-973. https://doi.org/10.1109/PEDS.2013.6527158.

Fahn, CS.; Lu, KJ. Humanoid recognizing piano scores techniques. In: 2014 International Conference on Information Science, Electronics and Electrical Engineering. 2014, 3, 1397-1402. https://doi.org/10.1109/InfoSEEE.2014.6946149.

Luo, Y.; Mesgarani, N. Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation. IEEE/ACM Trans Audio, Speech, Lang Process. 2019, 27(8), 1256-1266. https://doi.org/10.1109/TASLP.2019.2915167.

Defossez, A.; Usunier, N.; Bottou, L.; Bach, F. Music Source Separation in the Waveform Domain. Published online 2019.

Hennequin, R.; Khlif, A.; Voituret, F.; Moussallam, M. Spleeter: a fast and efficient music source separation tool with pre-trained models. Journal of Open Source Software. 2020, 5, 2154. https://doi.org/10.21105/joss.02154.

Hawthorne, C.; Elsen, E.; Song, J.; Roberts, A.; Raffel, C.; Engel, J.; Oore, S.; Eck, D. Onsets and Frames: Dual-Objective Piano Transcription. In: Proceedings of the 19th ISMIR Conference, Paris, France, September 23-27, 2018, 50-57.

Trabelsi, C.; Bilaniuk, O.; Zhang, Y.; et al. Deep Complex Networks. Published online 2017. https://doi.org/10.48550/ARXIV.1705.09792.

Hawthorne, C.; Simon, I.; Swavely, R.; Manilow, E.; Engel, J. Sequence-to-Sequence Piano Transcription with Transformers. Published online 2021. https://doi.org/10.48550/ARXIV.2107.09142.

Raffel, C. Learning-Based Methods for Comparing Sequences, with Applications to Audio-to-MIDI Alignment and Matching. Columbia University, 2016.

Sharma, S.; Mittal, VK. Window selection for accurate music source separation using REPET. In: 2016 3rd International Conference on Signal Processing and Integrated Networks (SPIN). 2016, 270-274. https://doi.org/10.1109/SPIN.2016.7566702.