Video Analytic for Human Management and Security and FPGA Accelerated High Concurrency
Main Article Content
Abstract
This paper explores the use of video analytics by leveraging accelerated FPGA technology in combination with high-performance computing, specifically utilizing two Xilinx Alveo U50Lv cards and one U55C card. While many applications exist for motion analysis and detection in videos, the use of FPGAs in this context remains relatively scarce. FPGAs offer significant advantages in terms of energy efficiency and throughput. We present results demonstrating the parallelism capabilities in terms of the number of threads within a single Docker container that shares stack memory, as well as across multiple Docker containers. When operating within a single Docker process, the application shares the same memory space and resources, making it ideal for tasks that require efficient communication or data sharing. In contrast, running in multiple containers isolates processes, each with its own environment, and can significantly increase the number of threads. Our findings show that the combination of these techniques offers optimal performance for video analytics.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
Yi G, Loia V. High-performance computing systems and applications for AI. 2019.
Ferro M, Kloh VP, de Paula FB, Schulze B. Artificial Intelligence and High Performance Computing Convergence. 2019.
Jung D, Lee D, Kim M, Kim J. Efficient data synchronization method on integrated computing environment. 2018
Lee S, Huh JH. An effective security measures for nuclear power plant using big data analysis approach. 2018.
Yin C, Wang H, Yin X, Sun R, Wang J. Improved deep packet inspection in data stream detection. 2018.
Hwang S. A network clock model for time awareness in the Internet of things and artificial intelligence applications. 2019.
Hu S, Zhu Y, Cheng P, Guo C, Tan K, Padhye J, et al. Deadlocks in Datacenter Networks: Why Do They Form, and How to Avoid Them. Microsoft, Hong Kong University of Science and Technology.
Alveo U55C High Performance Compute Card. Available from: https://www.xilinx.com/products/boards-and-kits/alveo/u55c.html#specifications.
Keller AM, Wirthlin MJ. Impact of Soft Errors on Large-Scale FPGA Cloud Computing. In: ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA). New York, NY, USA: Association for Computing Machinery; 2019. p. 272–81. Available from: https://doi.org/10.1145/3289602.3293911.
Buell D, El-Ghazawi T, Gaj K, Kindratenko V. High-Performance reconfigurable computing. IEEE Computer Society. 2007 Mar.
Altera Cooperation White Paper. Accelerating high performance computing with FPGAs. 2007 Oct.
Kumbhar DD, Je Y, Hong S, Lee D, Kim H, Kwon MJ, et al. Molecularly Reconfigurable Neuroplasticity of Layered Artificial Synapse Electronics. Adv Funct Mater. 2024;2311994.
Liu X, Wang R, Shi C, Zou C, Zhu W. Computing Acceleration to Genome-Wide Association Study Based on CPU/FPGA Heterogeneous System. ACM SIGAPP Appl Comput Rev. 2024;23(4):16-26.
Li Y, Han D, Cui M, Yuan F, Zhou Y. RESNETCNN: An abnormal network traffic flows detection model. Comput Sci Inf Syst. 2023;(00):4-4.
Hussen N, Elghamrawy SM, Salem M, El-Desouky AI. A Fully Streaming Big Data Framework for Cyber Security based on Optimized Deep Learning Algorithm. IEEE Access. 2023.
Sultan MT, El Sayed H. QoE-aware analysis and management of multimedia services in 5G and beyond heterogeneous networks. IEEE Access. 2023.
Zhang L, Yang J, Wang T, Sun Z, Sun K, Zeng J. Event Building Algorithm in a Distributed Stream Processing Data Acquisition Platform: D-Matrix. IEEE Trans Nucl Sci. 2023;70(2):105-12.
Jayashankara M, Shah P, Sharma A, Chanak P, Singh SK. A Novel Approach for Short-Term Energy Forecasting in Smart Buildings. IEEE Sens J. 2023;23(5):5307-14.
Potnurwar AV, Bongirwar VK, Ajani S, Shelke N, Dhone M, Parati N. Deep Learning-Based Rule-Based Feature Selection for Intrusion Detection in Industrial Internet of Things Networks. Int J Intell Syst Appl Eng. 2023;11(10s):23-35.
Alonso T, Petrica L, Ruiz M, Petri-Koenig J, Umuroglu Y, Stamelos I, et al. Elastic-DF: Scaling Performance of DNN Inference in FPGA Clouds through Automatic Partitioning. ACM Trans Reconfigurable Technol Syst. 2021 Dec;15(2).
Agiakatsikas D, Foutris N, Sari A, et al. Evaluation of Xilinx Deep Learning Processing Unit under Neutron Irradiation. In: 21st European Conference on Radiation and Its Effects on Components and Systems (RADECS); 2021 Sep 13-17.
Li Y, Liu Z, Xu K, Yu H, Ren F. A GPU-outperforming FPGA accelerator architecture for binary convolutional neural networks. ACM J Emerg Technol Comput Syst. 2018;14(2):1-16.