Capsule Attention Based Detection of Contraband in X-Ray Images

x-ray optics; contraband detection; matrix capsule networks; attention mechanism; feature enhancement

Authors

  • YAN Zhiming School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo 454000, China & Henan Key Laboratory of Intelligent Detection and Control of Coal Mine Equipment, Jiaozuo 454000, China
  • LI Xinwei School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo 454000, China & Henan Key Laboratory of Intelligent Detection and Control of Coal Mine Equipment, Jiaozuo 454000, China
  • YANG Yi School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo 454000, China & Henan Key Laboratory of Intelligent Detection and Control of Coal Mine Equipment, Jiaozuo 454000, China
April 30, 2024
May 1, 2024

Downloads

 Aiming at the problem of low detection accuracy caused by the different postures, different sizes, complex backgrounds and overlapping occlusions of contraband in the security checking process, a Matrix capsule network based on attention mechanism (MCAM) is designed by introducing attention mechanism into the capsule network. Firstly, a multi-feature-extraction (MFE) module is designed to solve the difficulty of detecting contraband with different sizes and complex backgrounds; then the Conv2d with Attention for ConvCap (CACC) is constructed in the convolutional layer of the capsule, and the weight information is computed in the channel dimensions of the feature map to give the contraband regions higher coefficients to enhance the contraband detection ability when the poses are different and the occlusion is severe; finally, a new capsule detection layer is designed by utilizing the pose matrix of the capsule to give full play to the detection ability of the matrix capsule network. The mAPs of the proposed model on SIXray, SIXray10, and SIXray100 are 85.10%, 64.05%, and 53.67%, respectively, which are 42.79%, 19.73%, and 9.41% higher than those of the original network, higher than those of the current mainstream detection networks, and the number of parameters of the network and the amount of computation are also lower.