• Acta Optica Sinica
  • Vol. 42, Issue 19, 1920001 (2022)
Xingya Zhao, Zhiwei Yang, Jian Dai, Tian Zhang*, and Kun Xu
Author Affiliations
  • State Key Laboratory of Information Photonics and Optical Communications, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • show less
    DOI: 10.3788/AOS202242.1920001 Cite this Article Set citation alerts
    Xingya Zhao, Zhiwei Yang, Jian Dai, Tian Zhang, Kun Xu. VGG16-Based Diffractive Optical Neural Network and Context-Dependent Processing[J]. Acta Optica Sinica, 2022, 42(19): 1920001 Copy Citation Text show less
    Schematic diagram of VGG16-ECNN,and convolution process. (a) Schematic diagram of VGG16-ECNN; (b) convolution process
    Fig. 1. Schematic diagram of VGG16-ECNN,and convolution process. (a) Schematic diagram of VGG16-ECNN; (b) convolution process
    Schematic diagrams of VGG16-DONN structure. (a) Simplified diagram of VGG16-ECNN; (b) 4f system [fx,y is input complex amplitude function, Fu,v is Fourier transform of fx,y, and fξ,η is reconstructed signal restored by second convex lens]
    Fig. 2. Schematic diagrams of VGG16-DONN structure. (a) Simplified diagram of VGG16-ECNN; (b) 4f system [fx,y is input complex amplitude function, Fu,v is Fourier transform of fx,y, and fξ,η is reconstructed signal restored by second convex lens]
    Weights of convolution kernels
    Fig. 3. Weights of convolution kernels
    Dataset examples. (a) CelebA dataset; (b) cat and dog dataset
    Fig. 4. Dataset examples. (a) CelebA dataset; (b) cat and dog dataset
    Training flow chart
    Fig. 5. Training flow chart
    Original image and output images of two network structures through partial convolution kernel in the first convolution layer.(a) Face image; (b) VGG16-ECNN; (c) VGG16-DONN
    Fig. 6. Original image and output images of two network structures through partial convolution kernel in the first convolution layer.(a) Face image; (b) VGG16-ECNN; (c) VGG16-DONN
    Output of face image convolved with all convolution kernels in the first convolution layer
    Fig. 7. Output of face image convolved with all convolution kernels in the first convolution layer
    Output graph of each convolution block after inputting face image to VGG16-DONN structure. (a) Conv1; (b) Conv2; (c) Conv3; (d) Conv4; (e) Conv5
    Fig. 8. Output graph of each convolution block after inputting face image to VGG16-DONN structure. (a) Conv1; (b) Conv2; (c) Conv3; (d) Conv4; (e) Conv5
    Original image and output graphs of two network structures through partial convolution kernel in the first convolution layer.(a) Dog image; (b) VGG16-ECNN; (c) VGG16-DONN
    Fig. 9. Original image and output graphs of two network structures through partial convolution kernel in the first convolution layer.(a) Dog image; (b) VGG16-ECNN; (c) VGG16-DONN
    VGG16-DONN for CDP
    Fig. 10. VGG16-DONN for CDP
    OWM algorithm flow chart
    Fig. 11. OWM algorithm flow chart
    Training results of VGG16-DONN and VGG16-ECNN. (a) Training accuracy and validation accuracy of VGG16-DONN and VGG16-ECNN; (b) training loss and validation loss of VGG16-DONN and VGG16-ECNN
    Fig. 12. Training results of VGG16-DONN and VGG16-ECNN. (a) Training accuracy and validation accuracy of VGG16-DONN and VGG16-ECNN; (b) training loss and validation loss of VGG16-DONN and VGG16-ECNN
    Training/validation accuracy and time obtained by varying different parameters. (a) Weights of different layers trained by VGG16-ECNN structure; (b) weights of different layers trained by VGG16-DONN structure; (c) VGG16-DONN structure changes optimizer; (d) VGG16-DONN structure changes learning rate
    Fig. 13. Training/validation accuracy and time obtained by varying different parameters. (a) Weights of different layers trained by VGG16-ECNN structure; (b) weights of different layers trained by VGG16-DONN structure; (c) VGG16-DONN structure changes optimizer; (d) VGG16-DONN structure changes learning rate
    Convolution layerTime /sProportion of total time /%
    Conv1_10.9222.3
    Conv1_20.338.0
    Conv2_10.245.8
    Conv2_20.276.6
    Conv3_10.225.3
    Conv3_20.256.1
    Conv3_30.256.1
    Conv4_10.245.8
    Conv4_20.276.6
    Conv4_30.276.6
    Conv5_10.286.8
    Conv5_20.297.0
    Conv5_30.297.0
    Total time4.12100.0
    Table 1. Time of each convolutional layer of VGG16-ECNN
    Network structureClassification accuracy /%Time /min
    VGG16-DONN88.53320
    AlexNet-DONN83.811117
    Table 2. Classification accuracy and time of VGG16-DONN and AlexNet-DONN for cat and dog dataset
    AttributeVGG16-DONNVGG16-ECNNAttributeVGG16-DONNVGG16-ECNN
    5_o_Clock_Shadow90.2090.70Mouth_Slightly_Open62.0065.30
    Arched_Eyebrows70.8071.70Mustache96.8096.30
    Attractive68.0071.60Narrow_Eyes87.0086.10
    Bags_Under_Eyes79.6078.60No_Beard86.0086.00
    Bald98.2098.20Oval_Face71.0073.40
    Bangs83.2086.00Pale_Skin95.8095.70
    Big_Lips67.2066.20Pointy_Nose71.8071.00
    Big_Nose81.8080.30Receding_Hairline92.0092.90
    Black_Hair75.8076.00Rosy_Cheeks93.2093.40
    Blond_Hair87.4086.90Sideburns94.6095.30
    Blurry96.2094.60Smiling67.0075.00
    Brown_Hair81.2081.10Straight_Hair79.2077.50
    Bushy_Eyebrows85.0086.40Wavy_Hair66.6073.60
    Chubby95.0095.90Wearing_Earrings79.4078.10
    Double_Chin97.4095.70Wearing_Hat96.0095.80
    Eyeglasses94.6093.70Wearing_Lipstick75.2081.60
    Goatee94.8095.60Wearing_Necklace84.2085.70
    Gray_Hair97.2097.50Wearing_Necktie91.0093.30
    Heavy_Makeup73.8079.00Young77.6076.90
    High_Cheekbones62.8071.20Male77.6084.90
    Table 3. Accuracy of 40 kinds of face attributes obtained by VGG16-DONN and VGG16-ECNN combined with CDP module
    Xingya Zhao, Zhiwei Yang, Jian Dai, Tian Zhang, Kun Xu. VGG16-Based Diffractive Optical Neural Network and Context-Dependent Processing[J]. Acta Optica Sinica, 2022, 42(19): 1920001
    Download Citation