• Laser & Optoelectronics Progress
  • Vol. 58, Issue 4, 0410012 (2021)
Youwen Huang, Bin Zhou*, and Xin Tang
Author Affiliations
  • School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000, China
    DOI: 10.3788/LOP202158.0410012
    Youwen Huang, Bin Zhou, Xin Tang. Text Image Generation Method with Scene Description[J]. Laser & Optoelectronics Progress, 2021, 58(4): 0410012
    Fig. 1. Generation network model
    Fig. 2. Mask generation network
    Fig. 3. Discrimination network model
    Fig. 4. Layout discriminator
    Fig. 5. Comparison results of same description
    Fig. 6. Comparison results after adding objects
    Fig. 7. Comparison results of predicted mask
    t                              0.3   0.4   0.5   0.6   0.7
    Number of objects              156   151   146   127   103
    Number of relationship types   38    38    37    30    24
    Table 1. Preprocessing results under different threshold values
    Model                          IS           FID
    Real image (64×64)             13.90±0.50   0
    Proposed model (no D_layout)   6.72±0.24    57.48
    Proposed model (no G_mask)     6.69±0.14    61.34
    Proposed model (full model)    7.11±0.14    42.20
    Sg2im[11]                      6.30±0.20    73.39
    StackGAN[8]                    6.35±0.16    108.68
    AttnGAN[10]                    6.38±0.22    96.40
    Table 2. Comparison results of IS and FID values
    Model    Proposed model (full model)   Sg2im[11]   StackGAN[8]   AttnGAN[10]
    Time/s   0.0278                        0.0216      0.0634        0.0302
    Table 3. Comparison results of image generation time