ing at the specific application of building detection, a sparse representation-based full-reference quality assessment method for distorted satellite stereoscopic images is proposed. First, a new distorted satellite stereo image database is constructed, in which the corner detection and the digital surface model elevation information are used for building detection. And a detection accuracy index is proposed to represent the degree of distortion based on the change of the detected corners. Then, an objective evaluation model based on sparse representation is proposed, which extracts scale-invariant feature transforms and binary robust invariant scalable key points of the original and the distorted images for dictionary learning. Four quality scores are obtained using sparse representation to measure the similarity between the original and the distorted images. Finally, the final objective assessment value is obtained by fusing the four quality scores using support vector regression. The test is carried out on the constructed database. The test results on the constructed database show that the Pearson linear correlation coefficient is higher than 0.90, and the Spearman rank correlation coefficient is higher than 0.87. Compared with the existing assessment methods, the proposed objective evaluation method can better reflect the quality of satellite stereo images.
.ing at the problem that the accuracy and real-time of multi-target detection in complex and large scenes are difficult to balance in the existing target detection algorithms, we imitate the human visual mechanism inspired by the convolution kernel shape of the deep neural network. The target detection framework——the single shot multi-box detection (SSD) based on deep learning is improved, and a multi-target detection framework adaptive perceive SSD is proposed, which is specially used for the multi-target detection in complex and large traffic scenes. A feature convolution kernel library composed of multi-form Gabor and color Gabor is designed. The optimal feature extraction convolution kernel group is trained and screened to replace the low-level convolution kernel group of the original network, and effectively improves the detection accuracy. A single image detection framework is combined with a convolution long-short-term memory network, and the temporal association of network frame-level information is realized by extracting the characteristic mapping between propagation frames with a bottleneck-long-term and short-term memory layer. And the calculation cost is reduced, and the tracking and identification of targets affected by the strong interference in the video are realized. An adaptive threshold strategy is added to reduce the rate of missing and false alarms. The experimental results show that compared with other target detection frameworks based on deep learning, the average accuracy of various target recognition is increased by 9%~16%, the average accuracy is increased by 14%~21%, the multi-target detection rate is increased by 21%~36%, and the detection frame rate reaches 32 frame·s -1, which achieves a balance between the accuracy and real-time performance of the algorithm and achieves better detection and recognition results.
.