preview

Describe The Procedure We Used To Classify Remote Sensinging Image Scenes

Decent Essays

4.1 Introduction This section describes the procedure we used to classify remote sensing image scenes. We applied different procedures to correctly classify remote sensing scene images. We applied different hand-engineered feature extraction methods, proposed CNN model, applied fine-tuning on VGG16 and ResNet50. We have also used VGG16 and ResNet50 as fixed feature extractor and trained the classifier. The major contributions of this paper are as follows: 1. We gave a brief review on hand engineered feature extraction methods and deep learning procedures. 2. Discussed about transfer learning and its application in different situations in deep learning. 3. We fine-tuned two popular deep learning models named VGGNet16 and RESNet50 on …show more content…

The input image is then zero padded so that the size of feature map remains same after convolution. The padded image then convolved with 64 kernels and produces 64 feature map of 224x224 size. The feature maps then max-pooled from 2x2 grid with 2x2 stride and produces 64 feature maps of 112x112 in size. The feature maps found from the previous max-pooling layer is zero padded and convolved with 128 kernels of 64x3x3 size which produces 128 feature maps of 112x112 size. The input is then max-pooled and produces output of 56x56 sized 128 feature maps. 56x56 sized 128 feature maps then zero padded and convolved with 128x3x3 sized 256 kernels and produce output of 56x56 sized 256 feature maps. 256 feature maps are then max-pooled (2x2 grid with 2x2 stride) and produces 256 feature maps of size 28x28 as output. ; Figure 4.1: Proposed CNN model 256 feature maps then zero padded and convolved with 512 kernels of 256x28x28 sized. This fourth convolution produces 512 feature maps of 28x28 as output. The output then max-pooled (2x2 grid with 4x4 stride) by producing 512 feature maps size as 7x7. A flatten layer is then applied to produce 1 dimensional vector from 512x7x7 input. A fully-connected layer with 256 then added which is connected with every neurons of previous layer. A dropout layer of probability 0.5 is added just after first fully connected layer to reduce overfitting. Another fully connected layer with 256 neurons follows first dropout

Get Access