Introduction

Semantic annotation is a sub-system of the DAS-Tool plugin, focusing on the identification of relevant scene characteristics and further grouping (based on a specific criteria) by means of classification methods. This step will result in a classification map emphasizing the existing categories of objects inside the scene. This procedure entails a compact workflow interconnecting feature extraction and feature classification in such a way to best describe the Sentinel 2 data content characteristics [1].
The analysis can be performed at pixel or patch level.

Pixel level analysis

The workflow of the pixel level analysis is presented below:

Open the product
From the Optical menu select DAS-Tool then select the Semantic Annotation option and the following pop-up window will be displayed
In the I/O Available Values, you should select the name of the source and target products, and if you want to save it, the file format and its destination directory.
In the Feature Extraction, you should activate the Pixel based radio button, then one of the two feature extraction algorithms and the bands on which the process will be applied.
In the last tab, the Feature Classification one, you will choose the classifier and its parameters.
After the Run button is pressed, a new product will appear in the Product Explorer window. This product will have a special band representing the mask of the identified clusters, named Classification_[classifier_name]. The resulted mask will have a number of grey levels equal to the selected number of clusters.
For easier interpretation of the result, the grey levels of the mask could be changed in colours. The easiest way to do this is from Colour Manipulation -> select Table -> choose the desired colours.

At pixel level, the analysis can be made using any combinations of the following feature extraction and classifications techniques.

Feature extraction algorithms

Spectral indexes. The spectral indexes are referring to the combinations of surface reflectance from multiple spectral bands to highlight a specific feature that indicates the presence of vegetation, water, mud, ice, geologic coverage, etc. The indexes computed by the DAS - Tool module are the ones presented in [8].

Values / Parameters: Not applicable

Polar coordinates. To compute the polar coordinates features of a multispectral image, the radiances values of each spectral band is converted into polar coordinates and the image is characterized through a distance ρ and N-1 angles θ, where N represents the number of the spectral bands present in the multispectral image [2].

Values / Parameters:
- Haar histogram bins - the number of elements for which the Haar histogram is calculated; Recommended values:
  
  Range of values 2^2 - 2^16
  
  Default values 32
- Histogram bins - the number of elements for which the spectral histogram is calculated; Recommended values:
  
  Range of values 2^5 - 2^9
  
  Default values 256

Classification algorithms

K-means Classifier. The k-means classifier is an unsupervised learning algorithm which groups the data points into homogeneous subsets. After the user indicates the desired number of subsets, the algorithm's goal is to identify the subsets' centroids that minimizes the distance between same class points.

Values / Parameters:
- Number of clusters - the number of clusters which algorithm should identify; Recommended values:
  
  Range of values 3-10
  
  Default values 7
- Number of iterations - the maximum number of iterations computed by the algorithm to identify the clusters; Recommended values:
  
  Range of values 1-100
  
  Default values 100
- Seed value - the seed used by the random generator to select the initial mean of the clusters; Recommended values:
  
  Range of values -
  
  Default values 1000
- Used distance measure - the distance measure used by the algorithm to identify clusters components; Recommended values:
  
  Range of values Canberra, Chebyshev, Earth movers, Euclidean, Manhattan
  
  Default values Euclidean

Latent Dirichlet Allocation. The Latent Dirichlet Allocation (LDA) approach proposed for DAS-Tool includes the use of k-means algorithms together with the LDA modelling. At first, k-means will be applied directly on the image to perform a rough grouping of pixels (based on high similarity, we release the threshold to ensure convergence) and define the visual words. The second step refers to the application of a Latent Dirichlet Allocation (LDA) model capable of discovering classes with semantic meaning based on the latent information hidden in the scene [6]. LDA is basically a three-level hierarchical Bayesian model for text analysis, formed by words, documents, and corpus. The model assumes that the order of words inside the document and the order of documents inside the corpus are ignored. In addition, each document is defined as a finite random mixture of latent topics. A topic is characterized by a probability distribution over a set of words in the vocabulary.

Values / Parameters:
- Number of visual words - the number of classes for k-means. The centre of each class identified by k-means corresponds to a visual word; Recommended values:
  
  Range of values 100-300
  
  Default values 100
- Number of classes - the number of classes which the algorithm should identify; Recommended values:
  
  Range of values 3-10
  
  Default values 5
- Document size - the size of the document/patch; Recommended values:
  
  Range of values 5-50
  
  Default values 10
- Step to select training set - selection of the each "step"-th document/patch from the collection; Recommended values:
  
  Range of values 1-10
  
  Default values 4

Patch level analysis

The workflow of the patch level analysis is presented below:

Open the product
Open RGB Image Window or display the desired band
Create the training dataset:
- Select the positive examples for each semantic class which you want to identify: Vector -> New Vector Data Container -> Draw Rectangle/ Circles Polygon samples of the selected class
From the Optical menu select DAS-Tool then select the Semantic Annotation option and the following pop-up window will be displayed
In the I/O Available Values, you should select the name of the source and target products, and if you want to save it, the file format and its destination directory.
In the Feature Extraction, you should activate the Patch based radio button, then one of the two feature extraction algorithms and the bands on which the feature will be calculated.
In the last tab, the Feature Classification one, you will choose the classifier and its parameters.
Pressing the Run button will generate a new product consisting of one mask for each identified class.

At patch level, the analysis ca be made using any combinations of the following feature extraction and classifications techniques.

Feature extraction algorithms

Mean and dispersion. Each patch is described by a feature vector consisting of its mean and dispersion on each spectral band.

Histogram. The histogram is the colour histogram extended for the high spectral resolution of multispectral images generating the distribution of the spectral values [3].

Values / Parameters:
- Number of bins - the number of elements for which the spectral histogram is calculated; Recommended values:
  
  Range of values 2^5 - 2^9
  
  Default values 32
- Minimum data value - the minimum value for which the spectral histogram is calculated; Recommended values:
  
  Range of values 2^8 - 2^16
  
  Default values 0
- Maximum data value - the maximum value for which the spectral histogram is calculated; Recommended values:
  
  Range of values 2^8 - 2^16
  
  Default values 255

Gabor. The Gabor features are obtained by space-frequency analysis of the texture, using constant bandwidth filters. The features are the result of a multi-resolution analysis and their vectors consist of the first and second order moments of the Gabor [4].

Values / Parameters:
- Orientations - the number of orientations for which the filter is computed; Recommended values:
  
  Range of values -
  
  Default values 6
- Scales - the number of scales for which the filter is computed; Recommended values:
  
  Range of values -
  
  Default values 4

BoW. The Bag of Words framework computes local features for each patch, then a k-words dictionary is formed collecting k randomly selected local features. Finally, each patch is represented by a k-dimensional descriptor with only one non-zero element, the one related to its closest word.

The BoW - Spectral indexes the features are computed on the image represented by its spectral indices.

Values / Parameters:
- Number of dictionaries - the number of words in the dictionary (default 100); Recommended values:
  
  Range of values 100 - 300
  
  Default values 100
- Dictionary selection method - the method used to select the words in the dictionary; Recommended values:
  
  Range of values Random samples
  
  Default values Random samples

The BoW - Polar coordinates features are computed on its polar coordinates [5].

Values / Parameters:
- Histogram bins - the number of elements for which the histogram is calculated; Recommended values:
  
  Range of values 2^2 - 2^16
  
  Default values 256
- Number of dictionaries - the number of words in the dictionary; Recommended values:
  
  Range of values 100-300
  
  Default values 100
- Dictionary selection method - the method used to select the words in the dictionary; Recommended values:
  
  Range of values Random samples
  
  Default values Random samples

Classification algorithms

K-means Classifier - as presented in the pixel level subsection.

Support Vector Machine. Support vector machines (SVM) is a supervised learning method, becoming a reference algorithm to solve classification problems due to its flexibility and computational efficiency of handling multidimensional data. The SVM implementation used by the DAS-Tool module is LIBSVM [https://www.csie.ntu.edu.tw/~cjlin/libsvm/].

Values / Parameters:
- SVM type - the type of support vector algorithm: classification (C-SVC, nu-SVC), regression (epsilon - SVR, nu-SVR) and distribution estimation (one-class SVM); Recommended values:
  
  Range of values C-SVC, nu-SVC, epsilon - SVR, nu-SVR, one-class SVM
  
  Default values C-SVC
- Kernel type - the type of the kernel function; Recommended values:
  
  Range of values Linear, polynomial, radial, sigmoid
  
  Default values radial
- Polynomial degree - the degree of the kernel function (default 1); Recommended values:
  
  Range of values 1-10
  
  Default values 1
- Gamma - the gamma values of the kernel function (default 0.01); Recommended values:
  
  Range of values 10^(-4) - 10^(4)
  
  Default values 0.01
- Coef - the coef value of the kernel function (default 1.0); Recommended values:
  
  Range of values 10^(-4) - 10^(4)
  
  Default values 1

K-Nearest Neighbour. The k-Nearest Neighbour Classifier (kNN) is a supervised learning method that computes the distance between the unclassified points to all the labelled training data. Further, based on the metric used to measure the distance, the algorithm generates a set consisting of the k-nearest neighbours to the unclassified point. Finally, the most representative class for the chosen neighbours is assigned to the new data [7].

Values / Parameters:
- Number of neighbours - the number of neighbours based on which the label of the test dataset will be decided; Recommended values:
  
  Range of values 3 - 100
  
  Default values 6

User Interface

The Semantic annotation tool can be invoked from the Optical menu, select DAS-Tool -> Semantic Annotation

If the product's bands have the same sizes, selecting the Semantic Annotation command, a dialog box with three tabs will be opened, otherwise the following warning message will appear

I/O Available Values tab allows user to select the product to be analysed and what will happen to the result.

Source Product group
     Name: It allows user to select the product to be analysed.
Target Product group
     Name: It allows user to enter the name of the product resulted after analysis.
     Save as: If the option is checked, the user can select the format and the destination of the file on which the resulted product will be saved on the disk.

Feature Extraction tab tab allows user to select the analysis level (patch/pixel), features used to describe the image characteristics and the required parameters.

Features group
     Computation type: It allows user to select the analysis level: patch or pixel.
     Name: It allows user to select features used to describe the image characteristics. The available features depend on the chosen analysis type (patch/pixel).
Feature settings group
     Raster bands: It allows user to select the spectral bands on which the product analysis will be made.
     Patch size (visible only when the patch based option is selected): Used to select the size of the patch (square area).
     Values area: This area is reserved for the parameters related to the selected features.

Feature Classification tab allows user to select and set the classifier used to analyse the chosen product.

Available classifiers group
     Name: It allows user to select the classifier used to analyse the image. The available classifiers depend on the chosen analysis type (patch/pixel).
Classifiers settings group
     Values area: This area is reserved for the parameters related to the selected classifier.
     Training dataset area (visible only when a supervised classifier is selected - SVM/kNN): It allows user to select the semantic class which the classifier should highlight in the input image.

References

[1] F.A. Georgescu, C. Vaduva, D. Raducanu and M. Datcu Feature Extraction for Patch-Based Classification of Multispectral Earth Observation Images, in IEEE Geoscience and Remote Sensing Letters, Vol. 13, No. 6, pp. 865-869, 2016.

[2] F.A. Georgescu, D. Raducanu, and M. Datcu New MPEG-7 Scalable Color Descriptor Based on Polar Coordinates for Multispectral Earth Observation Image Analysis, IEEE Geoscience and Remote Sensing Letters14.7 (2017): 987-991.

[3] A.M. Ferman, A.M. Tekalp, and R. Mehrotra Robust color histogram descriptors for video segment retrieval and identification, IEEE Transactions on image processing 11.5 (2002): 497-508.

[4] W. Li, K. Mao, H. Zhang, and T. Chai Designing compact Gabor filter banks for efficient texture feature extraction. In 2010 11th International Conference on Control Automation Robotics & Vision (pp. 1193-1197). IEEE. December 2010

[5] S. Cui, G. Schwarz, and M. Datcu Remote sensing image classification: No features, no clustering, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 8.11 (2015): 5158-5170.

[6] C. Vaduva, I. Gavat, and M. Datcu Latent Dirichlet allocation for spatial analysis of satellite images, IEEE Transactions on Geoscience and Remote Sensing 51.5 (2013): 2770-2786.

[7] T. Cover, and P. Hart Nearest neighbor pattern classification, IEEE transactions on information theory 13.1 (1967): 21-27.

[8] G.Marchisio, F. Pacifici, and C. Padwick On the relative predictive value of the new spectral bands in the WorldWiew-2 sensor," in Proceedings of IEEE International Geoscience and Remote Sensing Symposyum (IGARSS), Jul. 2010, pp. 2723-2726.

Range of values	Canberra, Chebyshev, Earth movers, Euclidean, Manhattan
Default values	Euclidean

Range of values	C-SVC, nu-SVC, epsilon - SVR, nu-SVR, one-class SVM
Default values	C-SVC

Range of values	Linear, polynomial, radial, sigmoid
Default values	radial

Range of values	Random samples
Default values	Random samples

Range of values	Random samples
Default values	Random samples