Published on

Anomaly detection survey

Authors
  • avatar
    Name
    Rammy
    Twitter

Deep Learning defect detection strategies

/static/images/ai/anomaly-detection-survey-solutions.png
  • Siamese Network Classification
    • Unknown defect detection for printed circuit board based on multi-scale deep similarity measure method
      • Idea is to compare a PCB ground truth template with input PCB image to classify if a defect exists.

      • The test was performed on the PCB data set with 6 types of defects including short circuit, open circuit, mouse bite, burr, leak, and copper, and the area under the ROC curve of all types was above 0.92.

/static/images/ai/anomaly-detection-survey-siamese.png

Problems :

  • Real time

  • Small sample : Usually we have small number of defect samples which could lead to over fitting. To avoid this we employ below methods

    • Augmentation
    • Unsupervised / Semi - Supervised
    • Transfer learning
  • Small Target : When the absolute or relative size of target defect is quite small compared to entire input image. For this issue below solutions are employed.

    • Feature fusion
    • Augmentation
    • Multi-scale feature fusion
    • Reduce network down sampling rate, thereby preserve small targets information
    • Using high resolution input
  • Unbalanced sample identification problem : Defect type 1 , type 2, defect free images ...etc these categories are usually unbalanced with defect free images count dominating. This imbalance could lead to one class dominating over others and affect generalization.

    • Solutions :
      • Data level solutions

/static/images/ai/anomaly-detection-survey-problems.png
  • Model level solutions
    • Cost sensitive :
      • Reconstruct the training set: Without changing the existing algorithm, weights are assigned to each sample in the training set according to the different misclassification costs of the samples, and the original sample set is reconstructed according to the weight.

      • Introduce cost-sensitive factors: Assign higher costs to small-class samples and lower costs to large-class samples to balance the difference in the number of samples. Cost sensitive factor includes cost sensitive matrix.

    • Integrated learning: There are two main ways of using ensemble learning for defect detection, respectively:
      • A: Integrated learning + data preprocessing: the typical algorithms include Smote bagging, Smote boost, Easy Ensemble and Balance cascade
      • B: Integrated learning + cost sensitive
    • Converted to anomaly detection problem: When the sample classes are extremely unbalanced, the defect detection problem can be regarded as an anomaly detection problem, and the anomaly detection algorithm (such as One-Class SVM, SVDD, etc.) can be used to establish a single classifier to detect the anomaly points

Datasets

Solutions


Status: #done

Tags: #paper #anomaly_detection

References: