Home

Awesome

Industrial ML Datasets

The following is a curated list of datasets, publically available for machine learning research in the area of manufacturing.

For more information, please check our corresponding publication:

@inproceedings{jourdan_machine_2021,
 title = {Machine {Learning} for {Intelligent} {Maintenance} and {Quality} {Control}: {A} {Review} of {Existing} {Datasets} and {Corresponding} {Use} {Cases}},
 volume = {2},
 journal = {Proceedings of the 2nd Conference on Production Systems and Logistics},
 author = {Jourdan, Nicolas and Longard, Lukas and Biegel, Tobias and Metternich, Joachim},
 year = {2021},
}

Some additional datasets may be found here: Link

:heavy_check_mark: indicates a preset split between training and testing data.

:globe_with_meridians: indicates, that the test set labels are hidden behind an evaluation server.

Predictive Maintenance and Condition Monitoring

NameYearFeature TypeFeature CountTarget VariableInstancesOfficial Train/Test SplitData SourceFormatLicenseAccess
Wood veneers before and after drying <br> This dataset consists of 2579 image pairs (5158 images in total) of wood veneers before and after drying.2021Image>4000x4000-5.158:x:RealPNGCC BY 4.0Link
Diesel Engine Faults Features <br> Fault detection based on pressure curves and vibration.2020Signal84C (4)3.500:x:SyntheticMATCC BY 4.0Link
Degradation of a Cutting Blade <br> Wrapping machine process data over 12 months with a degrading cutting tool.2019Signal9C (8) / R1.062.912:x:RealCSVCC BY-SA 3.0Link
CNC Mill Tool Wear <br> CNC process data of wax milling with worn/unworn tools.2018Signal48C (3*2)25.286:x:RealCSVCC0: Public DomainLink
Condition Monitoring of Hydraulic Systems <br> Test rig process data of multiple load cycles with various fault types and severity levels.2018Signal17C (5*(24))2.205:x:RealNon-Standard?Link
Production Plant Data for Condition Monitoring <br> nonymized process data of component run-to-failure experiments.2018Signal26-228.414:x:RealCSVCC BY-SA 3.0Link
Versatile Production <br> Popcorn production process data with multiple process steps.2018Signal5-85-80.000:x:RealCSVCC BY-NC-SA 4.0Link
Degradation Measurement of Robot Arm Position Accuracy <br> Target- and actual values of robotic arm tool position, velocity and current for health assessment.2017Signal73-155.000:x:RealCSV?Link
APS Failure at Scania Trucks <br> Anonymized counters and histograms for air pressure system fault detection.2016Signal170C (2)76.000:heavy_check_mark:RealCSVGNU General Public LicenseLink
Maintenance of Naval Propulsion Plants <br> Gas turbine process data for component decay state prediction.2016Signal16R11.934:x:SyntheticNon-Standard<details><summary> More Information </summary>Use of this dataset in publications must be acknowledged by referencing the following publication: <br> A. Coraddu, L. Oneto, A. Ghio, S. Savio, D. Anguita, M. Figari, Machine Learning Approaches for Improving Condition?Based Maintenance of Naval Propulsion Plants, Journal of Engineering for the Maritime Environment, 2014, DOI: 10.1177/1475090214540874, (In Press)</details>Link
Plant Fault Detection <br> Anonymized process data for plant fault detection.2015Signal10C (6)8.938.370:x:RealCSV?Link
Asset Failure and Replacement <br> Anonymized data for asset fault detection.2014Signal1C (2)447.341:heavy_check_mark: :globe_with_meridians:RealCSV?Link
Maintenance Action Recommendation <br> Anonymized process and maintenance data of an industrial asset for maintenance action recommendation.2013Signal32C (14)2.097.152:heavy_check_mark: :globe_with_meridians:RealCSV?Link
Anemometer Fault Detection <br> Anemometer measurements for fault detection.2011Signal16 <br> 16-20-345.700 <br> 208.800:heavy_check_mark: :globe_with_meridians:RealNon-Standard?Link
Gearbox Fault Detection <br> Test rig accelerometer data for fault detection.2009Signal3-> 10 Mio.:x:RealCSV?Link
Li-Ion Battery Aging <br> Battery test rig data during charge and discharge cycles for degradation detection.2008Signal12-2.167:x:RealMATN/ALink
Turbofan Engine Degradation Simulation <br> C-MAPSS simulation sensor data of various conditions and fault modes.2008Signal26-262.256:heavy_check_mark:SyntheticNon-Standard?Link
Bearing <br> Bearing test rig accelerometer data of run-to-failure experiments.2007Signal4-8-61.440:x:RealCSV?Link
Milling <br> Milling process- and external sensor data for tool wear detection.2007Signal13R1.503.000:x:RealMAT?Link
CWRU Bearing Data <br> Bearing test rig accelerometer data for fault detection.n.A.Signal5C (2)> 10 Mio.:x:RealMAT?Link

Process Monitoring

NameYearFeature TypeFeature CountTarget VariableInstancesOfficial Train/Test SplitData SourceFormatLicenseAccess
Skoltech Anomaly Benchmark (SKAB) <br> Time-series data from water circulation loop testbed for evaluating anomaly detection algorithms.2020Signal8C (2)34×1,200:heavy_check_mark:RealCSVGNU GPL v3.0Link
High Storage System Anomaly Detection <br> Storage test rig process data for anomaly detection.2018Signal20C (2)91.000:x:SyntheticCSVCC-BY-NC-SA 4.0Link
Genesis Pick-and-Place Demonstrator <br> Material sorting test rig process data for anomaly detection.2018Signal23C (3)32.440:x:RealCSVCC-BY-NC-SA 4.0Link
Tennessee Eastman Process Simulation Dataset <br> Simulated chemical process data for anomaly detection with different fault types.2017Signal51C (21) / R> 10 Mio.:heavy_check_mark:SyntheticRData<details><summary> More Information </summary> The person who owns, created, or contributed a work to the data or work provided here dedicated the work to the public domain and has waived his or her rights to the work worldwide under copyright law. You can copy, modify, distribute, and perform the work, for any lawful purpose, without asking permission.</details>Link
Robot Execution Failures <br> Force and torque measurements of an industrial robot with different erroneous operating conditions.1999Signal89C (13)463:x:RealNon-Standard?Link
Mechanical Analysis <br> Vibration measurements of electromechanical devices with different erroneous operating conditions.1990Signal7C (6)209:heavy_check_mark:RealMAT?Link
CWRU Bearing Data Bearing test rig accelerometer data for anomaly detection.n.A.Signal5C (2)> 10 Mio.:x:RealMAT?Link

Predictive Quality and Quality Inspection

NameYearFeature TypeFeature CountTarget VariableInstancesOfficial Train/Test SplitData SourceFormatLicenseAccess
Casting Product Quality Inspection <br> Grayscale images of pump impeller castings with and without defects.2020Image300x300 <br> 512x512C (2)7.348:heavy_check_mark:RealJPGCC-BY-NC-ND 4.0Link
GC10-DET Defect Location for Metal Surface <br> Grayscale images of metal surfaces with various defect types and corresponding bounding box annotations.2020ImageVaryingC (10)3.570:x:RealJPG, XML?Link
Mechanic Component Images <br> Grayscale images of air conditioner pistons with various defect types.2020Image86x90C (3)285:x:RealPNG?Link
Multi-Stage Continuous Flow Process <br> Anonymized process data of a production line with quality measurements of part dimensions.2020Signal116-14.088:x:RealCSV?Link
Plastic Extrusion Defects <br> Process data of a plastic extrusion process.2020Signal470-226.536:x:RealCSVCC BY-NC-ND 4.0Link
AITEX <br> Grayscale images of textile fabrics with various defect types and corresponding segmentation masks.2019Image4096x256C (13)245:x:RealPNG, Mask?Link
Deep PCB <br> Grayscale images of circuit boards with various defect types and corresponding bounding box annotations.2019Image640x640C (7)1.500:heavy_check_mark:RealJPG, Maskonly for research purposeLink
Severstal Steel Defect Detection <br> Grayscale images of steel surfaces with various defect types and corresponding segmentation polygons.2019Image1600x256C (5)18.074:heavy_check_mark: :globe_with_meridians:RealJPG, CSV?Link
Turning Dataset for Chatter Diagnosis <br> Sensory data of a turning test rig and varying strengths of chatter.2019Signal8C (4)> 10 Mio.:x:RealMATCC BY 4.0Link
Magnetic Tile Defect <br> Grayscale images of magnetic tile surfaces with various defect types and corresponding segmentation masks.2018Image248x373C (6)1.344:x:RealJPG, PNG?Link
TIG Welding <br> Grayscale images of a welding process with various defect types.2018Image800x974C (6)33.254:heavy_check_mark:RealPNG, JSONCC BY-SA 4.0Link
Mining Process <br> Process data of a mining process for impurity prediction in ore concentrate.2017Signal24R737.454:x:RealCSVCC0: Public DomainLink
Bosch Production Line Performance <br> Anonymized process data of production lines with and without defects.2016Signal4264C (2)2.368.435:heavy_check_mark: :globe_with_meridians:RealCSV?Link
WM811K Wafer Maps <br> Defect matrices of semiconductor wafers with various defect types.20142D Defect MatrixVaryingC (9)811.457:x:RealMAT?Link
NEU Surface Defect Database <br> Grayscale images of metal surfaces with various defect types and corresponding bounding box annotations.2013Image200x200C (6)1.800:x:RealBMP, XML?Link
Steel Plate Faults <br> Geometric measurements of steel plates with various defect types.2010Signal27C (7)1.941:x:RealCSV?Link
HCI Industrial Optical Inspection <br> Synthetic grayscale images of textured surfaces with corresponding defect ellipses.2007Image512x512C (2)16.100:heavy_check_mark:SyntheticPNG, Non-Standard?Link

Process Parameter Optimization

NameYearFeature TypeFeature CountInstancesOfficial Train/Test SplitData SourceFormatLicenseAccess
Laser Welding <br> Process parameter recordings for correlation with weld quality indicators such as weld depth and geometrical dimensions.2020Signal13361:x:RealXLSCC BY 4.0Link
3D Printer <br> Process parameters of a 3D printer for correlation with print quality indicators such as roughness, tension and elongation.2018Signal1250:x:RealCSV?Link
Tool Path Generation <br> Shape deviation measurements and corresponding simulated cutting conditions.2018Signal94.968:x:RealCSVCC BY 4.0Link
Mercedes-Benz Greener Manufacturing <br> Car feature configurations to be correlated with the required test time of the configurations.2017Signal3788.420:heavy_check_mark: :globe_with_meridians:RealCSV?Link
SECOM <br> Semiconductor process measurements and corresponding yields for determination of key factors to yield.2008Signal5911.567:x:RealNon-Standard?Link