2022 PHM Conference Data Challenge

Objective

This year’s data challenge addresses the problem of fault classification for a rock drill application under different individual configurations of the rock drill. The task is to develop a fault diagnosis/classification model using the provided pressure sensor data as input. The training data consists of data from various faults from five individual configurations, while the testing data for the online leaderboard is blind and is from one individual configuration of the rock drill. A final validation data set for the final scoring for the competition will be from two individual configurations from the rock drill and the labels will be blind to the contest participants. For both the testing data for the online leaderboard and the final validation data set, a reference condition from a no-fault health condition will also be provided.

The training data set contains data from 11 different fault classification categories, in which 10 are different failure modes and one class is from the healthy/no fault condition. The task is to train a model to classify the fault conditions using the training data, and to test this model on the testing data, in which the one submission per day can be used for submitting results to the online leaderboard. Validation is done with a validation data set that will be released for a one-time assessment at the end of the data challenge. Scoring of performance is done through this web interface.

The top three teams will be asked to submit their approach as a journal paper to the International Journal of Prognostics and Health Management (IJPHM) and present their approach and results at the 2022 PHM Society Conference. Top teams are also being recognized at the banquet and will receive a plaque.

PHM2022 Data Challenge Leaderboard

PHM 2022 Data Challenge
Final Results
Rank	Submitting User	Team Name and Members	Accuracy
1st Place	jinoh	Team name: Team-Nuri Organization: Seoul National University, Laboratory for System Health & Risk Management Jinoh Yoo Jongmin Park Sangkyung Lee Hye Jun Oh Minseok Chae	100.00%
--	linghuixuan	Disqualified from competition (unable to attend conference in person)	99.97%
2nd Place	wwaatt301	Team name: SWING Organization: Seoul National University, Laboratory for System Health & Risk Management Jin Uk Ko Jinwook Lee Taehun Kim Yong Chae Kim Keon Kim	99.77%
3rd Place	jmontoya	Team name: MathWorks Organization: MathWorks Jose Montoya Bedoya Shyam Joshi Mahaveer Satra Russell Graves	99.04%
5	ashitg		98.09%
6	igorgo		97.80%
7	jmpion		97.38%
8	mokeeffe7		93.61%
9	asl79		92.76%
10	osarenren		91.66%
11	minamitu		89.95%
12	martinvlh		86.17%
13	jeremy		84.62%
14	hsuch		82.90%
15	seoka		82.04%
16	jacobchesnes		79.63%
17	mtumcakir		60.64%
18	ebaumann		49.71%

Teams

Collaboration is encouraged and teams may be comprised of one or more students and professionals. The teams judged to have the first, second, and third best scores will be awarded prizes, contingent upon:

Having at least one member of the team register and attend the PHM 2022 Conference.
Submitting a journal paper for the IJPHM journal.
Presenting the analysis results and technique employed at a special session within the conference program.

The organizers of the competition reserve the right to both modify these rules and disqualify any team for any efforts it deems inconsistent with fair and open practices.

Data Challenge Registration

Teams may register here. Please note that the registration is a 2 steps process:
1) You must first log in with a PHM user account to complete the form. If you need to create one here –> Sign Up
2) Once logged in, you need to complete the Data Challenge Application form. Then, we will grant your user access to use the Data Challenge submission area.

Please note: In the spirit of fair competition, we allow only one account per team. Please do not register multiple times under different user names, under fictitious names, or using anonymous accounts. Competition organizers reserve the right to delete multiple entries from the same person (or team) and/or to disqualify those who are trying to “game” the system or using fictitious identities.

Relevant Dates

Key PHM Data Challenge Dates
Competition Open – the following information will be posted: Challenge description Training data Scoring definition Online Leaderboard	June 3, 2022
Final Validation Data Posted	July 29, 2022
Competition Closed	August 12, 2022 (11:59:59 pm PDT)
Preliminary Winners Announced	August 19, 2022
Conference Presentation Due, Winners Announced	September 13, 2022
PHM Conference Dates	November 1 – November 4, 2022

Data Challenge Details

The details about this year’s data challenge can be found in this document. For each CSV file that is provided, the first column is the label for that cycle, in which a label of 1 represents the no-fault condition, a label of 2-11 correspond to different fault types. A label of 0, which is present for the testing data for the online leaderboard and the final validation data set, indicate an unknown/blind label, in which the objective is to predict the label for each cycle. The columns of data after the label represent the pressure signal data for that cycle.

Dataset

Training and Testing Dataset

The training and test datasets are available here: Training Data Set, Testing Data Set

Validation Dataset

The one-time validation data set is available here: Validation Data Set
NOTE: The official scoring for the competition is based solely on the validation data set.

Data Challenge Submission

For Testing Dataset:

Please upload your submission in the Data Challenge submission area. Note that you will only see the Submission area if you have previously applied using the application form and we have granted you access

Please ensure the filename is yourusername.txt or else the automated scorer will not be able to read your file and score your algorithm and the conference organizers will not be able to consider your submission as a valid submission. Please ensure that your results are in the format shown in this Data_Challenge_PHM2022_Example_Submission.txt file which contains only 3184 values in a column vector format.

Also, you only can submit (or overwrite) the yourusername.txt file for your submission as often as you’d like. The system will automatically score the most current submission twice daily, at noon and midnight US Pacific Daylight Time (0700 and 2100 UTC). You may continue to upload submission of testing data to the submission area after the validation data is available, and they will continue to be scored and displayed.

For Validation Dataset:

Please upload your submission in the Data Challenge Validation submission area. Note that you will only see the Submission area if you have previously applied using the application form and we have granted you access

Please ensure the filename is yourusername.txt or else the automated scorer will not be able to read your file and score your algorithm and the conference organizers will not be able to consider your submission as a valid submission. Since the validation data consists of two individual configurations (#7 and #8), please combine the results into a single file. The first 7935 rows should correspond to #7 and the last 8461 rows corresponding to #8, for a total of 16396 rows. A format checker has been implemented that displays (in no particular order, other than alphabetically) a table of all current entries and whether or not the system recognizes it as correctly formatted. This checker runs at 30 minutes after each hour.

Also, you only can submit (or overwrite) the yourusername.txt file for your validation submission as often as you’d like. However, only the most current submission at 23:59:59 US PDT on August 12, 2022 (August 13, 2022, 06:59:59 UTC), will be scored for the competition.

Search