Developing Aviation ASR and NLP Datasets and Tools

The goal is to create an ATC ASR dataset for open access. We have obtained 300 hours of audio data and processed 30 hours using the bootstrap approach: Using Whisper to provide the initial transcripts, Correcting the transcripts by hired transcriber team, reviewing the corrected transcripts.

Project Details

Campus: Daytona Beach Campus
College: Daytona Beach College of Aviation
Department: DCAAS
Type: Faculty-Staff

Research Team

Principal Investigators

Jianhua Liu
Jianhua Liu

Associate Professor and Program Coordinator

  • Electrical Engineering and Computer Science Dept
  • Daytona College of Engineering

CO-Investigators

Andrew Schneider
Andrew Schneider

Assistant Professor

  • Flight Department
  • Daytona College of Aviation