Applying Distilled BERT for Question Answering on ASRS Reports - ENAC - École nationale de l'aviation civile
Conference paper, Year: 2020

Applying Distilled BERT for Question Answering on ASRS Reports

Samuel Kierszbaum
Laurent Lapasset

Abstract

This paper applies BERT (Bidirectional Encoder Representations from Transformers), a language model fine-tuned on the question answering task, to the free-text reports of the Aviation Safety Reporting System (ASRS) dataset, which describe incident occurrences in an international aviation safety context. A four-step method is used to evaluate the results produced. The paper outlines the limitations of this approach, as well as its usefulness for extracting information from thirty randomly selected free-text reports when asking the question: “When did the incident happen?”. Our aim is to integrate one of the algorithms resulting from recent advances in Natural Language Processing (NLP) to leverage information contained in natural language narratives, rather than working directly with the structured part of the ASRS dataset. We find that our approach yields interesting results, with roughly seventy percent correct answers, including answers containing information that does not overlap with the ASRS dataset's metadata.
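To illustrate the kind of pipeline described in the abstract, the sketch below, which is not the authors' code, queries a distilled BERT model fine-tuned for extractive question answering against an incident narrative. It assumes the Hugging Face transformers library and the publicly available distilbert-base-cased-distilled-squad checkpoint; the example narrative is invented for illustration and is not a real ASRS report.

from transformers import pipeline

# Load a DistilBERT checkpoint fine-tuned on SQuAD-style extractive QA
# (assumed model name; the paper does not specify the exact checkpoint).
qa = pipeline("question-answering",
              model="distilbert-base-cased-distilled-squad")

# Illustrative ASRS-style free-text narrative (not a real report).
narrative = ("During climb out at approximately 1430 local time, "
             "we received a TCAS resolution advisory and responded "
             "per procedure before continuing to the destination.")

# The question evaluated in the paper.
result = qa(question="When did the incident happen?", context=narrative)
print(result["answer"], result["score"])

On a narrative like this, such a model would be expected to return a span like "approximately 1430 local time" together with a confidence score; the paper's four-step method then assesses whether spans of this kind constitute correct answers.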


Main file: camera-ready.pdf (142.69 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03094753 , version 1 (06-01-2021)


Cite

Samuel Kierszbaum, Laurent Lapasset. Applying Distilled BERT for Question Answering on ASRS Reports. NTCA 2020 New Trends in Civil Aviation, Nov 2020, Prague, Czech Republic. pp.33-38, ⟨10.23919/ntca50409.2020.9291241⟩. ⟨hal-03094753⟩