Text size
  • Small
  • Medium
  • Large
  • Standard
  • Blue text on blue
  • High contrast (Yellow text on black)
  • Blue text on beige

    A Framework for Enhanced Text Classification in Sensitivity and Reputation Management

    Sixth BCS-IRSG Symposium on Future Directions in Information Access (FDIA 2015)

    31 August - 4 September 2015, Thessaloniki, Greece


    Graham McDonald



    Freedom of Information (FOI) laws state that government documents should be open to the public. However, many government documents contain sensitive information that is exempt from release. In this PhD programme, we aim to develop a framework that can automatically classify sensitive information in documents. However, automatic classification of sensitive information is a complex task that requires a relative judgement on the effect of a combination of factors. In this paper, we present an overview of the features of sensitivity that we can use to automatically classify documents containing FOI exemptions, such as International Relations. Moreover, we argue that current Named Entity Recognition (NER) approaches to classifying sensitive information are not appropriate for classifying FOI exemptions and, therefore, we need classification models that consider the document’s content and context at the time of classification.


    PDF file PDF Version of this Paper (300kb)