Some applications in the market assist users to correct different writing mistakes, including spelling and grammar errors. However, very rarely these tools are used by school teachers. For most of them, it is still time- consuming and tedious to correct (beginners) student essays.
This talk will introduce some challenges in automatic English text correction. The objective is to present how it is possible to use Python libraries (scikit-learn, SciPy and NumPy) for spotting English mistakes such as: articles, capitalization and spelling.
In order to train and test the classifier, an open dataset will be used: EF- Cambridge Open Language Dataset (https://corpus.mml.cam.ac.uk/efcamdat/). This will allow participants to reproduce all the steps.
During the presentation, the accuracy of the implementation will be compared to at least one commercial application. It will be discussed how this kind of work can bring value to existing educational applications. Limitations and further steps will also be discussed.
This presentation is a successive work from the cooperation of Education First (language teaching institution) engineers and University of Cambridge language researchers.