Dr Mahmoud El-Haj

Senior Research Associate

Research Overview

Natural Language Processing (NLP); mainly on corpus and computational linguistics, sing multi-document text summarisation for both Arabic and English, information extractionl, question answering, machine learning, text classification, crowd-sourcing, and creation of NLP resources.

A Comparison Between Genetics Papers Relating to Immune Disorders and Psychiatric Disorders
El-Haj, M., Piao, S.S., Rayson, P.E., Knight, J. 11/09/2017
Poster

Creating and validating multilingual semantic representations for six languages: expert versus non-expert crowds
El-Haj, M., Rayson, P., Piao, S., Wattam, S. 3/04/2017 In: Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications. Association for Computational Linguistics p. 61-71. 11 p. ISBN: 9781945626500.
Conference contribution

Learning tone and attribution for financial text mining
El-Haj, M., Rayson, P.E., Young, S.E., Walker, M., Moore, A., Athanasakou, V., Schleicher, T. 23/05/2016 In: Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA) p. 1820-1825. 6 p. ISBN: 9782951740891.
Conference contribution

Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages
Piao, S.S., Rayson, P.E., Archer, D., Bianchi, F., Dayrell , C., El-Haj, M., Jiménez, R., Knight, D., Křen, M., Lofberg, L., Nawab, R.M.A., Shafi, J., Teh, P.L., Mudraya, O. 23/05/2016 In: LREC 2016, Tenth International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA) p. 2614-2619. 6 p. ISBN: 9782951740891.
Conference contribution

OSMAN: a novel Arabic readability metric
El-Haj, M., Rayson, P.E. 23/05/2016 In: Proceedings of the Language Resources and Evaluation Conference 2016. Slovenia : European Language Resources Association (ELRA) p. 250-255. 6 p. ISBN: 9782951740891.
Conference contribution

Creating language resources for under-resourced languages: methodologies, and experiments with Arabic
El-Haj, M., Kruschwitz, U., Fox, C. 09/2015 In: Language Resources and Evaluation. 49, 3, p. 549-580. 32 p.
Journal article

Does equity analyst research lack rigor and objectivity? Evidence from conference call questions and research notes
Salzedo, C., Young, S., El-Haj, M. 6/08/2014 Lancaster University Management School, 50 p.
Working paper

Computer-based analysis of the strategic content of UK annual report narratives
El-Haj, M., Athanasakou, V., Rayson, P., Young, S., Walker, M. 2014
Conference paper

Detecting document structure in a very large corpus of UK financial reports
El-Haj, M., Rayson, P., Young, S., Walker, M. 2014 In: LREC'14 Ninth International Conference on Language Resources and Evaluation . Reykjavik, Iceland : European Language Resources Association (ELRA) p. 1335-1338. 4 p.
Paper

Language independent evaluation of translation style and consistency: comparing human and machine translations of Camus’ novel “The Stranger”
El-Haj, M., Rayson, P., Hall, D. 2014 In: Text, speech and dialogue. Springer International Publishing p. 116-124. 9 p.
Paper

An experiment in automatic indexing using the HASSET thesaurus
El-Haj, M., Balkan, L., Barbalet, S., Bell, L., Shepherdson, J. 17/09/2013 In: Computer Science and Electronic Engineering Conference (CEEC), 2013 5th. IEEE p. 13-18. 6 p.
Paper

Multi-document multilingual summarization corpus preparation, Part 1: Arabic, English, Greek, Chinese, Romanian
Li, L., Forascu, C., El-Haj, M., Giannakopoulos, G. 08/2013 In: Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization. Sofia, Bulgaria : Association for Computational Linguistics p. 1-12. 12 p.
Paper

Using a keyness metric for single and multi document summarisation
El-Haj, M., Rayson, P. 08/2013 In: Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document summarization . Sofia, Bulgaria : Association for Computational Linguistics p. 64-71. 8 p.
Paper

Arabic topic detection using automatic text summarisation
Koulali, R., El-Haj, M., Meziane, A. 2013 In: Computer Systems and Applications (AICCSA), 2013 ACS International Conference on. IEEE Computer Society p. 1-4. 4 p.
Paper

KALIMAT a multipurpose Arabic corpus
El-Haj, M., Koulali, R. 2013
Conference paper

UKDA keyword indexing with a SKOS version of HASSET thesaurus
El-Haj, M. 2013 Cologne, Germany : iAssist
Working paper

Arabic multi-document text summarisation
El-Haj, M. 2012 University of Essex. 165 p.
Doctoral Thesis

Assessing crowdsourcing quality through objective tasks
Aker, A., El-Haj, M., Albakour, M., Kruschwitz, U. 2012 In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). Istanbul, Turkey : European Language Resources Association (ELRA) p. 1456-1461. 6 p.
Paper

Experimenting with automatic text summarization for Arabic
El-Haj, M., Kruschwitz, U., Fox, C. 2011 In: Human language technology - challenges for computer science and linguistics. Berlin : Springer p. 490-499. 10 p.
Paper

Exploring clustering for multi-document Arabic summarisation
El-Haj, M., Kruschwitz, U., Fox, C. 2011 In: Information Retrieval Technology. Berlin : Springer p. 550-561. 12 p.
Paper

Multi-document Arabic text summarisation
El-Haj, M., Kruschwitz, U., Fox, C. 2011 In: Computer Science and Electronic Engineering Conference (CEEC), 2011 3rd. IEEE p. 365-369. 5 p. ISBN: 9781457713002.
Conference contribution

TAC 2011 MultiLing pilot overview
Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J., Varma, V. 2011 In: Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot. Maryland, USA : TAC 17 p.
Paper

University of Essex at the TAC 2011 Multilingual Summarisation Pilot
El-Haj, M., Kruschwitz, U., Fox, C. 2011 In: Proceedings of the Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot, Maryland, USA. Maryland, USA : TAC 7 p.
Paper

Understanding the Quran: a new grand challenge for computer science and artificial intelligence
Atwell, E., Habash, N., Louw, B., Abu Shawar, B., McEnery, T., Zaghouani, W., El-Haj, M. 2010
Conference paper

Using mechanical Turk to create a corpus of Arabic summaries
El-Haj, M., Kruschwitz, U., Fox, C. 2010 In: Language Resources (LRs) and Human Language Technologies (HLT) for Semitic Languages workshop held in conjunction with the 7th International Language Resources and Evaluation Conference (LREC 2010). Valletta, Malta : LREC 2010 p. 36-39. 4 p.
Conference contribution

Enhancing retrieval effectiveness of diacritisized Arabic passages using stemmer and thesaurus
Hammo, B., Sleit, A., El-Haj, M. 2008 In: The 19th Midwest Artificial Intelligence And Cognitive Science Conference Maics2008. p. 189–196. 8 p.
Conference contribution

Evaluation of query-based Arabic text summarization system
El-Haj, M., Hammo, B. 2008 In: Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on. Beijing, China : IEEE Computer Society p. 1-7. 7 p. ISBN: 9781424445158. Electronic ISBN: 9781424427802.
Conference contribution

Experimenting with automatic summarization of Arabic text
El-Haj, M. 2008
Master's Thesis

Effectiveness of query expansion in searching the Holy Quran
Hammo, B., Sleit, A., El-Haj, M. 2007 In: The Second International Conference on Arabic Language Processing CITALA'07. Rabat, Morocco p. 1-10. 10 p.
Conference contribution