Mahmoud EL-Haj |
|
NLP Research Associate
|
|
Lancaster University
·
School of Computing and Communications
|
Skills (14)
-
28 Questions271 Followers
-
0 Questions12 Followers
-
211 Questions16962 Followers
-
132 Questions10202 Followers
-
15 Questions374 Followers
-
24 Questions264 Followers
-
125 Questions6411 Followers
-
13 Questions178 Followers
-
7 Questions207 Followers
-
0 Questions5 Followers
-
61 Questions2811 Followers
Research experience
-
Teaching: Web Development and Digital Systems Architecture.
-
Teaching: Graduate Teaching/Lab Assistant at Essex University. CE161
-
Teaching: CE154 and CE203 Computer Courses. Which include: Java Programming
-
Sep 2011–
Nov 2011Research: MEDIE Search Engine
National Institute of Informatics · Natural Language Processing · National Institute of InformaticsNLP · TokyoWorking on clustering and summarising the results of the syntactic and semantic search engine (MEDIE) that searches millions of medical journals. -
Jul 2011–
Jul 2011Research: Hadoop Hackathon
Edinburgh University · Informatics · Edinburgh UniversityNLP · EdinburghWorking on extracting useful information from a sea of garbage data (billions of words corpus). The project was a two days hackathon, we were a group of 5 people from 5 different universities. Our group was succesful in extracting (statistically) useful information before the end of the hackathon. No competition between groups as the idea was to learn how to use Hadoop tool. -
Jun 2011–
Dec 2012Research: SKOS-HASSET project
University of Essex · UK Data Archive · http://hassetukda.wordpress.com/United Kingdom · ColchesterThe objective of this project is to bring HASSET, the leading and well-respected English language social science thesaurus, into the Linked Data web. http://hassetukda.wordpress.com/ -
Feb 2011–
Dec 2012Research: Upgrade the Archive's Systems and Preservation Service
UK Data Archive · Digital Preservation and SystemsUK's largest collection of social and economic data -
Jan 2009–
presentResearch: University of Essex
University of Essex · School of Computer Science and Electronic EngineeringUnited Kingdom · Colchester
Education
-
Jan 2009–
Aug 2012Essex University
Arabic Multi-document Text Summarisation · PhDUnited Kingdom · Colchester
Awards & achievements
-
Nov 2009Award: Best Paper Award at LTC 2009 Poznań, Poland.
Other
-
LanguagesArabic, English
-
Scientific MembershipsUniversity Centre for Computer Corpus Research on Language
http://ucrel.lancs.ac.uk/
Publications (17) View all
-
Conference Proceeding: Koulali, R., El-Haj, M., Meziane, A. Arabic Topic Detection using Automatic Text Summarization. The 10th ACS/IEEE ICCSA 2013
Mahmoud El-Haj, Rim KoulaliThe 10th ACS/IEEE ICCSA; 01/2013 -
Conference Proceeding: El-Haj, M., Koulali, R. "KALIMAT a Multipurpose Arabic Corpus" at the Second Workshop on Arabic Corpus Linguistics (WACL-2) 2013
Mahmoud EL-Haj, Rim KoulaliSecond Workshop on Arabic Corpus Linguistics (WACL-2), Lancaster, UK; 01/2013 -
SourceAvailable from: Mahmoud EL-Haj
Conference Proceeding: Assessing Crowdsourcing Quality through Objective Tasks
[show abstract] [hide abstract]
ABSTRACT: Exploring the possibilities and limits of crowd sourcing methods for creating NLP resources is a major research task at the moment. The paper presents experiments on the influence of the presentation method and the payment of the workers, respectively.Language Resources and Evaluation (LREC 2012); 01/2012 -
Conference Proceeding: Exploring Clustering for Multi-document Arabic Summarisation.
Mahmoud El-Haj, Udo Kruschwitz, Chris FoxInformation Retrieval Technology - 7th Asia Information Retrieval Societies Conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011. Proceedings; 01/2011 -
Conference Proceeding: TAC 2011 MultiLing Pilot Overview. In Text Analysis Conference (TAC) 2011
[show abstract] [hide abstract]
ABSTRACT: The Text Analysis Conference MultiLing Pilot of 2011 posed a multi-lingual summarization task to the summarization community, aiming to quantify and measure the performance of multi-lingual, multi-document summarization systems. The task was to create a 240-250 word summary from 10 news texts, describing a given topic. The texts of each topic were provided in seven languages (Arabic, Czech, English, French, Greek, Hebrew, Hindi) and each participant generated summaries for at least 2 languages. The evaluation of the summaries was performed using automatic (AutoSummENG, Rouge) and manual processes (Overall Responsiveness score). The participating systems were 8, some of which providing summaries across all languages. This paper provides a brief description for the collection of the data, the evaluation methodology, the problems and challenges faced, and an overview of participation and corresponding results.Text Analysis Conference (TAC 2011), MultiLing Summarisation Pilot, Maryland, USA.; 01/2011