Saeed-Ul Hassan

Saeed-Ul Hassan

About

135
Publications
50,797
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
3,690
Citations

Publications

Publications (135)
Article
Full-text available
The growing prevalence of text reuse and plagiarism in various fields has led to an urgent need for reliable computational methods for detection. However, current commercial plagiarism detection systems are ineffective in identifying paraphrased cases of text reuse, highlighting the need for improvement. Previous research on paraphrased text reuse...
Article
Full-text available
Poetry represents the oldest and most esteemed literary form, allowing poets to convey ideas while carefully attending to elements such as meaning, coherence, poetic quality, and fluency. Notably, the creation of good poetry entails considerations of rhyme and meter. With the advent of artificial intelligence (AI), significant advancements have bee...
Article
Full-text available
Machine translation has revolutionized the field of language translation in the last decade. Initially dominated by statistical models, the rise of deep learning techniques has led to neural networks, particularly Transformer models, taking the lead. These models have demonstrated exceptional performance in natural language processing tasks, surpas...
Article
Full-text available
Simplifying summaries of scholarly publications has been a popular method for conveying scientific discoveries to a broader audience. While text summarization aims to shorten long documents, simplification seeks to reduce the complexity of a document. To accomplish these tasks collectively, there is a need to develop machine learning methods to sho...
Article
Full-text available
Image caption generation has emerged as a remarkable development that bridges the gap between Natural Language Processing (NLP) and Computer Vision (CV). It lies at the intersection of these fields and presents unique challenges, particularly when dealing with low-resource languages such as Urdu. Limited research on basic Urdu language understandin...
Article
Graph representation methods have recently become the de facto standard for downstream machine learning tasks on graph-structured data and have found numerous applications, e.g., drug discovery & development, recommendation, and forecasting. However, the existing methods are specially designed to work in a centralized environment, which limits thei...
Article
Non-textual document elements such as charts, diagrams, algorithms and tables play an important role to present key information in scientific documents. Recent advances in information retrieval systems tap this information to answer more complex user queries by mining text pertaining to non-textual document elements from full text. Algorithms are c...
Article
Full-text available
The quality of scientific publications can be measured by quantitative indices such as the h‐index, Source Normalized Impact per Paper, or g‐index. However, these measures lack to explain the function or reasons for citations and the context of citations from citing publication to cited publication. We argue that citation context may be considered...
Article
Full-text available
Automatic paraphrase detection is the task of measuring the semantic overlap between two given texts. A major hurdle in the development and evaluation of paraphrase detection approaches, particularly for South Asian languages like Urdu, is the inadequacy of standard evaluation resources. The very few available paraphrased corpora for these language...
Article
Full-text available
Mega-events have always been an attractive topic for sports management academics. We used scientometric software packages to look at the studies on this topic that have been added to the Web of Science database in the last 68 years. Not only did we give an overview of the background information of the researchers, the status of their collaborations...
Article
Full-text available
Urdu is morphologically rich language and lacks the resources available in English. While several studies on the image captioning task in English have been published, this is among the pioneer studies on Urdu generative image captioning. The study makes several key contributions: (i) it presents a new dataset for Urdu image captioning, and (ii) it...
Article
This article aims to exploit social exchanges on scientific literature, specifically tweets, to analyse social media users’ sentiments towards publications within a research field. First, we employ the SentiStrength tool, extended with newly created lexicon terms, to classify the sentiments of 6,482,260 tweets associated with 1,083,535 publications...
Article
Shape matching is a long-studied problem and lies at the core of many applications in statistical shape analysis, virtual reality and human-computer interaction. This paper presents an automatic dense correspondence method to match the mesh vertices of two 3D shapes under near-isometric and non-rigid deformations. The goal is achieved by combining...
Article
Containerized workloads are gaining traction due to microservices architecture adaptation in many fields, including healthcare, finance, Internet of Things, and smart cities. Modern data centers are containerized to facilitate this growing demand. Most of the existing resource allocation methods for data centers used efficient scheduling algorithms...
Article
Purpose This study aims to provide guidelines for exploring the research landscape in developing countries by gauging the prospects of growth, research impact and innovation. This study interrogates, analyses and visualizes the impact, nuances and evolution of stated research themes. For this purpose, this study presents an in-depth analysis of pub...
Article
Full-text available
Analysing big data job posts in Saudi cyberspace to describe the future market need for sustainable skills, this study used the power of artificial intelligence, deep learning, and big data technologies. The study targeted three main stakeholders: students, universities, and job providers. It provides analytical insights to improve student satisfac...
Article
This research provides a comprehensive, first-of-its-kind, in-depth, data-driven analysis of the discussions on "curriculum alignment" in the light of "learned skills" and "acquired skills", as illustrated by cross-disciplinary records in Scopus. It was undertaken from 2010 to 2021 on 10,214 data points obtained to fully grasp the issues, names and...
Article
This paper proposes two novel approaches to measure the similarity of co-cited authors for the task of document clustering, a) paragraph-level content-based author co-citation analysis (PCACA) and b) section-level content-based author co-citation analysis (SCACA), by mining the textual cited reference at the paragraph and the section level within a...
Article
Full-text available
We argue that social computing and its diverse applications can contribute to the attainment of sustainable development goals (SDGs)—specifically to the SDGs concerning gender equality and empowerment of all women and girls, and to make cities and human settlements inclusive. To achieve the above goals for the sustainable growth of societies, it is...
Article
The authorship identification task aims at identifying the original author of an anonymous text sample from a set of candidate authors. It has several application domains such as digital text forensics and information retrieval. These application domains are not limited to a specific language. However, most of the authorship identification studies...
Article
Full-text available
Student retention is a widely recognized challenge in the educational community to assist the institutes in the formation of appropriate and effective pedagogical interventions. This study intends to predict the students at-risk of low performances during an on-going course, those at-risk of graduating late than the tentative timeline and predictin...
Article
Full-text available
Malaria microscopy, microscopic examination of stained blood slides to detect parasite Plasmodium, is considered to be a gold standard for detecting life-threatening disease malaria. Detecting the plasmodium parasite requires a skilled examiner and may take up to 10 to 15 minutes to completely go through the whole slide. Due to a lack of skilled me...
Preprint
Full-text available
Scientific contribution and research performance of a university, research group, or institute needs to be evaluated all the more with the increasing volume and fast-developing disciplines of research. The need of the time is to develop tools for strategic planning and management that will help research bodies to rank and benchmark themselves again...
Preprint
Full-text available
In this study, we use research publication data from the field of social science to identify collaboration networks among social science research communities of India and Pakistan. We have used Scopus database to extract information of social science journals for both countries India and Pakistan. Study of this data is significant as both countries...
Preprint
Full-text available
This paper deploys bibliometric indices and semantic techniques for understanding to what extent research grants are likely to impact publications, research direction, and co-authorship rate of principal investigators. The novelty of this paper lies within the fact that it includes semantic analysis in the research funding evaluation process in ord...
Article
Automatic recommendation of citations has been a focal point of research in scholarly digital libraries. Many graph-based citation recommendation algorithms have been proposed; however, most of them utilize local citation behavior from the citation network that results in recommending papers in the same proximity as the query article. In this paper...
Article
Full-text available
Students’ evaluation of teaching, for instance, through feedback surveys, constitutes an integral mechanism for quality assurance and enhancement of teaching and learning in higher education. These surveys usually comprise both the Likert scale and free-text responses. Since the discrete Likert scale responses are easy to analyze, they feature more...
Article
Full-text available
Finding an optimal location for a future facility amidst existing sites is a challenging task—and potentially has numerous applications in resource planning. Formally, given a set of candidate sites S and existing sites E, the Optimal Site Selection (OSS) problem aims to find the optimal location s∈S\documentclass[12pt]{minimal} \usepackage{amsmath...
Article
Full-text available
Ethereum, the second-largest cryptocurrency after Bitcoin, has attracted wide attention in the last few years and accumulated significant transaction records. However, the underlying Ethereum network structure is still relatively unexplored. Also, very few attempts have been made to perform link predictability on the Ethereum transactions network....
Chapter
The ability to automatically identify causal relations from surgical textbooks could prove helpful in the automatic construction of ontologies for dentistry and building learning-assistant tools for dental students where questions about essential concepts can be auto-generated from the extracted ontologies. In this paper, we propose a neural networ...
Article
Full-text available
We investigated the scientific research dissemination by analyzing the publications and citation data, implying that not all citations are significantly important. Therefore, as alluded to existing state-of-the-art models that employ feature-based techniques to measure the scholarly research dissemination between multiple entities, our model implem...
Article
Full-text available
Machine learning specific scholarly full-text documents contain a number of result-figures expressing valuable data, including experimental results, evaluations, and cross-model comparisons. The scholarly search system often overlooks this vital information while indexing important terms using conventional text-based content extraction approaches....
Preprint
Full-text available
Machine learning specific scholarly full-text documents contain a number of result-figures expressing valuable data, including experimental results, evaluations, and cross-model comparisons. The scholarly search system often overlooks this vital information while indexing important terms using conventional text-based content extraction approaches....
Article
In the context of smart city research, finding patterns in crime data to explore trends in crime and to locate the presence of crime has been an exciting research field. Our aim in this paper is optimizing the location of police substations within the jurisdiction of a police station so that law enforcement agencies could work efficiently. These su...
Article
Full-text available
Most existing studies are focused on popular languages like English, Spanish, Chinese, Japanese, and others, however, limited attention has been paid to Urdu despite having more than 60 million native speakers. In this paper, we develop a deep learning model for the sentiments expressed in this under-resourced language. We develop an open-source co...
Article
Full-text available
In large-scale collaborative software development, building a team of software practitioners can be challenging, mainly due to overloading choices of candidate members to fill in each role. Furthermore, having to understand all members’ diverse backgrounds, and anticipate team compatibility could significantly complicate and attenuate such a team f...
Article
Full-text available
Most existing studies are focused on popular languages like English, Spanish, Chinese, Japanese, and others, however, limited attention has been paid to Urdu despite having more than 60 million native speakers. In this paper, we develop a deep learning model for the sentiments expressed in this under-resourced language. We develop an open-source co...
Article
Class imbalance is a challenging problem especially in a supervised learning setup, as most classification algorithms are designed for balanced class distributions. Although various up-sampling approaches exist for eliminating the class imbalance, however, they do not handle the complexities of sequential data. In this study, using the data of over...
Article
Full-text available
Most existing studies are focused on popular languages like English, Spanish, Chinese, Japanese, and others, however, limited attention has been paid to Urdu despite having more than 60 million native speakers. In this paper, we develop a deep learning model for the sentiments expressed in this under‐resourced language. We develop an open‐source co...
Article
In-text citation analysis is one of the most frequently used methods in research evaluation. We are seeing significant growth in citation analysis through bibliometric metadata, primarily due to the availability of citation databases such as the Web of Science, Scopus, Google Scholar, Microsoft Academic, and Dimensions. Due to better access to full...
Article
Full-text available
Big data analytics have shown a tremendous impact on modern politics—among which the election forecasting modeling is notable that utilizes the large scale heterogeneous data sources, such as polls, surveys, and social media popularity to build prediction models by exploiting the power of machine learning and artificial intelligence. In this articl...
Article
We argue that citations in scholarly documents do not always perform equivalent functions or possess equal importance. To address this problem, we worked with a corpus of over 21 k citations from the Association for Computational Linguistics, from which 465 citations were randomly annotated by experts as either important or unimportant. We used an...
Article
We argue that citations, as they have different reasons and functions, should not all be treated in the same way. Using the large, annotated dataset of about 10K citation contexts annotated by human experts, extracted from the Association for Computational Linguistics repository, we present a deep learning–based citation context classification arch...
Preprint
Full-text available
Malaria microscopy, microscopic examination of stained blood slides to detect parasite Plasmodium, is considered to be a gold-standard for detecting life-threatening disease malaria. Detecting the plasmodium parasite requires a skilled examiner and may take up to 10 to 15 minutes to completely go through the whole slide. Due to a lack of skilled me...
Article
Full-text available
Graph encoding methods have been proven exceptionally useful in many classification tasks - from molecule toxicity prediction to social network recommendations. However, most of the existing methods are designed to work in a centralized environment that requires the whole graph to be kept in memory. Moreover, scaling them on very large networks rem...
Article
This paper aims at an important task of computing the webometrics university ranking and investigating if there exists a correlation between webometrics university ranking and the rankings provided by the world prominent university rankers such as QS world university ranking, for the time period of 2005–2016. However, the webometrics portal provide...
Article
Foreign Exchange or Forex is the sale purchase market point of foreign currency pairs. Due to the high volatility in the forex market, it is difficult to predict the future price of any currency pair. This study shows that a significant enhancement in the prediction of forex price can be achieved by incorporating domain knowledge in the process of...
Article
Full-text available
Recent advancements in learning from graph-structured data have shown promising results on the graph classification task. However, due to their high time complexities, making them scalable on large graphs, with millions of nodes and edges, remains a challenge. In this paper, we propose NetKI, an algorithm to extract sparse representation from a giv...
Article
The advancements of search engines for traditional text documents have enabled the effective retrieval of massive textual information in a resource-efficient manner. However, such conventional search methodologies often suffer from poor retrieval accuracy especially when documents exhibit unique properties that behoove specialized and deeper semant...
Article
Full-text available
Text simplification and text summarisation are related, but different sub-tasks in Natural Language Generation. Whereas summarisation attempts to reduce the length of a document, whilst keeping the original meaning, simplification attempts to reduce the complexity of a document. In this work, we combine both tasks of summarisation and simplificatio...
Conference Paper
Full-text available
Effective teaching of surgical decision making requires providing students with a deep understanding of the domain so that they have the ability to make decisions in novel situations. This means providing them with a thorough understanding of causal relations between actions and their possible effects in the context of various states of the patient...
Article
Full-text available
Altmetric’s mission is to help others understand the influence of research online.We collate what people are saying about published research in sources such as the mainstream media, policy documents, social networks, blogs, and other scholarly and non-scholarly forums to provide a more robust picture of the influence and reach of scholarly work. Al...
Preprint
This article aims to exploit social exchanges on scientific literature, specifically tweets, to analyse social media users' sentiments towards publications within a research field. First, we employ the SentiStrength tool, extended with newly created lexicon terms, to classify the sentiments of 6,482,260 tweets associated with 1,083,535 publications...
Preprint
Full-text available
Citation analysis is one of the most frequently used methods in research evaluation. We are seeing significant growth in citation analysis through bibliometric metadata, primarily due to the availability of citation databases such as the Web of Science, Scopus, Google Scholar, Microsoft Academic, and Dimensions. Due to better access to full-text pu...
Chapter
In this paper, we address the problem of identifying the quality of citation as important or unimportant to the developments presented in the research papers. We gather features represented by four state-of-the-art machine learning techniques and combined them with newly engineered, natural language-based features. Using a known dataset of 465 cita...
Article
Full-text available
In the context of smart cities, it is crucial to filter out falsified information spread on social media channels through paid campaigns or bot-user accounts that significantly influence communication networks across the social communities and may affect smart decision-making by the citizens. In this paper, we focus on two major aspects of the Twit...
Article
Although over 64 million people worldwide speak Urdu language and are well aware of its Roman script, limited research and efforts have been made to carry out sentiment analysis and build language resources for the Roman Urdu language. This article proposes a deep learning model to mine the emotions and attitudes of people expressed in Roman Urdu -...
Article
Full-text available
Altmetrics are often praised as an alternative or complement to classic bibliometric metrics, especially in the social sciences discipline. However, empirical investigations of altmetrics concerning the social sciences are scarce. This study investigates the extent to which economic research is shared on social media platforms with an emphasis on m...
Article
In the current socio-economic environment, to face challenges such as the emergence of new technologies, globalisation and increasing demands from their clients it is inevitable that enterprises will collaborate with others and progressively shift their boundaries. In this context, interoperability has become a prerequisite in the jigsaw of such co...
Article
Full-text available
The purpose of the study is to (a) contribute to annotating an Altmetrics dataset across five disciplines, (b) undertake sentiment analysis using various machine learning and natural language processing-based algorithms, (c) identify the best-performing model and (d) provide a Python library for sentiment analysis of an Altmetrics dataset. First, t...
Article
Full-text available
One of the major advances in artificial intelligence nowadays is to understand, process and utilize the humans’ natural language. This has been achieved by employing the different natural language processing (NLP) techniques along with the aid of the various deep learning approaches and architectures. Using the distributed word representations to s...
Article
We argue that classic citation-based scientific document clustering approaches, like co-citation or Bibliographic Coupling, lack to leverage the social-usage of the scientific literature originate through online information dissemination platforms, such as Twitter. In this paper, we present the methodology Tweet Coupling, which measures the similar...
Article
We propose a new metric called ‘alt-index’, which is analagous to the h-index, but uses altmetrics data to measure both the volume and social media activity of scientific literature. The dataset includes over 1.1 million papers and their associated altmetrics score. A correlation analysis of the h-index and alt-index is conducted at three different...
Article
Native Language Identification (NLI) aims at identifying the native languages of authors by analyzing their text samples written in a non-native language. Most existing studies investigate this task for educational applications such as second language acquisition and require the learner corpora. This article performs NLI in a challenging context of...
Article
Traditional bibliometric techniques gauge the impact of research through quantitative indices based on the citations data. However, due to the lag time involved in the citation-based indices, it may take years to comprehend the full impact of an article. This paper seeks to measure the early impact of research articles through the sentiments expres...
Article
Full-text available
In higher education, predicting the academic performance of students is associated with formulating optimal educational policies that vehemently impact economic and financial development. In online educational platforms, the captured clickstream information of students can be exploited in ascertaining their performance. In the current study, the ti...
Article
Full-text available
Purpose The purpose of this paper is to present a novel approach for mining scientific trends using topics from Call for Papers (CFP). The work contributes a valuable input for researchers, academics, funding institutes and research administration departments by sharing the trends to set directions of research path. Design/methodology/approach The...
Article
The abundance of accessible educational data, supported by the technology-enhanced learning platforms, provides opportunities to mine learning behavior of students, addressing their issues, optimizing the educational environment, and enabling data-driven decision making. Virtual learning environments complement the learning analytics paradigm by ef...
Article
Is it possible to identify crime suspects by their mobile phone call records? Can the spatial-temporal movements of individuals linked to convicted criminals help to identify those who facilitate crime? Might we leverage the usage of mobile phones, such as incoming and outgoing call numbers, coordinates, call duration and frequency of calls, in a s...
Article
Full-text available
In the present era of Big Data, with continuously increasing amounts of user-generated content, it is becoming a challenge to understand the relation between the content that is available on the Web and the users who are generating that content. Researchers have come up with many ways to understand today's Web better. One of the recently introduced...
Conference Paper
Abstract Social media metrics are often praised as an alternative or complement to traditional bibliometric metrics, especially in the social sciences. However, empirical investigations of the social sciences are scarce. This research in progress paper explores the extent economic research is communicated on social media platforms with an emphasis...
Conference Paper
Full-text available
We present a novel bibliometric view to create a taxonomy of the interdisciplinary field of Information Communication Technology for Development (ICTD or ICT4D), using scientific documents published in the fields related to information communication technologies that were indexed in the Scopus database from 2001 to 2015. Our research approach utili...
Chapter
This chapter presents a novel scientific research landscape of the Gulf Cooperation Council (GCC) in order to access the research productivity, scholarly impact, and international collaborations across all GCC countries over the time period of 2008–2018, using the Scopus database. While we observe a significant increase in investing the resources i...
Article
Full-text available
Social networking sites play a significant role in altmetrics. While 90% of all altmetric mentions come from Twitter, the known microscopic and macroscopic properties of Twit-ter altmetrics data are limited. In this study, we present a large-scale analysis of Twitter alt-metrics data using social network analysis techniques on the 'mention' network...
Article
Computer sciences and related disciplines evolve around developing, evaluating, and applying algorithms. Typically, an algorithm is not developed from scratch, but uses and builds upon existing ones, which often are proposed and published in scholarly articles. The ability to capture this evolution relationship among these algorithms in scientific...
Article
The current evolution in multidisciplinary Learning Analytics Research, poses significant challenges for the exploitation of behavior analysis by fusing data streams towards advanced decision making. The identification of students that are at risk of withdrawals in higher education is connected to numerous educational policies, to enhance their com...
Article
This paper offers a detailed, first-ever, in-depth, data-driven, review of debates pertaining to international migration (IM) as depicted by cross-disciplinary records collected in Scopus. Accordingly, the paper also makes a case for the value added of bibliometric analysis and new ways of its application. Specifically, to gain a thorough understan...
Preprint
Full-text available
This study uses the article content and metadata of four important computer networking periodicals-IEEE Communications Surveys and Tutorials (COMST), IEEE/ACM Transactions on Networking (TON), ACM Special Interest Group on Data Communications (SIGCOMM), and IEEE International Conference on Computer Communications (INFOCOM)-obtained using ACM, IEEE...
Preprint
This study uses the article content and metadata of four important computer networking periodicals-IEEE Communications Surveys and Tutorials (COMST), IEEE/ACM Transactions on Networking (TON), ACM Special Interest Group on Data Communications (SIGCOMM), and IEEE International Conference on Computer Communications (INFOCOM)-obtained using ACM, IEEE...
Article
The relationship between influential tweeters and highly cited articles in the field of information sciences was analysed using Twitter data gathered by Altmetric.com from July 2011 through February 2017. The dataset consists of more than 10,000 tweets, and these mentions, retweets and followers were used to generate a connected, undirected graph....
Article
Full-text available
Recently, tremendous advances have been observed in information retrieval systems designed to search for relevant knowledge in scientific publications. Although these techniques are quite powerful, there is still room for improvement in the area of searching for metadata relating to algorithms in full-text publication datasets—for instance, efficie...
Conference Paper
This paper describes the design process by which we designed an Android application equipped with audio, textual menus and visuals components for use by farmers of diverse literacy levels looking for vital weather information after the conclusion of research-work that productivity lags due to information inadequacies. The intervention provides more...
Chapter
Despite the increase in the adoption of online educational platforms, student retention is still a challenging task with a number of students having low performance margins during these courses. This chapter intends to predict student performance based on their learning behavior on the basis of their logging data history, using the publicly availab...
Chapter
Recent advancements in information retrieval systems significantly rely on the context-based features and semantic matching techniques to provide relevant information to users from ever-growing digital libraries. Scientific communities seek to understand the implications of research, its importance and its applicability for future research directio...
Chapter
In this paper (Note that the dataset and code to reproduce the results can be accessed at the following URL: https://github.com/slab-itu/hsm), we intend to assess the quality of scientific publications by measuring the relationship between full text papers with that of their abstracts. A hybrid summarization model is proposed that combines text sum...
Article
Information retrieval systems for scholarly literature rely heavily not only on text matching but on semantic- and context-based features. Readers nowadays are deeply interested in how important an article is, its purpose and how influential it is in follow-up research work. Numerous techniques to tap the power of machine learning and artificial in...
Article
Purpose The purpose of this paper is to analyze the scientific collaboration of institutions and its impact on institutional research performance in terms of productivity and quality. The researchers examined the local and international collaborations that have a great impact on institutional performance. Design/methodology/approach Collaboratio...
Preprint
Information retrieval systems for scholarly literature rely heavily not only on text matching but on semantic-and context-based features. Readers nowadays are deeply interested in how important an article is, its purpose and how influential it is in follow-up research work. Numerous techniques to tap the power of machine learning and artificial int...

Network

Cited By