PreprintPDF Available
Preprints and early-stage research may not have been peer reviewed yet.

Abstract and Figures

Large language models have recently attracted significant attention due to their impressive performance on a variety of tasks. ChatGPT developed by OpenAI is one such implementation of a large, pre-trained language model that has gained immense popularity among early adopters, where certain users go to the extent of characterizing it as a disruptive technology in many domains. Understanding such early adopters' sentiments is important because it can provide insights into the potential success or failure of the technology, as well as its strengths and weaknesses. In this paper, we conduct a mixed-method study using 10,732 tweets from early ChatGPT users. We first use topic modelling to identify the main topics and then perform an in-depth qualitative sentiment analysis of each topic. Our results show that the majority of the early adopters have expressed overwhelmingly positive sentiments related to topics such as Disruptions to software development, Entertainment and exercising creativity. Only a limited percentage of users expressed concerns about issues such as the potential for misuse of ChatGPT, especially regarding topics such as Impact on educational aspects. We discuss these findings by providing specific examples for each topic and then detail implications related to addressing these concerns for both researchers and users.
Content may be subject to copyright.
"I think this is the most disruptive technology"
Exploring Sentiments of ChatGPT Early Adopters
using Twitter Data
Mubin Ul Haque, Isuru Dharmadasa, Zarrin Tasnim Sworna, Roshan Namal Rajapakse, Hussain Ahmad*
{mubinul.haque, isuru.mahaganiarachchige, zarrintasnim.sworna, roshan.rajapakse, hussain.ahmad}@adelaide.edu.au
School of Computer Science, University of Adelaide, Australia
Abstract—Large language models have recently attracted sig-
nificant attention due to their impressive performance on a
variety of tasks. ChatGPT developed by OpenAI is one such
implementation of a large, pre-trained language model that
has gained immense popularity among early adopters, where
certain users go to the extent of characterizing it as a disruptive
technology in many domains. Understanding such early adopters’
sentiments is important because it can provide insights into the
potential success or failure of the technology, as well as its
strengths and weaknesses. In this paper, we conduct a mixed-
method study using 10,732 tweets from early ChatGPT users.
We first use topic modelling to identify the main topics and then
perform an in-depth qualitative sentiment analysis of each topic.
Our results show that the majority of the early adopters have
expressed overwhelmingly positive sentiments related to topics
such as Disruptions to software development,Entertainment and
exercising creativity. Only a limited percentage of users expressed
concerns about issues such as the potential for misuse of Chat-
GPT, especially regarding topics such as Impact on educational
aspects. We discuss these findings by providing specific examples
for each topic and then detail implications related to addressing
these concerns for both researchers and users.
Index Terms—ChatGPT, Generative Pretrained Transformer,
Early adopters, Twitter, Sentiment Analysis, Topic Modeling
I. INTRODUCTION
ChatGPT is an artificial intelligence (AI) chatbot that under-
stands and generates natural human language with remarkable
sophistication, sensitivity, and usability [1]. ChatGPT is an
application of the latest version of GPT-3 (Generative Pretrained
Transformer 3), a state-of-the-art language processing AI model
developed by the OpenAI
1
foundation, that enables it to
generate human-like text. Unlike traditional chatbots, ChatGPT
remembers what the user said earlier in the conversation for
follow-up questions, rejects inappropriate requests, and chal-
lenges incorrect responses [2]. Moreover, ChatGPT provides
answers, solutions, and descriptions to complex questions,
including potential ways to solve layout problems, write code,
and answer optimization queries [3]. Given the advantages of
ChatGPT over traditional chatbots, ChatGPT has attracted more
than 1 million users in just one week after it was launched,
leaving behind other popular online platforms such as Netflix,
All the authors contributed equally to this paper.
*Corresponding author: hussain.ahmad@adelaide.edu.au
1https://openai.com/
Facebook, and Instagram in terms of adoption rates [4]. In
addition, there is a rising number of commentators who predict
that ChatGPT will replace Google in the near future [5]. Some
early adopters of ChatGPT believe that it will eventually
obsolete several professions related to content creation, such as
programmers, professors, playwrights, and journalists [1]. For
example, it has been demonstrated that ChatGPT is capable
of producing high-quality responses to a variety of challenges,
including solving coding challenges and generating accurate
responses to exam queries [6].
As described above, ChatGPT offers several benefits to
a wide range of users. However, being a new technology,
identifying early adopter sentiments is of high importance due
to several reasons. Firstly, early adopters are usually the most
enthusiastic and influential users of a product, and their opinions
and sentiments can help to shape the broader perception of
new technology. This information can provide critical insights
into the potential success or failure of the product.
Secondly, early adopters are often the first to encounter any
issues or problems with new technology, and their feedback
can help to identify and fix these issues before they become
widespread. Therefore, exploring the early adopter sentiments,
particularly for a disruptive technology such as ChatGPT, would
increase the chances of success of the tool in the market.
To investigate the sentiments of ChatGPT early adopters,
we analyze data from Twitter, which allows users to read and
share 140-character messages called “tweets". Unlike other
social networking platforms (e.g., Facebook and Instagram),
unregistered users can access and read tweets [7]. Moreover,
Twitter is a famous and large social networking micro-
blogging site [8]. With the burgeoning popularity of Twitter,
researchers and practitioners are increasingly using Twitter data
to get untapped information from potential customers [9]. For
example, Twitter has been explored for investigating the public
perception of "Internet of Things" [9], COVID-19 symptoms
emergence [10], drug-related adverse events detection [11],
work emotion and stress analysis [12], mining public health
data [13], and influenza epidemics detection [14]. Therefore,
in this study, we decided to use Twitter as a data source to
analyze the sentiments of early adopters of ChatGPT.
In this paper, we conduct a mixed-method study on 10,732
tweets from early adopters. We first use topic modelling to
1
arXiv:2212.05856v1 [cs.CL] 12 Dec 2022
LDA
Topic Modeling
   Filter non-
English tweets
 Keyword-  
     based search
User Location
User occupation
User Verification
Tweets
Characteristics analysis
of ChatGPT early adopters
Duplication
and Noise
Removal
Stop-word
Removal
Lowercasing Lemmatization
* ( ! ;
adding
added
adds
add
Data pre-processing
Sentiment Analysis
of ChatGPT early adopters
ChatGPTTweet Dataset Construction
RQ1
RQ2
RQ3
Fig. 1: Overview of our research methodology
identify the main topics of discussion. Next, we perform manual
sentiment analysis to qualitatively analyze a selected set of
tweets from each topic.
In summary, our paper makes the following contributions.
It provides an overarching analysis of topics discussed
by early adopters of ChatGPT through their tweets. We
describe each discussed topic in detail with specific tweet
examples. Moreover, we show the total number of tweets
against each topic to demonstrate its significance compared
to other topics.
It presents a high-level investigation of the sentiments
of early ChatGPT adopters for each identified topic.
The users’ sentiments are categorized based on positive,
negative, and neutral. This analysis shows the users’
perception of each topic.
We describe potential future research directions and
areas where ethical and societal implications are present
regarding ChatGPT use.
The rest of the paper is organized as follows. Section II
presents the related work of this study. While Section III
reports the research methodology, Section IV describes the
results. Section V presents the implications of our study for
researchers and users. Section VI describes the threats to the
validity of this study. Lastly, the conclusion of this study is
presented in Section VII.
II. RE LATE D WOR K
As ChatGPT technology is quite new, we have not found
substantial research directly related to ChatGPT. However, there
exist literature on the GPT family of text-generating AIs (e.g.,
GPT-2 and GPT-3). Also, we identified sufficient literature on
Twitter data mining to investigate users’ sentiments. Therefore,
in the following, we report some closely related studies to our
study.
Bian et al. [9] analyzed Twitter data to understand the public
sentiments about Internet of Things (IoT). They performed
sentiment analysis and topic modelling to identify prominent
topics and public attitudes toward each topic. As a result, they
found users are more interested in business and technology
compared to other domains of IoT, and have favourable
sentiments toward IoT. Similarly, Trivedi et al. [15] introduced
aRobustly Optimized BERT Pre-training Approach (RoBERTa)
to investigate public sentiments through their tweets on hybrid
work arrangements. The RoBERTa revealed that the majority
of users have positive sentiments for the hybrid work model.
Another study [16] reported Twitter temporal data mining for
predicting a movie’s success. The authors proposed a rating
prediction model and a temporal product popularity model to
forecast users’ satisfaction and movie popularity among users,
respectively. Similarly, Nawaz et al. [17] proposed a prediction
model to predict the political election results of Pakistan. The
authors showed 98% accuracy and efficiency of their model
in predicting the results through Twitter data as compared to
alternative approaches.
To the best of our knowledge, our work is the first study that
conducted a qualitative analysis of early adopter sentiments
and feedback on ChatGPT. We contribute to the literature by
providing a snapshot of the early public responses to this latest
technology.
III. RESEARCH METHODOLOGY
This section presents our research methodology, where
we discuss our research questions, dataset construction, pre-
processing steps, identification of discussed ChatGPT topics,
and sentiment analysis on these topics. Fig. 1 presents the
overview of our research methodology.
A. Research Questions
The following RQs motivated our empirical study.
RQ1
. What are the characteristics of ChatGPT early
adopters?
RQ2
. ChatGPT Topics - What are the main topics that
are being discussed about ChatGPT on Twitter?
RQ3
. ChatGPT Sentiments - What are the sentiments that
are being expressed about ChatGPT topics on Twitter?
2
Fig. 2: Qualitative analysis of tweets: Initial version of the code sheet for T1
B. ChatGPT Tweet Dataset Construction
To assess public sentiments on the early adoption of Chat-
GPT, we collected social media data, specifically from Twitter.
We collected tweets from December 5, 2022 to December 7,
2022. While collecting the tweets, we only considered the
tweets that included the keyword "ChatGPT". Besides, we
only collected the tweets that were written in the English
language. We used Python and Twitter API to extract Twitter
data. In our
ChatGP T T w eet
dataset, we have 18K tweets,
where we collected the text, user location, user occupation,
user verification, posted date, and hashtags for each tweet. To
answer RQ1, we analyzed user location, user occupation, and
user verification status information.
C. Data Pre-processing
We pre-processed our
ChatGP T T w eet
dataset using the
following steps:
1) Duplication Removal: We removed retweets and duplicate
tweets where a user repeated a tweet of another user. After
duplicate tweet removal, our ChatGPTTweet dataset includes
10,732 tweets that we used in our analysis.
2) Lowercasing: We lowercased the tweets that represent
words in different cases (e.g., StackOverflow and Stackover-
flow) to the same lower-case form (e.g., StackOverflow).
3) Noise Removal: We removed noises (e.g., punctuation
marks) to retain only the alphanumerical data for cleaning our
ChatGP T T w eet
dataset. In the noise removal step, we also
removed URLs, Emojis, and Twitter handles.
4) Stop words Removal: We removed stop-words that appear
frequently (e.g., this, are, and a) but do not help to distinguish
one tweet from another. We removed stop-words using the
NLTK [18] English stop-word list. Besides, we removed the
top three frequently appearing key domain specific words, such
as ChatGPT, OpenAI, and AI.
5) Lemmatization: We performed WordNet-based lemmati-
zation using NLTK [18]. We used lemmatization to represent
a word’s inflected forms (e.g., getting, gets) to its dictionary-
based root form (e.g., get), which is a common practice in the
existing literature [19].
D. Identification of ChatGPT Topics
To answer RQ2, we identified a set of ChatGPT key
topics using the Latent Dirichlet Allocation (LDA) modelling
technique [20]. LDA is a commonly used technique for
topic modelling in the existing Software Engineering (SE)
literature [21], [22]. LDA is used to group tweets of our
ChatGP T T w eet
dataset into a set of topics using word co-
occurrence and frequency. A set of probabilities are assigned
to each tweet by LDA. Here, the probabilities refer to the
chances of a tweet being related to a specific topic. We used
MALLET [23] implementation of LDA, which is commonly
used in existing literature [24], [25].
Identification of the optimal number of topics
N
, while
implementing LDA is critical as LDA may generate a huge
number of narrow topics for a high value of
N
. In contrast,
LDA may create broad generalized topics for a low value
of
N
. Hence, we executed a broad range of experiments to
analyze the coherence scores, which were achieved by varying
N
values from 5 to 60, with steps=2 and iteration=100, 500,
and 1000 using MALLET. The coherence score depicts the
understandability of LDA topics that is relevant to human
comprehensibility [26]. We obtained a comparatively high
coherence score by running MALLET when the number of
topics ranged from 9 to 12.
To ensure that we select the optimal number of topics, we
examined randomly selected 15 tweets from each topic for 9
N12. Based on our examination, we found that N= 9
(i.e, the optimal number of topics is 9) serves the purpose of
balancing the comprehensibility of our dataset. This approach
is followed in similar studies to find the optimal number of
3
Fig. 3: Geographic distribution of the early adopters of ChatGPT based on tweets
topics [22]. We then ran LDA using MALLET with N=9 and
generated a CSV file for each topic. Each CSV file was sorted
based on the highly relevant documents of the topic from LDA
topic modeling method.
E. Sentiment Analysis on ChatGPT Topics
To answer RQ3, we performed sentiment analysis for each
of our identified topics on ChatGPT. For this task, we initially
used Python’s NLTK Library to automatically classify the
tweets per topic. However, upon inspecting the results, we
were not satisfied with some of the automated classifications
of the library. For example, certain tweets that we identified
as neutral (discussing both negative and positive aspects of
ChatGPT) were classified as negative by the library. In addition,
a substantial amount of tweets were accompanied by images
(e.g., Screenshots). We preferred to have a look at the complete
tweet with these images for an overall view of the content
of the tweet. Finally, our previous work, which explored the
most recent trends and emerging solutions of a highly evolving
field, showed that qualitative analysis enables a more in-depth
and nuanced data analysis [27]. Therefore, we chose to do a
manual qualitative analysis of the tweets. We aim to extend
this work with the inclusion of a larger dataset in the future
to assess overall trends with time, where we plan to use an
automated analysis method.
For sentiment analysis, we manually labelled 9 datasets,
where each sample dataset (
Si
, where
i
ranges from 1 to 9)
includes 100 randomly selected tweets for a specific topic
using MS excel. We labelled a tweet with -1, 0, and 1,
where -1 represents negative sentiment, 1 represents positive
sentiment, and 0 represents neutral sentiment. We kept the
neutral sentiment label for tweets that discuss both positive
and negative aspects and tweets that were ambiguous to identify
a specific positive or negative sentiment. We then did open
coding [28] for each tweet to enable us to identify the common
patterns of discussion within a topic. In this task, we identified
keypoints (i.e., summarised points) and then codes (i.e., a phrase
that further summarizes the key point in 2 or 3 words) and
recorded them in the same spreadsheets as an example snapshot
is shown in Fig. 2.
All five authors were involved in the manual labelling and
qualitative analysis of our data. Here, each data set per topic
was labelled by two authors. All disagreements were resolved
through open discussions among the annotators to reach an
agreement for mitigating any possible errors.
IV. RES ULT ANA LYSIS
We describe the results for our RQ1, RQ2, and RQ3 in
Section IV-A, IV-B, and IV-C, respectively.
A. RQ1. What are the characteristics of ChatGPT early
adopters?
Motivation
In this RQ, we aim to analyze the characteristics
of the early adopters of ChatGPT. We analyze the characteristics
in terms of user location, occupation and account verification
status. This analysis will enable us to focus on the character-
istics of the early adopters to understand users from which
professions are interested in ChatGPT and the demographic
distribution of these users.
Approach
We quantitatively analyze the user loca-
tion, user occupation, and user verification status of our
ChatGP T T w eet dataset to answer this RQ.
Result
For user location, Fig. 3 demonstrates that early
adopters are geographically dispersed, where the majority of
the tweets originated from the North American and Asian
4
2%
98%
Verified
Not Verif ied
Fig. 4: Distribution of the verification status of the early
adopters of ChatGPT based on Twitter
16%
10%
8%
6%
6%
6%
6%
5%
37%
Software Practitioner
Academics/Researcher
Student
Data Scientist
Investor
Business Analyst
Entertainer
Journalist
Other
Fig. 5: Occupation distribution of the early adopters of ChatGPT
based on tweets
regions, and a similar number of tweets from Europe, Australia
and South America. In our analysis, we found USA, India,
UK, Canada, and Germany are the top-5 countries in terms of
expressing their opinion while adopting ChatGPT.
Besides, our analysis identifies that only 2% of the early
adopters of ChatGPT are verified Twitter users as shown in
Fig. 4. It implies that ChatGPT is not confined only to verified
users; rather, it is widely adopted across all Twitter users.
For user occupation, we identified a broad, wide, and
diverse range of early adopter communities for ChatGPT.
In our analysis, we also identified Twitter users with no
specific user job description, which we showed as other in
our occupation distribution that is depicted in Fig. 5. The
top 3 user occupations of the early adopters of ChatGPT
are software practitioners, academics, and students. Other
occupations include data scientists, investors, business analysts,
entertainers (e.g., artists, singers, actors), and journalists, as
shown in Fig. 5. It implies the popularity and interest in
adopting AI-based ChatGPT across diverse occupations.
For RQ1, we identified that the early adopters of ChatGPT
are located in geographically dispersed regions with a diverse
and broad range of professions.
2044
1790
3072
2044
1981
1234
1981
2578
4435
T1 Disruptions for Software…
T2 Entertainment and Exercising…
T3 Natural Language Processing
T4 Impact on Educational Aspects
T5 Chatbot Intelligence
T6 Impact on Business…
T7 Implications for Search Engines
T8 Q&A Testing
T9 Future Career & Opportunite
Number of Tweets
Fig. 6: Distribution of topics based on the number of tweets
captured by Topic Modeling
B. RQ2. ChatGPT Topics - What are the main topics that are
being discussed about ChatGPT in Twitter?
Motivation.
ChatGPT has attained significant attraction
over broad and diverse communities, which includes not only
researchers, managers, and practitioners, but also entertainers,
business analysts, and educationists.
For instance, the registered number of users for ChatGPT
has risen up to 1 Million within 5 days since its beta-
release [4], whereas Facebook, Netflix, and Instagram required
approximately 300, 1200, and 75 days to reach 1 Million users.
An answer to this RQ will identify the most common and
pressing ChatGPT topics that the communities have frequently
encountered while using ChatGPT. This identification will
enable us to understand how diverse communities express their
experience and insights over different domains. We assert that
this understanding is invaluable to assess ChatGPT’s capability,
effectiveness, and usability.
Approach.
We identified the optimal number of topics for
our
ChatGP T
dataset by using LDA as discussed in Section
III. After identifying the optimal number of topics, we examined
the top 20 keywords and randomly selected 30 tweets from
each of the identified topics to select a suitable name, which
provides the best representation for the group of tweets under
that topic. All the authors were involved in the examination
and discussed extensively to reach a consensus on naming the
topics. This approach for naming the topics is common and
widely adopted in the literature [21], [22], [24], [25], [29],
[30].
Results.
We identified 9 topics as shown in Table I that have
been discussed among the early adopters of ChatGPT. Fig. 6
depicts the distribution of these topics based on the number of
tweets captured by Topic Modelling. We describe these results
in detail below.
1) Disruptions to software development: A key area dis-
cussed in the extracted dataset is the potential disruptions
ChatGPT will cause to current software development practices.
For example, users discussed many examples of how ChatGPT
can be used to generate code, assist with debugging, and even
5
TABLE I: Topic Names and top 10 words (lemmatized topic words) for our ChatGP T T weet topics
Sl Topic Name Topic keywords
T1 Disruptions for Software Development code, write, create, program, generate, python, script, developer, error, run
T2 Entertainment and Exercising Creativity write, story, poem, love, fun, short, joke, style, funny, movie
T3 Natural Language Processing model, language, generate, data, text, prompt, human, conversation, learn, response
T4 Impact on Educational Aspects write, student, paper, essay, plan, research, education, school, assignment, teach, homework
T5 Chatbot Intelligence chatbot, intelligence, artificialintelligence, machinelearning, artificial, user, million, robot, security, app
T6 Impact on Business development time, startup, business, company, service, true, idea, control, market, customer
T7 Implications for Search Engines google, search, answer, engine, replace, result, source, StackOverflow, query, internet, information, avlable
T8 Q&A Testing question, answer, wrong, test, response, correct, amp, pretty, simple, solve
T9 Future Careers & Opportunities tool, future, time, people, technology, potential, job, world, change, learn
perform tasks like summarizing and translating code.
2) Entertainment and exercising creativity: Twitter users
were widely using ChatGPT for entertainment purposes, gener-
ating poems, jokes or other humorous write-ups. To generate
entertaining outputs using a ChatGPT, users need to provide
the model with some initial text to work with, such as a prompt
or seed text. A popular use case in this area was efforts to
combine characteristics of different entities (e.g., movie or TV
characters, popular personalities and concepts) in one amusing
write-up.
3) Natural Language Processing: The promising Natural
Language Processing (NLP) capabilities of ChatGPT en-
able it to generate understandable natural human language
that facilitates ChatGPT users in perceiving, comprehending,
and projecting generated text. Moreover, the ChatGPT NLP
capability significantly enhances its usability, utility, and
efficiency. Therefore, we have observed that early adopters
have meticulously discussed the NLP aspect of ChatGPT in
the extracted dataset. For example, users have discussed the
amalgamation of AI and NLP in ChatGPT technology in their
tweets. This enables ChatGPT to generate and understand
human-like texts as an outcome.
4) Impact on Educational Aspects: The early adopters of
ChatGPT are considering ChatGPT as one of the technologies
to change the traditional way of education. Users are discussing
how ChatGPT can be used for different purposes in the
education domain. For example, users discussed the use of
ChatGPT for early childhood learning, developing syllabi,
scientific literature review, and crisis management learning,
such as safety plans for thoughts of suicide.
5) Chatbot Intelligence: Twitter data reveals that the intelli-
gence of ChatGPT has become a prominent discussion point
among early adopters. The AI capability of ChatGPT enables
it to understand and respond to users’ queries in a meaningful
manner. With the help of AI, ChatGPT can entertain quite
complex and challenging problems that traditional chatbots fail
to understand accurately. For example, ChatGPT can write and
debug complex codes, solve large-scale optimization problems,
and provide accurate responses to complicated queries.
6) Impacts on Business Development: The application of
ChatGPT in business analysis is being widely discussed on
Twitter. Users are discussing the use of ChatGPT for developing
startup pitches, generating business use cases, and creating
business plans. Users are also asking ChatGPT to provide
financial advice.
7) Implications for Search Engines: One of the widely
discussed topics is using ChatGPT as a search engine to query
and retrieve a wide range of information. Exceeding the existing
search engine (e.g., Google Search, Bing) functionalities,
ChatGPT has provided users with a novel experience of
presenting information conveniently by selecting the most
appropriate information and explaining it in simple terms.
8) Q&A Testing: Q&A Testing is the third largest topic in
our
ChatGP T T w eet
dataset. In this topic, adopters typically
use ChatGPT to learn, compare, and verify answers for different
academic subjects (e.g., physics, mathematics, chemistry),
or/and conceptual subjects (e.g., philosophy, religion), among
others. We identified adopters who also ask open-ended and
analytical questions to ChatGPT in order to understand the
capability of ChatGPT. Furthermore, we observed several
questions on complex technical and emerging subjects as well.
9) Future Careers & Opportunities: This topic is the largest
discussed topic for ChatGPT, where early adopters share
their insights and ideas on how this advent AI technology
can influence future career opportunities. Early adopters also
discussed what we require in terms of skills, knowledge,
values, and behaviour to cope-up with this advanced technology.
Besides, we identified early adopters’ significant insight on
how the industry and research organizations can collaborate to
stay up-to-date on the latest developments in AI and ensure
that sustainable growth and progress are achieved and remain
relevant to the social, emotional, and cognitive aspects of
humans.
C. RQ3. ChatGPT Sentiments - What are the sentiments that
are being expressed about ChatGPT topics on Twitter?
Motivation.
Earlier research efforts [9], [15] in analyzing
Twitter sentiments (i.e., positive, negative, or neutral opinion)
asserted that the sentiment analysis is extremely helpful in
determining the public or user perception towards a product,
service, or technology. Sentiment analysis is also significant
since it can impact the longevity of a product, service, and
technology [31]–[33]. As a highly trending chatbot technology,
user communities expressed their sentiments while using
ChatGPT on Twitter. Answers to this RQ will help the ChatGPT
decision makers, where they need to act promptly, as Twitter
sentiments typically provide emotion-rich information.
Approach.
We followed the approach described in Section
III-E.
6
81
92
83
52
78
75
54
38
75
6
1
14
32
20
5
15
40
16
13
7
3
16
2
20
31
22
9
010 20 30 40 50 60 70 80 90 100
T1 Disruptions for Software Development
T2 Entertainment and Exercising Creativity
T3 Natural Language Processing
T4 Impact on Educational Aspects
T5 Chatbot Intelligence
T6 Impact on Business Development
T7 Implications for Search Engines
T8 Q&A Testing
T9 Future Career & Opportunite
Sentiment Polarity (%)
Positive Negitive Neutral
Fig. 7: Results of the qualitative sentiment analysis per topic
Results.
Fig. 7 depicts the summary of the percentage values
(Positive, Negative and Neutral) for each topic. Below, we
describe the results in detail for each topic.
1) Disruptions to software development: The sentiment
analysis returned 81% positive for this topic. Early adopters
were particularly impressed by ChatGPT’s abilities to assist in
coding tasks. We captured many tweets of users giving specific
examples of how ChatGPT assisted them with their software
development activities.
If you are a programmer like me (kind of lazy one!)
and not using GitHub #copilot, just try #ChatGPT.
This chatty lady wrote an API in #Reactjs in seconds!
Yes, I’m a programmer now. Thanks to #AI and
#OpenAI.
I think this is the most disruptive tech [..].
Another popular use case with positive sentiments was the
assistance this technology provides with regard to debugging
and error handling. Users expressed their satisfaction with how
ChatGPT provided them with specific assistance with sorting
out their coding-related troubleshooting as compared with the
other available services.
ChatGPT is a game changer for programmers and
coders! It can help with everything from debug-
ging and troubleshooting to providing guidance and
suggestions for code improvements. I’m loving the
assistance it provides. #chatgpt #python #R #HTML
#SQL #Java #JavaScript #C #Swift #PHP
Only a very less amount of users (i.e., 6) expressed concerns
about the disruptions ChatGPT will cause to the software
development practices. Some users noted that users need to be
mindful of how they use ChatGPT for specific development
assistance.
As many have said, #chatGPT runs the risk of spitting
out plausible-sounding but wrong answers in some
cases. It’s especially bad for generating code that
uses external libraries that change over time, as one
might expect. Simple logic is good tho
We categorized this tweet as neutral as it points out both
positive and negative aspects of ChatGPT use. Overall, users
are particularly positive that this technology has the potential
to save developers time and effort in their day-to-day tasks
drastically.
2) Entertainment and exercising creativity: Naturally, the
sentiment analysis for this topic turned out to be overwhelm-
ingly positive (i.e., 92%). Twitter users were widely using
ChatGPT for entertainment purposes, generating poems, jokes
or other humorous write-ups. To generate entertaining outputs
using a ChatGPT, users need to provide the model with some
initial text to work with, such as a prompt or seed text. A
popular use in this area was efforts to combine characteristics
of different entities (e.g., movie or TV characters, popular
personalities and concepts) in one amusing write-up.
I asked #ChatGPT “In the style of a Shakespearean
play, write a scene from the sitcom Friends, in which
Chandler accuses Joey of stealing his banana which
he had been saving to eat for his lunch.
Users also attempted to exercise their creativity by generating
interesting ideas, such as short stories or poems.
7
#ChatGPT write a short story about a dog named
Baxter who has died and is in heaven enjoying
cheeseburgers and beaches, who would like to solve
the mystery of the stolen Christmas dinner ham, in
the style of Arthur Conan Doyle.
We observed that a screenshot of the ChatGPT query output
is usually accompanied with these tweets. By analyzing the
selected tweets for sentiment analysis on this topic on Twitter
itself, we recognised that the ChatGPT-generated output was
very positively received as they had a high engagement. This
might have contributed to the drastic rise in the popularity of
this technology, as evidenced by the large initial number of
subscribers.
3) Natural Language Processing.: We observed diverse
sentiments (i.e., positive, negative, and neutral) of early adopters
of ChatGPT technology in our Twitter dataset. While most
tweets (i.e., 83%) imply satisfaction with ChatGPT, some
tweets (14%) have expressed their concerns. Only 3% of the
tweets reported a neutral viewpoint. For example, early adopters
seem happy with the amalgam of NLP and AI in ChatGPT
technology.
I’m really impressed with the natural language
processing capabilities of ChatGPT’s AI-powered
chatbot. It’s a great example of the power of #NLP
#AI technology. #chatgpt
Early adopters were also surprised by the realistic human-like
text generation of ChatGPT.
#ChatGPT is a prototype dialogue-based #AI chatbot
capable of understanding natural human language
and generating impressively detailed human-like
written text.
The use of human AI trainers and supervised fine-tuning
improves the ChatGPT text generation and perception. However,
some users have shown their concerns about the quality
of the generated text. They argue that ChatGPT provides
misinformation due to the lack of critical-thinking skills,
nuance, and ethical decision-making ability of ChatGPT.
I can always spot generated text - it lacks the depth
and complexity of human thought
In summary, though early adopters raised some negative
sentiments, the majority of ChatGPT users showed positive
responses. We believe ChatGPT is a promising stepping stone
to the development of an absolute human-like chatbot.
4) Chatbot Intelligence.: In the extracted dataset, we have
identified both the positive and negative sentiments of early
adopters regarding the intelligence capability of ChatGPT. As
per our quantitative analysis, 78% tweets showed a favourable
sentiment toward ChatGPT intelligence, 20% tweets raised
harmful impacts of AI for ChatGPT, and 2% tweets had a
neutral opinion. This shows that the majority of early adopters
are in favour of ChatGPT intelligence capability.
[...] ChatGPT could be a big help and see the
amazing wonders of AI (Artificial Intelligence) in
the future. @OpenAI I typed "Create React App"
and it provided me a clear to read instructions on
creating the application. #chatGPT
On the other hand, early adopters have also raised their
negative sentiments about AI involvement in ChatGPT. For
example, users were concerned about the ominous impacts of
AI-based ChatGPT on society such as an increase in terrorism,
hacking, and unemployment.
ChatGPT artificial intelligence developed by OpenAI
explains how to bomb and theft
In conclusion, ChatGPT’s intelligence capability provides
both positive and negative impacts on society. However, we
recommend an effective policy/regulation development for the
usage of ChatGPT. This mitigates the negative impacts of Chat-
GPT intelligence and hence improves ChatGPT acceptability
among its users in the future.
5) Impact on Educational Aspects: Unlike the other topics
(e.g., software development) where ChatGPT is accepted
overwhelmingly, the adoption of ChatGPT for educational
purposes raised both positive and negative perceptions among
the users. Our analysis identified 52% positive, 32% negative,
and 16% neutral views on the use of ChatGPT for educational
purposes.
Users are accepting ChatGPT for diverse application areas
in the education domain. For example, users discussed many
examples of how ChatGPT can be used for grading papers,
assessing students’ learning, and preparing syllabi. ChatGPT is
also considered as a good personal teacher for a student. Users
are considering ChatGPT as one of the many technologies
that will revolutionise the traditional way of teaching and
assessment.
#ChatGPT will change education as we know it. I am
hopeful. Perhaps this is what will springboard age-
old assessment practices to ones that authentically
assess student learning
On the contrary, the use of ChatGPT by students for writing
essays, preparing assignments, and home-works was raised
as a common public concern on Twitter. Users are concerned
about how the use of ChatGPT for preparing assignments
by students can hinder their learning process. Other concerns
include plagiarism detection for the students’ assignments and
shallow answers to ChatGPT in response to research questions.
With the rise of the amazing #ChatGPT, I am sure
many students will use it to write essays confidently
yet they learn nothing. Also plagiarism software
cannot detect this, even this may not be considered
as plagiarism at all
In summary, though there are some negative perceptions
regarding the adoption ChatGPT for preparing assignments
by students, many users are arguing against this concern as
they believe that ChatGPT is exceptionally good at identifying
text produced by AI, which enables it to mitigate students’
plagiarism.
ChatGPT is amazing, but it will be neither an
effective way for students to plagiarize nor replace
humanistic education.
8
6) Impact on Business Development: The adoption of
ChatGPT in business is being discussed positively with a ray
of hope for future possibilities. Our analysis identified 75%
positive, 5% negative, and 20% neutral views on the use of
ChatGPT in business.
Users are amazed at the response of ChatGPT to build an
elevator pitch for a new idea to investors which can significantly
support the start-up companies. For example, startup people
are hopeful to imagine ChatGPT as a technical co-founder.
ChatGPT is also expected to strongly support business decision-
making.
With ChatGPT and AI-assisted coding, tech busi-
nesses will become smaller and flatter. ChatGPT will
help good coders become massively more productive.
AND it will help management at all levels make better
decisions.
Users are also discussing the use of ChatGPT to create a
business plan for app development, write trading strategies, and
generate business-specific use cases. Besides, users of ChatGPT
are discussing diverse potential application areas of ChatGPT
from a business perspective. For example, the use of ChatGPT
to provide technical customer support and financial advice.
Imagine using ChatGPT as a technical support
specialist at Apple - personalized and accurate
responses to customer inquiries in seconds! This
is just one example of how ChatGPT can improve
customer service for any company.
Our analysis identified that ChatGPT is positively accepted
to have a promising future in the business domain with its
positive support in all business aspects from business plan
development to customer service support.
7) Implications for Search Engines: Our analysis identified
54% positive, 15% negative and 31% neutral sentiments on this
topic. It implies that most of the tweets present either a positive
or neutral sentiment over negative sentiments. For example,
some users have mentioned ChatGPT as a #googlekiller,
indicating that ChatGPT poses a significant threat to existing
search engines. This indicates that the users tend to consider
ChatGPT as a replacement for the current search engines.
Just relaized I have been using google to access
#ChatGPT the apparent #googlekiller.
Furthermore, some ChatGPT users have stated that ChatGPT
performs better in search speed and accuracy than the current
search engines and knowledge-sharing platforms, respectively.
This implies the potential future risks not only for search engine
services but also for other knowledge-sharing related platforms
such as Stack Overflow, Quora and Wikipedia.
On information side #ChatGPT is like: - Google but
more exact - Quora but faster [...].
We came across several instances of negative sentiments
concerning ChatGPT’s ability to provide accurate information
as it is leveraging the internet as its primary data source.
Furthermore, its also been remarked that some users are yet
to understand these limitations of ChatGPT. Thus, users are
encouraged to be critical and verify ChatGPT results from
other sources before using them in their intended applications.
It has become evident that you cannot blindly trust
#ChatGPT; data sourced from the internet is not
completely "verified" or accurate. Most people are
able to recognize this fact for ChatGPT, but not for
the internet as a whole. Be critical of the information
you find online.
Interestingly, several solutions (e.g., Google Chrome exten-
sion) have already emerged to address the above-identified
concerns. These plugins can retrieve and present search results
from ChatGPT and existing search engines, such as Google,
in a single view, facilitating the user to make better-informed
decisions. Therefore, we foresee ChatGPT significantly revolu-
tionizing and enhancing the user experience through improved
user interaction models in future search engines and knowledge-
sharing platforms.
8) Q&A Testing: Our analysis identified 38% positive, 40%
neutral, and 22% negative sentiments while adopting ChatGPT
for testing its capability in terms of questions and answering.
The positive sentiments for Q&A Testing topics are evolved
due to the quality, human-friendly interaction, fast responses,
and the provision of reasons for the generated answers. For
instance, we observed positive sentiments for providing users
with quality answers, which enables us to get the expected
value from ChatGPT.
I recently used #Chatgpt to help me better understand
conditional types in English, and I was impressed
by the quality of the exercises it provided. I found
that by giving it more precise and well-structured
questions, I was able to get even more value out of
it.
Besides, adopters are enthusiastic about the way ChatGPT
interacts with the users while answering their questions.
This is really spot on. I played 20 questions with
#ChatGPT yesterday. First, it guessing correctly who
I and my daughter was thinking about (Rapunzel).
And then us guessing that it was thinking about Billie
Eilish. Really felt like an interaction with a new kind
of entity.
Another cause for the positive sentiments for this topic is the
capability of ChatGPT for providing the reasons behind the
answer.
Nurturing curiosity in children is the best thing par-
ents, grandparents, teachers, caretakers, etc. could
do. when the eventual 1st, 2nd...n "why" question
hits you, might as well use #ChatGPT instead of the
catastrophic "because I told you so".
The negative sentiments for Q&A Testing topics are evolved
due to the wrong, incorrect, and sometimes invalid answers
for their questions.
"#OpenAI #ChatGPT I used this AI to find my
chem[istry] answers but got the wrong answers. for
example, Answer has to be in gms, and it’s giving
me in moles...
9
Besides, we identified some adopters have justified their
sentiments in this topic by mentioning that Stack Overflow has
banned to post answers generated by ChatGPT.
Stack Overflow has temporarily banned responses
generated by OpenAI’s #ChatGPT AI, citing “a high
rate" of incorrect answers. ChatGPT, can answer
questions about coding problems but often produces
“plausible-sounding but incorrect or nonsensical an-
swers,".
One significant concern raised by the early adopters of ChatGPT
in Q&A topic is the confidence shown for wrong or incorrect
answers, which may hamper the ChatGPT’s wider adoptability.
The primary problem is the answers which #ChatGPT
produces have a high rate of being incorrect...The
scary part was just how confidently incorrect it was.
9) Future Careers & Opportunities: Our analysis identified
75% positive, 9% neutral, and 16% negative sentiments while
early adopters were expressing their opinions regarding future
careers and opportunities. The positive sentiments for Future
Careers & Opportunities topics are evolved due to the fast and
effective solution provided by the ChatGPT, which can be a
crucial factor for successful careers.
AI coding and decision support systems will enable
very small teams to create massively valuable prod-
ucts, services, and companies.
Besides, we observed positive sentiments for ChatGPT’s
innovation, learning, and adaptation to make the job tasks
convenient and easy.
AI is the future! With its ability to learn and adapt,
it’s no surprise that more and more companies are
turning to this technology to improve their operations
and better serve their customers. Exciting times
ahead! #AI #innovation #ChatGPT #OpenAI
Besides, adopters are enthusiastic about the customization
feature of ChatGPT to provide personalized messages, services,
or answers, which adopters believed to help them to be more
engaged in future with their customers.
Imagine using #ChatGPT as a technical support spe-
cialist at [...] - personalized and accurate responses
to customer inquiries in seconds! This is just one
example of how ChatGPT can improve customer
service for any company.
The negative sentiments for Future Careers & Opportunities
topics are evolved due to fear among people of losing their
jobs to ChatGPT.
#ChatGPT ready to replace Product Managers...
Besides, we identified some adopters who have expressed to
lose their software programmer jobs as ChatGPT can perform
a magnitude of software development activities within a very
short time, which may take a long time for a human software
programmer.
More importantly, GPT will create a huge turmoil in
the IT industry. Many coding jobs will be taken over
by this AI, as it is able to do a multitude of coding
jobs within seconds (which takes humans many weeks
to do). This is a watershed moment in tech history.
V. IMPLICATIONS
Considering the increasing popularity of ChatGPT technol-
ogy, we conducted this study to identify and analyze topics with
regard to the sentiments of ChatGPT early adopters. The topic
modelling and sentiment analysis results provide an idea of
how ChatGPT technology is being perceived among early users,
and it also indicates the potential for ChatGPT’s acceptance in
the future. In this section, we discuss implications for users
and researchers based on our results.
A. Implications for Users
Our study identifies several benefits for users of ChatGPT.
For example, as described in
IV-B
1, ChatGPT has the potential
to change traditional ways of software development, which
shows that software developers can utilize ChatGPT while
creating software and ensuring an effective software develop-
ment process. Similarly, as described in
IV-B
6, ChatGPT can
help business developers create their feasibility reports and
business cases to enable smooth operations and other business
development-related tasks. For example, a user might query
business ideas for a specific context, and the model could
generate a list of possible ideas which can be used as a starting
point for brainstorming.
Our study has recognized several pitfalls that users must
consider and weigh when integrating ChatGPT into their
workflows and applications. For example, since ChatGPT
results are not verified or fact-checked by any established
authority, users must not solely depend on ChatGPT results
to perform any critical task. Therefore, users must critically
analyze ChatGPT results and consult other verified data sources
to make sensible decisions. Similarly, users must be cautious
about sharing their personal and confidential data with ChatGPT,
as adversarial attacks on ChatGPT language models can lead
to possible data breaches, leaving ChatGPT users vulnerable. It
is also essential to highlight the responsible usage of ChatGPT
from the users’ end, as integrating ChatGPT’s creations and
compositions into commercial and academic works can lead
to legal and ethical concerns.
B. Implications for Researchers
Our study provides potential future research directions to
researchers for exploring concerns related to the ethical use
of ChatGPT and other practical implications. For example,
users raised concerns about this technology being misused
in writing or completing educational activities such as essays
or assignments, which would hamper students’ learning. This
issue was captured in the Impacts on Educational Aspects topic.
Further, the topics Q&A Testing captured concerns related to
misinformation or inaccurate outputs being generated from
ChatGPT. These issues indicate that researchers should explore
means of mitigating these ethical and practical implications.
For example, effective protocols for ChatGPT’s usage can be
developed to ensure its ethical usage across different domains.
10
VI. TH RE ATS TO VALIDITY
Firstly, our study focuses on tweets from Twitter as a
representative of the sentiments of early adopters of ChatGPT.
Twitter was the 9th most visited website globally in 2021
[34], which is an enriched knowledge base for analyzing
human sentiment. However, future research can focus on
other resources, such as Stack Overflow and blogs, to further
generalize our findings.
Secondly, manual topic-wise sentiment analysis can be
subject to human judgment bias. To mitigate this threat to
validity, each sample dataset of a specific topic was labelled by
two researchers, and any disagreements were resolved through
discussion.
Thirdly, while threat modelling is useful in handling a large
amount of data, its usage introduces some threats. For example,
identifying the optimal value for a number of topics
N
is a
potential threat. To mitigate it, we followed the commonly
used method of experimentation with a broad range of values
for N[21], [35].
VII. CONCLUSION
In conclusion, our study of early adopters’ sentiments
about ChatGPT revealed overwhelming excitement and limited
concerns about this application of a large language model.
The majority of users were impressed by the performance
of ChatGPT and the potential of large language models to
assist with tasks related to several domains (e.g., Software
development, Business initiatives and analysis, NLP). However,
there are also important ethical implications that need to be
considered in ChatGPT use and further development. For
example, some users were concerned about the negative
effect it would have on the education industry activities such
as take-home assignments and essay writing for students.
Overall, our study provides valuable insights into the sentiments
of early adopters of ChatGPT and highlights the need for
continued research and dialogue to develop best practices for
the responsible use of large language models.
REFERENCES
[1]
S. Lock. (2022) What is ai chatbot phenomenon chatgpt and
could it replace humans? Accessed December 10, 2022. [On-
line]. Available: https://www.theguardian.com/technology/2022/dec/05/
what-is- ai-chatbot- phenomenon-chatgpt- and-could- it-replace- humans
[2]
(2022) Chatgpt: Optimizing language models for dialogue. Accessed
December 10, 2022. [Online]. Available: https://openai.com/blog/chatgpt/
[3]
K. Naidu. (2022) Chatgpt is a new ai chatbot that can find errors in
your code and write you a story. Accessed December 10, 2022. [Online].
Available: https://crazyengineer.in/chatgpt-is-a- new-ai-chatbot/
[4]
J. Sier. (2022) Chatgpt takes the internet by
storm, bad poetry and all. Accessed December 10,
2022. [Online]. Available: https://www.afr.com/technology/
chatgpt-takes- the-internet- by-storm- bad-poetry- and-all- 20221207-p5c4hv
[5]
A. Friedman, “Ai chatbot predicted to replace google in a couple of
years,” Dec 2022. [Online]. Available: https://www.phonearena.com/
news/ai-chatbot-could-replace-google-soon_id144120
[6]
A. Hern. (2022) Ai bot chatgpt stuns academics with essay-
writing skills and usability. Accessed December 10, 2022. [On-
line]. Available: https://www.theguardian.com/technology/2022/dec/04/
ai-bot- chatgpt-stuns- academics-with- essay-writing- skills-and- usability
[7]
H. Anber, A. Salah, and A. Abd El-Aziz, A literature review on
twitter data analysis,” International Journal of Computer and Electrical
Engineering, vol. 8, no. 3, p. 241, 2016.
[8]
V. Sahayak, V. Shete, and A. Pathan, “Sentiment analysis on twitter data,
International Journal of Innovative Research in Advanced Engineering
(IJIRAE), vol. 2, no. 1, pp. 178–183, 2015.
[9]
J. Bian, K. Yoshigoe, A. Hicks, J. Yuan, Z. He, M. Xie, Y. Guo,
M. Prosperi, R. Salloum, and F. Modave, “Mining twitter to assess
the public perception of the “internet of things”,” PloS one, vol. 11,
no. 7, p. e0158450, 2016.
[10]
J.-W. Guo, C. L. Radloff, S. E. Wawrzynski, and K. G. Cloyes, “Mining
twitter to explore the emergence of covid-19 symptoms, Public Health
Nursing, vol. 37, no. 6, pp. 934–940, 2020.
[11]
J. Bian, U. Topaloglu, and F. Yu, “Towards large-scale twitter mining for
drug-related adverse events, in Proceedings of the 2012 international
workshop on Smart health and wellbeing, 2012, pp. 25–32.
[12]
W. Wang, I. Hernandez, D. A. Newman, J. He, and J. Bian, “Twitter
analysis: Studying us weekly trends in work stress and emotion,” Applied
Psychology, vol. 65, no. 2, pp. 355–378, 2016.
[13]
M. Paul and M. Dredze, “You are what you tweet: Analyzing twitter for
public health,” in Proceedings of the International AAAI Conference on
Web and Social Media, vol. 5, no. 1, 2011, pp. 265–272.
[14]
E. Aramaki, S. Maskawa, and M. Morita, “Twitter catches the flu:
detecting influenza epidemics using twitter, in Proceedings of the 2011
Conference on empirical methods in natural language processing, 2011,
pp. 1568–1576.
[15]
S. Trivedi and N. Patel, “Mining public opinion about hybrid working
with roberta,” Empirical Quests for Management Essences, vol. 2, no. 1,
pp. 31–44, 2022.
[16] B. Alhijawi and A. Awajan, “Prediction of movie success using twitter
temporal mining,” in Proceedings of Sixth International Congress on
Information and Communication Technology. Springer, 2022, pp. 105–
116.
[17]
A. Nawaz, T. Ali, Y. Hafeez, M. R. Rashid et al., “Mining public opinion:
a sentiment based forecasting for democratic elections of pakistan,”
Spatial Information Research, vol. 30, no. 1, pp. 169–181, 2022.
[18]
E. K. Steven Bird and E. Loper, Natural Language Processing with
Python. O’Reilly Media Inc, 2009.
[19]
Z. T. Sworna, C. Islam, and M. A. Babar, “Apiro: A framework for
automated security tools api recommendation,” ACM Transactions on
Software Engineering and Methodology, 2022.
[20]
D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,”
Journal of machine Learning research, vol. 3, no. Jan, pp. 993–1022,
2003.
[21]
M. U. Haque, L. H. Iwaya, and M. A. Babar, “Challenges in docker
development: A large-scale study using stack overflow,” in Proceedings
of the 14th ACM/IEEE International Symposium on Empirical Software
Engineering and Measurement (ESEM), 2020, pp. 1–11.
[22]
M. Zahedi, R. N. Rajapakse, and M. A. Babar, “Mining questions asked
about continuous software engineering: A case study of stack overflow,”
in Proceedings of the evaluation and assessment in software engineering,
2020, pp. 41–50.
[23]
A. K. McCallum, “Mallet: A machine learning for language toolkit,”
2002, http://mallet.cs.umass.edu.
[24]
S. Ahmed and M. Bagherzadeh, “What do concurrency developers ask
about? a large-scale study using stack overflow,” in Proceedings of
the 12th ACM/IEEE International Symposium on Empirical Software
Engineering and Measurement (ESEM), 2018, pp. 1–10.
[25]
M. Bagherzadeh and R. Khatchadourian, “Going big: a large-scale study
on what big data developers ask, in Proceedings of the 2019 27th
ACM Joint Meeting on European Software Engineering Conference and
Symposium on the Foundations of Software Engineering, 2019, pp. 432–
442.
[26]
M. Röder, A. Both, and A. Hinneburg, “Exploring the space of topic
coherence measures,” in Proceedings of the eighth ACM international
conference on Web search and data mining, 2015, pp. 399–408.
[27]
R. N. Rajapakse, M. Zahedi, and M. A. Babar, “Collaborative application
security testing for devsecops: An empirical analysis of challenges, best
practices and tool support,” arXiv preprint arXiv:2211.06953, 2022.
[28]
A. Strauss and J. M. Corbin, Grounded theory in practice. Sage, 1997.
[29]
Z. Wan, X. Xia, and A. E. Hassan, “What is discussed about blockchain?
a case study on the use of balanced lda and the reference architecture
of a domain to capture online discussions about blockchain platforms
across the stack exchange communities,” IEEE Transactions on Software
Engineering, no. 01, pp. 1–1, 2019.
[30]
A. Abdellatif, D. Costa, K. Badran, R. Abdalkareem, and E. Shihab,
“Challenges in chatbot development: A study of stack overflow posts,”
11
in Proceedings of the 17th international conference on mining software
repositories, 2020, pp. 174–185.
[31]
A. Pak and P. Paroubek, “Twitter as a corpus for sentiment analysis and
opinion mining,” in Proceedings of the Seventh International Conference
on Language Resources and Evaluation (LREC’10), 2010.
[32]
B. Gokulakrishnan, P. Priyanthan, T. Ragavan, N. Prasath, and A. Perera,
“Opinion mining and sentiment analysis on a twitter data stream,” in
International conference on advances in ICT for emerging regions
(ICTer2012). IEEE, 2012, pp. 182–188.
[33]
S. Wakade, C. Shekar, K. J. Liszka, and C.-C. Chan, “Text mining for
sentiment analysis of twitter data,” in Proceedings of the International
Conference on Information and Knowledge Engineering (IKE). The
Steering Committee of The World Congress in Computer Science,
Computer . . . , 2012, p. 1.
[34]
C. Beveridge. (2022) 33 twitter stats that matter to marketers
in 2023. Accessed December 10, 2022. [Online]. Available: https:
//blog.hootsuite.com/twitter-statistics/
[35]
K. Stevens, P. Kegelmeyer, D. Andrzejewski, and D. Buttler, “Exploring
topic coherence over many models and many topics, in Proceedings of
the 2012 Joint Conference on Empirical Methods in Natural Language
Processing and Computational Natural Language Learning. Association
for Computational Linguistics, 2012, pp. 952–961.
12
... Hansson and Ellréus (2023) have illuminated code correctness and quality topics. Haque et al. (2022) also explored the PROGRAMMERS AND AI-CHATBOTS 14 broader implications of disruptive technologies. Likewise, Hörnemalm (2023) examined the practical utilization of AI, including tools like ChatGPT in software development. ...
... In the rapidly evolving landscape of technology, programmers consistently demonstrated adaptability to new tools and methodologies. Haque et al. (2022) researched programmers' interactions with AI tools in programming, revealing predominantly positive sentiments toward ChatGPT. Qualitative sentiment analysis among users highlighted satisfaction, particularly with topic distributions in discussions related to software development. ...
... The gap in this literature was the insufficient examination of how the positive sentiments and adaptability towards AI tools like ChatGPT specifically impacted long-term programming practices and team dynamics. While Haque et al. (2022) identified programmers' positive responses and the increasing use of AI technologies, the research did not explore how these tools influenced overall development methodologies, the PROGRAMMERS AND AI-CHATBOTS 62 integration of AI into existing workflows, or the broader implications for innovation and efficiency in the software development process. ...
Thesis
Full-text available
In recent years, chat and conversational AI technologies have seen growing integration across various industries, improving customer service, streamlining operations, and enhancing user experiences. This qualitative study examined the transformative impact of AI chatbots on the programming field and their broader effects. The study examined how AI chatbots have changed programming practices, boosted productivity, and introduced new opportunities and challenges. The study used hermeneutic phenomenology to interpret and understand these changes and their implications. The findings underscored the importance of addressing ethical issues, such as data privacy and unintended biases in AI systems, while highlighting the need for cross-disciplinary collaboration and a focus on user experience to maximize the benefits of AI chatbots. The study revealed that the role of programmers must evolve to integrate AI and chatbots effectively, emphasizing the importance of adapting to new responsibilities, including ethical considerations and continuous learning. Recommendations included upskilling in AI and machine learning and updating educational programs to prepare professionals for an AI-driven world. The findings highlighted the necessity for additional research to examine the broader effects of AI on related fields beyond programming. Future studies can explore the relationship between job characteristics, deskilling, and adaptation strategies across various industries. Such research would offer valuable insights into the broader effects of AI on different roles and career outcomes across multiple sectors.
... Esta notoriedad mediática ha despertado el interés de usuarios procedentes de múltiples áreas (Rudolph et al., 2023) y, en especial, de la comunidad de estudiantes (Haque et al., 2022). ...
... Por otro lado, las personas participantes tampoco tienen una opinión clara acerca de ChatGPT, ni sobre si este representa una amenaza para la docencia universitaria. La tendencia observada es que el profesorado considera que la herramienta tendrá un mayor potencial en el futuro (Tabla 6, Ítem 8), pese a que la literatura más reciente evidencia que el uso predominante actual pertenece a la comunidad de estudiantes (Haque et al., 2022). ...
Article
Full-text available
The emergence of ChatGPT poses new challenges in the educational field. Among them, the open discussion on the potentially negative consequences that the program's use may generate in the learning and evaluation processes of students. The present study investigates the level of knowledge and perception of ChatGPT among university educators, as well as their proficiency in discerning student-authored texts from those generated by artificial intelligence. For this purpose, 51 professors at the University of Barcelona, specializing in communication and philology, were presented with a sample of texts extracted from an authentic academic assignment that included versions written by students themselves, together with outputs generated ad hoc by ChatGPT. The accuracy rate of the authorship assignment performed by teachers was 31%, a value that reveals a new obstacle in teaching, learning, and evaluation processes in higher education. Additionally, there was a tendency for ChatGPT-generated texts to be rated more favorably than those written by the students themselves. Finally, the article presents several suggestions aimed at anticipating the potential impact of the unethical use of artificial intelligence on the development of skills and abilities among university students. La irrupción de ChatGPT plantea nuevos desafíos en el ámbito educativo. Entre ellos, destaca el debate abierto en torno a las consecuencias -potencialmente negativas- que los usos de la herramienta pueden generar en los procesos de aprendizaje y evaluación del alumnado d. El siguiente trabajo explora tanto el grado de conocimiento y percepción sobre ChatGPT del profesorado universitario, como la capacidad para distinguir trabajos de autoría humana de otros originados por la inteligencia artificial. Para ello, 51 docentes de la Universidad de Barcelona, de áreas de conocimiento asociadas a la comunicación y la filología, tuvieron acceso a los textos resultantes de una actividad académica real, a partir de versiones redactadas por los propios alumnos, y de otras generadas ad hoc a través de ChatGPT. Los resultados revelaron un porcentaje de acierto en la asignación de autoría del 31 %, un valor que evidencia un nuevo obstáculo en los procesos de enseñanza, aprendizaje y evaluación en la educación superior. Paralelamente, se observó que tendían a otorgar una valoración más positiva a las muestras elaboradas por ChatGPT frente a aquellas redactadas por el alumnado. Finalmente, el artículo recoge una serie de propuestas para anticipar el impacto que podría tener un uso deshonesto de la inteligencia artificial en la adquisición de competencias y habilidades del alumnado universitario.
... As of right now, ChatGPT, a brand-new AI invention from OpenAI that attracted a million users in just five days, is ambitious enough to advance this support (Haque et al., 2022). OpenAI's large-scale language model, known as Generative Pre-trained Transformer 3, or GPT-3, can generate text with 175 billion parameters. ...
Article
Full-text available
This paper reviews the practice of ChatGPT in the field of finance and banking through a natural language processing (NLP) system developed by OpenAI. We also discuss applications of ChatGPT in finance and banking, such as the detection of fraud, risk management, forecasting and analysis of the market, management of investments, chatbots for customer service, personal financial management, impact investing, and asset management along with its challenges, limitations and extremely encouraging future potentials. We also recommend taking necessary steps by the authorities to create and enhance ChatGPT-related legislation and regulations to avoid illegal and criminal offences. The paper concludes that ChatGPT will perform better and offer a better user experience for a variety of banking and finance sectors. It is worth remembering that in a relatively short period of time, ChatGPT has garnered significant attention from research, industry, and academia. This study will be helpful for policy makers, practitioners, and researchers.
... Following its initial release, ChatGPT was introduced as an updated version of the GPT-3 language model. It was trained on additional datasets of chatbot interactions, incorporating extensive parameters to generate text that sounds natural in conversations (Haque et al. 2022;Zhang et al. 2022). ...
Article
Full-text available
The emergence of artificial intelligence (AI) is transforming how humans live and interact, raising both excitement and concerns—particularly about the potential for AI consciousness. For example, Google engineer Blake Lemoine suggested that the AI chatbot LaMDA might become sentient. At that time, GPT-3 was one of the most powerful publicly available language models, capable of simulating human reasoning to a certain extent. The notion of GPT-3 having some degree of consciousness could be linked to its ability to produce human-like responses, hinting at a basic level of understanding. To explore this further, we administered both objective and self-assessment tests of cognitive (CI) and emotional intelligence (EI) to GPT-3. Results showed that GPT-3 outperformed average humans on CI tests requiring the use and demonstration of acquired knowledge. However, its logical reasoning and EI capacities matched those of an average human. GPT-3’s self-assessments of CI and EI didn’t always align with its objective performance, with variations comparable to different human subsamples (e.g., high performers, males). A further discussion considered whether these results signal emerging subjectivity and self-awareness in AI. Future research should examine various language models to identify emergent properties of AI. The goal is not to discover machine consciousness itself, but to identify signs of its development, occurring independently of training and fine-tuning processes. If AI is to be further developed and widely deployed in human interactions, creating empathic AI that mimics human behavior is essential. The rapid advancement toward superintelligence requires continuous monitoring of AI’s human-like capabilities, particularly in general-purpose models, to ensure safety and alignment with human values.
... However, the current study examined a two-month timeframe, wide enough to present research from a broader perspective. As a natural consequence of this, the amount of data collected is also reported by Haque et al. (2022), which is approximately 72 times higher [58]. This is an indication that the study includes more generalizable results. ...
... Since Chat-GPT is based on two closed models and no training details are provided, researchers are exploring its concerns and capabilities. The concerns involve ethical risks [50,51], patient privacy [52], fabricated misinformation [53,54], education integrity [55] and legal challenges [56]. For its capabilities, researchers evaluate the performance of ChatGPT on different tasks, including stance detection [57], question-answering [58], machine translation [9], sentiment analysis [59] and other general NLP tasks [60,61,62,10]. ...
Preprint
Full-text available
Human-like large language models (LLMs), especially the most powerful and popular ones in OpenAI's GPT family, have proven to be very helpful for many natural language processing (NLP) related tasks. Therefore, various attempts have been made to apply LLMs to information extraction (IE), which is a fundamental NLP task that involves extracting information from unstructured plain text. To demonstrate the latest representative progress in LLMs' information extraction ability, we assess the information extraction ability of GPT-4 (the latest version of GPT at the time of writing this paper) from four perspectives: Performance, Evaluation Criteria, Robustness, and Error Types. Our results suggest a visible performance gap between GPT-4 and state-of-the-art (SOTA) IE methods. To alleviate this problem, considering the LLMs' human-like characteristics, we propose and analyze the effects of a series of simple prompt-based methods, which can be generalized to other LLMs and NLP tasks. Rich experiments show our methods' effectiveness and some of their remaining issues in improving GPT-4's information extraction ability.
... A sub-field of education that has been deeply influenced by the rise of generative AI is second language (L2) writing/composition instruction, perhaps because of the wide overlap between the cognitive skills taught in writing classes and generative AI capabilities. The unprecedented capabilities of ChatGPT to process and generate texts hold both great potentials and challenges to L2 writing instruction, but most of the existing research on the implications of ChatGPT for L2 writing has emphasized ethical challenges to traditional L2 writing assessments (Cotton et al., 2024;Francke & Alexander, 2019;Gao et al., 2023;Haque et al., 2022;King & ChatGPT, 2023;Khalil & Er, 2023;Susnjak, 2022;Yeadon et al., 2022). As a result, little is known about the potential of ChatGPT to support L2 writing instruction and development. ...
Article
Full-text available
The release of ChatGPT has marked the dawn of a new information revolution that will transform how people communicate and make meaning. However, to date, little is known about the implications of ChatGPT for L2 composition instruction. To address this gap, the present study uses a systematic review design to synthesize available research on the educational potentials of ChatGPT as an instructional assistant, outline the implications of these potentials for L2 writing instruction, and discuss their practical applications. The findings, based on a meta-analysis of 42 research articles, demonstrate that ChatGPT can enhance L2 writing instruction by boosting learners’ motivation, automating instructional tasks, and offering instantaneous, personalized feedback to learners. These findings have important implications for harnessing the instructional potential of generative AI in L2 writing classes.
Article
This study investigates the association of socioeconomic status (SES) and digital and AI literacy with types of Chat GPT use by college students, with subsequent implications for academic self-efficacy and creativity, conditioned by trust. Analyses of a survey of U.S. college students (N = 947) show that SES has a greater association with AI literacy than with general digital literacy. Two dimensions of Chat GPT activities emerge: academic support and displacement. Structural equation modeling reveals that AI literacy is positively associated with both activity dimensions, while digital literacy is unexpectedly a negative contributor. Further, academic support is strongly linked to positive outcomes whereas academic displacement is negatively associated. Attitudinal trust in Chat GPT moderates the overall relationships. Our findings suggest that conventional digital inequality persists and evolves with generative AI, traditional digital literacy becomes insufficient in the age of AI, and trust in this new and opaque digital technology influences these relationships.
Article
Full-text available
The text attempts to define linguistic artificial intelligence in the context of GPT‑4 chat linguis-tic algorithms. The experiments carried out and described refer to the description of the chat functionality as highly intelligent in terms of language in terms of correctness, efficiency and linguistic culture. By referring to Gardners understanding of linguistic intelligence, and his crit-ics, the ability of artificial intelligence to use a standardized language is described, and errors in the data and the process of coding language models in GPT‑4 chat are marked.
Article
Full-text available
As the businesses recover from the COVID-19 epidemic, a new working paradigm is emerging: the hybrid work arrangement. A hybrid work method is a working approach that enables workers to work from several places, such as at home, on the move, or in the workplace. People are expressing their opinions on different social media outlets about the new work model. Organizations and businesses value public views. Because public perspectives will allow decision-makers to adapt promptly to rapidly transforming cultural, commercial, and social environments. Opinion mining is traditionally used to summarize the quantity of positive and negative responses in a given text using sentiment analysis techniques. Opinionated material from social media sites is used to identify people's enthusiasm or displeasure with a certain issue under debate. This study analyzes the public sentiments (positive, negative, and neutral) on a hybrid work model using Twitter API and the Robustly Optimized BERT Pre-training Approach (RoBERTa). Out of 1 thousand tweets containing the term "hybrid work", 37 (4.2%), 305 (33.3%), and 658 (62.5%) tweets were classified as negative, neutral, and positive, respectively. We also compared the public sentiments about hybrid work with those of remote work. The RoBERTa classified 8(1.6%), 436 (85.9 %), and 62 (12.5%) tweets as negative, neutral, and positive, respectively. The results showed that The majority of individuals showed favorable sentiment toward the hybrid work arrangement. The findings also demonstrate that "hybrid work" has an affinity with "remote work", "ai", "digital transformation" and "future of work".
Conference Paper
Full-text available
Chatbots are becoming increasingly popular due to their benefits in saving costs, time, and effort. This is due to the fact that they allow users to communicate and control different services easily through natural language. Chatbot development requires special expertise (e.g., machine learning and conversation design) that differ from the development of traditional software systems. At the same time, the challenges that chatbot developers face remain mostly unknown since most of the existing studies focus on proposing chatbots to perform particular tasks rather than their development. Therefore, in this paper, we examine the Q&A website, Stack Overflow, to provide insights on the topics that chatbot developers are interested and the challenges they face. In particular, we leverage topic modeling to understand the topics that are being discussed by chatbot developers on Stack Overflow. Then, we examine the popularity and difficulty of those topics. Our results show that most of the chatbot developers are using Stack Overflow to ask about implementation guidelines. We determine 12 topics that developers discuss (e.g., Model Training) that fall into five main categories. Most of the posts belong to chatbot development, integration , and the natural language understanding (NLU) model categories. On the other hand, we find that developers consider the posts of building and integrating chatbots topics more helpful compared to other topics. Specifically, developers face challenges in the training of the chatbot's model. We believe that our study guides future research to propose techniques and tools to help the community at its early stages to overcome the most popular and difficult topics that practitioners face when developing chatbots.
Article
Full-text available
Blockchain-related discussions have become increasingly prevalent in programming Q\&A websites, such as Stack Overflow and other Stack Exchange communities. Analyzing and understanding those discussions could provide insights about the topics of interest to practitioners, and help the software development and research communities better understand the needs and challenges facing developers as they work in this new domain. Prior studies propose the use of LDA to study the Stack Exchange discussions. However, a simplistic use of LDA would capture the topics in discussions blindly without keeping in mind the variety of the dataset and domain-specific concepts. Specifically, LDA is biased towards larger sized corpora; LDA-derived topics are not linked to higher level domain-specific concepts. We propose an approach that combines balanced LDA (which ensures that the topics are balanced across the domain) with the reference architecture of a domain to capture and compare topics of discussions across the Stack Exchange communities. We make a number of interesting observations, including: (1) Bitcoin, Ethereum, Hyperledger Fabric and Corda are the four most commonly-discussed blockchain platforms on the Stack Exchange communities; (2) A broad range of topics are discussed at distinct layers in our derived reference architecture. The consensus layer topics are most commonly discussed; (3) We observe an overall growth in the absolute impact for all architectural layer topics. The application layer topics have the greatest absolute impact over time in comparison to other layer topics; (4) Application layer, API layer, consensus layer and network layer topics are commonly discussed across the studied blockchain platforms. Based on our findings, we highlight future directions and provide recommendations for practitioners and researchers.
Conference Paper
Full-text available
Background Software developers are increasingly required to write concurrent code. However, most developers find concurrent programming difficult. To better help developers, it is imperative to understand their interest and difficulties in terms of concurrency topics they encounter often when writing concurrent code. Aims In this work, we conduct a large-scale study on the textual content of the entirety of Stack Overflow to understand the interests and difficulties of concurrency developers. Method First, we develop a set of concurrency tags to extract concurrency questions that developers ask. Second, we use latent Dirichlet allocation (LDA) topic modeling and an open card sort to manually determine the topics of these questions. Third, we construct a topic hierarchy by repeated grouping of similar topics into categories and lower level categories into higher level categories. Fourth, we investigate the coincidence of our concurrency topics with findings of previous work. Fifth, we measure the popularity and difficulty of our concurrency topics and analyze their correlation. Finally, we discuss the implications of our findings. Results A few findings of our study are the following. (1) Developers ask questions about a broad spectrum of concurrency topics ranging from multithreading to parallel computing, mobile concurrency to web concurrency and memory consistency to run-time speedup. (2) These questions can be grouped into a hierarchy with eight major categories: concurrency models, programming paradigms, correctness, debugging, basic concepts, persistence, performance and GUI. (3) Developers ask more about correctness of their concurrent programs than performance. (4) Concurrency questions about thread safety and database management systems are among the most popular and the most difficult, respectively. (5) Difficulty and popularity of concurrency topics are negatively correlated. Conclusions The results of our study can not only help concurrency developers but also concurrency educators and researchers to better decide where to focus their efforts, by trading off one concurrency topic against another.
Article
Full-text available
Social media analysis has shown tremendous potential to understand public's opinion on a wide variety of topics. In this paper, we have mined Twitter to understand the public's perception of the Internet of Things (IoT). We first generated the discussion trends of the IoT from multiple Twitter data sources and validated these trends with Google Trends. We then performed sentiment analysis to gain insights of the public's attitude towards the IoT. As anticipated, our analysis indicates that the public's perception of the IoT is predominantly positive. Further, through topic modeling, we learned that public tweets discussing the IoT were often focused on business and technology. However, the public has great concerns about privacy and security issues toward the IoT based on the frequent appearance of related terms. Nevertheless, no unexpected perceptions were identified through our analysis. Our analysis was challenged by the limited fraction of tweets relevant to our study. Also, the user demographics of Twitter users may not be strongly representative of the population of the general public.
Article
Security Orchestration, Automation, and Response (SOAR) platforms integrate and orchestrate a wide variety of security tools to accelerate the operational activities of Security Operation Center (SOC). Integration of security tools in a SOAR platform is mostly done manually using APIs, plugins, and scripts. SOC teams need to navigate through API calls of different security tools to find a suitable API to define or update an incident response action. Analyzing various types of API documentation with diverse API format and presentation structure involves significant challenges such as data availability, data heterogeneity, and semantic variation for automatic identification of security tool APIs specific to a particular task. Given these challenges can have negative impact on SOC team’s ability to handle security incident effectively and efficiently, we consider it important to devise suitable automated support solutions to address these challenges. We propose a novel learning-based framework for automated security tool API R ecommendation for security O rchestration, automation, and response, APIRO . To mitigate data availability constraint, APIRO enriches security tool API description by applying a wide variety of data augmentation techniques. To learn data heterogeneity of the security tools and semantic variation in API descriptions, APIRO consists of an API-specific word embedding model and a Convolutional Neural Network (CNN) model that are used for prediction of top 3 relevant APIs for a task. We experimentally demonstrate the effectiveness of APIRO in recommending APIs for different tasks using 3 security tools and 36 augmentation techniques. Our experimental results demonstrate the feasibility of APIRO for achieving 91.9% Top-1 Accuracy. Compared to the state-of-the-art baseline, APIRO is 26.93%, 23.03%, and 20.87% improved in terms of Top-1, Top-2, and Top-3 Accuracy and outperforms the baseline by 23.7% in terms of Mean Reciprocal Rank (MRR).
Article
Twitter has emerged as outstanding and most prominent social media in today’s technological age. The data proliferates in quick and words with its activities trigger get fast responses from the users. This platform is perfect for promoting political perspectives, particularly when election campaigns are on its peak. Political trends on Twitter media has been contemplated in the course of recent years. In the past research, both supervised and unsupervised methodologies have been used to analyze the Twitter trends. Most of the Tweet classification approaches utilized built in Dictionaries, Naïve Bayes, K-Nearest Neighbors (KNN), decision tree and Support Vector Machines (SVM) classifiers. However, in case of democratic election these trends can be mined to predict the winning party. However, all such approaches produce poor results due to language issues, low accuracy, limited access to internet and lower literacy rate in less developed countries such as Pakistan. This research study, find the best possible way for collection of tweets related to different political parties and build a prediction model that may analyze sentiments and opinions expressed by peoples in their Tweets. In this research work, a prediction based model along with novel similarity measure has been proposed to predict the election results of political parties in Pakistan. The proposed work is composed of data collection, preprocessing, aspect extraction, aspect refinement and final prediction using Bayesian theorem. Form the experimental results, it is concluded that proposed approach perform better than existing techniques by obtaining almost 98% accuracy and efficiently cover the limitations of existing studies.
Article
Background: The Centers for Disease Control and Prevention (CDC) in United States initially alerted the public to three COVID-19 signs and symptoms-fever, dry cough, and shortness of breath. Concurrent social media posts reflected a wider range of symptoms of COVID-19 besides these three symptoms. Because social media data have a potential application in the early identification novel virus symptoms, this study aimed to explore what symptoms mentioned in COVID-19-related social media posts during the early stages of the pandemic. Methods: We collected COVID-19-related Twitter tweets posted in English language between March 30, 2020 and April 19, 2020 using search terms of COVID-19 synonyms and three common COVID-19 symptoms suggested by the CDC in March. Only unique tweets were extracted for analysis of symptom terms. Results: A total of 36 symptoms were extracted from 30,732 unique tweets. All the symptoms suggested by the CDC for COVID-19 screening in March, April, and May were mentioned in tweets posted during the early stages of the pandemic. Discussion: The findings of this study revealed that many COVID-19-related symptoms mentioned in Twitter tweets earlier than the announcement by the CDC. Monitoring social media data is a promising approach to public health surveillance.
Conference Paper
Software developers are increasingly required to write big data code. However, they find big data software development challenging. To help these developers it is necessary to understand big data topics that they are interested in and the difficulty of finding answers for questions in these topics. In this work, we conduct a large-scale study on Stackoverflow to understand the interest and difficulties of big data developers. To conduct the study, we develop a set of big data tags to extract big data posts from Stackoverflow; use topic modeling to group these posts into big data topics; group similar topics into categories to construct a topic hierarchy; analyze popularity and difficulty of topics and their correlations; and discuss implications of our findings for practice, research and education of big data software development and investigate their coincidence with the findings of previous work.