Conference PaperPDF Available

A Preliminary Study on Quality of Experience Assessment of Compressed Audio File Format

Authors:

Abstract

Audio compression formats reduce the file size before downloading and uploading on the Cloud and social media platforms. An audio codec performs this operation. The uncompressed or lossless format can become lossless and lossy compressed files shrinking file size due to compression. Uncompressed file format change not only decreases the original size but also cuts the audio quality. The bit rate also drops. One assesses the quality of experience (QoE) of compressed audio file formats by comparing them to the original audios. One needs to conduct subjective QoE assessment experiments of both formats. This can be done by uploading a compressed file on the Cloud depending on its quality and sample rate to determine the satisfaction level of the end-user and which audio quality can be uploaded on the cloud, including other social media platforms.
A Preliminary Study on Quality of Experience
Assessment of Compressed Audio File Format
Jawwad Ali Baloch
Department of Computer Science
ILMA University
Karachi, Pakistan
jawwad.balooch001@gmail.com
Vania V. Estrela
Department of Telecommunication
Fluminense Federal University (UFF)
RJ, Brazil
vania.estrela.phd@ieee.org
Awais Khan Jumani
Department of Computer Science
ILMA University
Karachi, Country
awaisjumani@yahoo.com
Ricardo T.Lopes
LIN-COPPE, UFRJ
RJ, Braazil
ricardo@lin.ufrj.br
Asif Ali Laghari
Department of Computer Science
Sindh Madressatul Islam University
Karachi, Pakistan
asif.laghari@smiu.edu.pk
AbstractAudio compression formats reduce the file size
before downloading and uploading on the Cloud and social
media platforms. An audio codec performs this operation. The
uncompressed or lossless format can become lossless and lossy
compressed files shrinking file size due to compression.
Uncompressed file format change not only decreases the original
size but also cuts the audio quality. The bit rate also drops. One
assesses the quality of experience (QoE) of compressed audio file
formats by comparing them to the original audios. One needs to
conduct subjective QoE assessment experiments of both
formats. This can be done by uploading a compressed file on the
Cloud depending on its quality and sample rate to determine the
satisfaction level of the end-user and which audio quality can be
uploaded on the cloud, including other social media platforms.
KeywordsQuality of Experience (QoE), Audio File Formats,
MP3, ACC.
I. INTRODUCTION
Connectivity mediums differ regarding platforms where
users share feelings, events, and blisses globally. Different
mobile devices accessing the internet can upload/download
audio files from social media platforms, e.g., Twitter, Sound
Cloud, YouTube, Facebook, and Instagram. These platforms
entail compression of video and audio files to reduce the file
size and economize space. Many algorithms permit file size
reduction with codecs, viz. MPEG-4, H.264/AVC, and
HEVC [1]. The same goes for audio, where codecs compress
the data and convert the audio into digital format. Compressed
or lossy audio formats, such as MP3 and ACC, cut the audio
content and reduce the audio file size.
Different audio formats like uncompressed WAV and
AIFF contain full, original audio at the expense of disk space.
Others are compressed formats like lossless Free Lossless
Audio Codec (FLAC) and lossy MP3, Advanced Audio
Coding (AAC) audio files [2]. The former reduces the file size
without losing content, while the latter loses the audio content.
Audio digitization requires sampling with adequate
frequencies, such as a standard sample rate (16 bit) of 44.1
kHz for CD [3]. Video file formats are composed of containers
and codecs. The former encloses video data, including audio
data, metadata, and subtitles, while the second performs the
decoding operation of compressed video or audio. Due to the
incomplete information about codec, different containers may
work better with different codecs. Most social media
platforms recommend MP4 compression with H.264 or
MPEG-4 if the format is not known. It is also evident that the
compressed uploaded video on social media will not remain
the same as the original because of noise and distortion [4]. As
we know that video file formats comprise containers and
codecs, the former contains video data, including audio data,
metadata, and subtitles. At the same time, the latter decodes
compressed video or audio [5]. Due to the incomplete
information about a codec, containers may differ. Many video
compression schemes exist, e.g., H.264, MPEG-4, audio,
MP3, metadata, XML, etc. Most social media platforms
endorse compression of MP4 with H.264 or MPEG-4 for
unknown format [6, 7]. It is also patent that the compacted
uploaded video on social media will not stay the same as the
original owing to noise and distortion.
Recently, Quality of Experience (QoE) has become critical
to evaluate users' positive or negative experiences with
products or services (user's satisfaction level). It can deliver
an assessment to enhance user's satisfaction on performance
trustworthiness depends on the subjective or objective
measure concerning any service [8, 9]. This investigation
evaluates the QoE of the user on the different compressed
audio file formats. It analyzes how the compression happens
and which file formats suit the end-users. We use subjective
methods for perceived answers while considering the end
user's satisfaction level on different quality file formats. This
solution makes available us to measure the satisfaction level
of the end-user. Several QoE assessments (QoEAs) have been
made on video quality on Clouds, comprising video
streaming. Each compressed audio format scheme performs
differently from the other in terms of audio content. The QoE
was used in different domains to get user reviews about the
devices, products and improve the services using subjective
QoE of the end user's for different audio file formats, listening
2021 IEEE URUCON
978-1-6654-2443-1/21/$31.00 ©2021 IEEE 161
2021 IEEE URUCON | 978-1-6654-2443-1/21/$31.00 ©2021 IEEE | DOI: 10.1109/URUCON53396.2021.9647114
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on December 28,2021 at 12:20:33 UTC from IEEE Xplore. Restrictions apply.
qualities, sample rates, and bit-rates. QoEA on a compressed
audio file and its quality on the Cloud has never been
investigated by any research work using QoE on audio
compression parameters.
The 2nd section surveys literature. The third explains
audio details. Section 4 is based on QoEA experiments, and
the 5th section presents results and discussion. And finally, we
conclude our work in section 6.
II. RELATED WORK
Yang [10] explored MP3s and converted low bit-rate to
high bit-rate to double MP3 compression. Hence, to determine
the double compression of audio, different methods have been
used to detect the double compression and avoid many fake
quality MP3 files available on the internet. To identify these
fake MP3 audio files which are low bit-rate and recompressed
into high bit-rate. Moreover, Benford's law has been used to
find out the distribution of MP3 digits. Assorted experiments
help distinguish between single and double compression audio
files.
Similarly, Rasool et al. [11] the less time-consuming
PEAQ algorithm for audio quality at low bit-rates with
objective QoEA worked very well. The perpetual audio
quality depends on codecs and encoders. Furthermore, Sloan
et al. [12] compared objective metric Virtual Speech Quality
Objective Listener (ViSQOL) against others, e.g., Perceptual
Evaluation of Audio Quality (PEAQ), Perception Model
Quality Assessment (PEMO-Q), and Perceptual objective
listening quality assessment (POLQA). ViSQOL
outperformed other models based on accuracy and
performance for two out of three datasets.
A QoE web-based platform gauges/improves customer
satisfaction, provides QoS quality, offers backend storage, and
allows complaints' registration. An evaluating application
system has been created. The system can analyze and provide
a solution to customer's feedback. This can also evaluate the
performance of an employee. Based on this QoE and quality
of service (QoS), the banking sector can determine its
performance and internal structural problems that are the
biggest impediments to the best possible service delivery.
Laghari et al. [13] have investigated QoE between Cloud and
end-user. The QoE experiment handled different video
formats having additional quality features designated on two
other locations, long and nearby. The Ping command has been
used to identify the network delay time. By recognizing the
video from the Cloud on which users have shared their
satisfaction level. The findings showed that nearby locations
got higher ratings for video streaming than the far Clouds,
identified as a cause of network delay.
Sajida et al. [14] investigated video quality in the social
Cloud-based to determine the video quality analysis (VQA)
and exploring the reference metrics in quality degradation.
Experiments involved uploading selected low-quality videos
on social clouds. Objective video metrics for cloud hosting
helped check emotion during a social media interaction. The
quality and resolution of different videos have been affected
by the compression. Gunawan et al. [15] investigates QoE
assessment (QoEA) of the IEEE 1857.2 Advanced Audio
Coding, a lossless audio codec alternative to other codecs.
This study checked the codec performance and operation and
found that lossless compression improved the compression
operation due to the pre-processing block. Still, it increased
the encoding period and algorithm complexity. For lossless
compression, the IEEE 1857.2 compression performance is
much better than others. Bouraqia et al. [16] analyzed QoEA
for diverse networks and mentioned measurement methods
and worthy definitions. Online user's satisfaction with
streaming using search engines needs new rating acquisition
procedures.
III. AUDIO TEXT
Audio quality is an essential feature attracting users to
listen to audio and share his/her response as feedback. The
audio quality relies on some crucial elements such as sample
rate, bit rate, bit depth, and audio file format. The reason
behind this research is to relate QoE to the compression of
audio file formats. As far as format is concerned, digital audio
is stored on a computer. An audio file can be uncompressed or
compressed. The uncompressed audio file formats are WAV,
AIFF, and AU; they contain complete and original audio
formats and takes a lot of disk space.
On the contrary, the other two are compressed formats
such as lossless FLAC and lossy MP3 and AAC. The former
reduces the file size without losing the audio content, while
the latter reduces the audio file size and loses the audio
content. If we further describe the uncompressed audio file
formats, this format consists of original audio waves with full
data content as captured or taken. Because of no compression,
it takes a lot of space to be stored. E.g., WAV is an
uncompressed format with PCM. Still, few persons argue that
WAV is not an uncompressed audio file format but a window
container with multiple audio formats. A lossy audio file
format like MP3 reduces the audio file size and loses the audio
excellence during data compression. MP3 is also a lossy file
format. MP3 files can easily work on different audio devices
because of their less size. It also drops the sound or audio so
that it could not be audible for the user. The lossless audio file
format is the one that reduces the file format during
compression but without losing the audio data. The free
lossless audio codec is an open-source format. It can compress
the audio file without losing any data.
Fig. 1. Fig.1. Process of compression
Fig. 1 shows compression from 16-bit to 8-bit files
measuring sample rate during audio digitizing. A standard
sample rate (16 bit) for CDs is 44.1 kHz. Video file formats
have containers and codecs. The former contains video data,
including audio, metadata, and subtitles, while the latter
decodes compressed video or audio. In the past, a higher
sample rate took more storage and power to run, but
nowadays, digital storage resolved this issue. Bit rate defines
the audio stream quality, and a higher bit rate describes the
2021 IEEE URUCON
162
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on December 28,2021 at 12:20:33 UTC from IEEE Xplore. Restrictions apply.
better quality of audio. The amount of audio file content and
its file size can increase because of the high sample rate and
bit depth, and that can only be possible with the help of high
bandwidth for the production quality audio. Low bit rate takes
less bandwidth and size but affects the audio quality. A bit
depth is several bits per sample, which is an active range of
sound. The active range of sound differentiates between the
highest and the lowest sound during the recording. The bit rate
calculation follows:
bit rate = Frequency*channels*bit depth
A series of binary numbers (bits) are produced through an
analog to digital converter by taking a snapshot of sound to
describe the sample. Further, digitization means several
samples or snapshots from analog audio went through an A/D
converter. Table 1 below shows some multimedia purposes
applications, and it also indicates bit rates and sample rates of
each.
TABLE I. APPLICATION WITH SAMPLE RATE AND BIT RATE
Quality
Mono/
Stereo
Bits
per
Sample
Sample
Rate
Frequency
Band
Data
Rate
Telephone
Mono
8
8
200-3,400
Hz
8k
byt/sec
AM Radio
Mono
8
11.025
11.0k
byt/sec
FM Radio
Stereo
16
22.050
88.2k
byt/sec
CD
Stereo
16
44.1
20-20,000
Hz
176.4k
byt/sec
DAT
Stereo
16
48
20-20,000
Hz
192.0k
byt/sec
IV. QUALITY OF EXPERIENCE
The uncompressed WAV and AIFF file formats have
better quality. The file size of these audio formats is much
larger than the compressed one, which takes a lot of space to
be stored. On the other side, the compressed audio file formats
are twofold: lossless and lossy. The lossless format like FLAC
reduces the file size during the compression without
compromising on the quality of audio content. A compressed
lossy format, such as MP3 and AAC, reduces the original file
size and losing content. We used two forms of WAV audio;
two music audios and two voice speeches. Where music one
audio size was 56 seconds and music two audio sizes was 54
seconds while the size of both voice speeches was 46 seconds
and then converted these WAV audio formats into compressed
lossy file formats like MP3 and AAC. We used uncompressed
WAV and then converted it into compressed lossy file formats
like MP3 and AAC. We used different techniques to transform
the WAV into MP3 and AAC to perform this compressed
operation. Then, we checked the QoE accuracy between
compressed formats such as MP3 and ACC. After that, the
outcome of the MP3 and AAC compressed files has been
transferred and uploaded into the other listening devices for
QoEA wherein to get the best compressed audio file among
MP3 and AAC. Fig 2 depicts this study on a step-by-step
basis, from audio compression and towards QoEA.
Fig. 2. QoE model for compressed audio format
To investigate QoEA, the MP3 and AAC compressed
audio files formats went to listening devices. Secondly, a quest
poll of audio listening for the QoE study allowed 200
participants. Most of them were undergraduate students and
masters' degree holders. Out of 200 participants, 152 males
and 48 females participated in this poll, and their ages were
between 21 to 35 years. Viewpoints came from distributed
questionnaires for rating the audio. Firstly, we played original
audio through listening devices. Then, we played MP3 and
AAC compressed audio and allowed participants to rate the
best-compressed format. The rating mechanism or mean
opinion score format relied on questionnaires tabular form.
We used absolute category rating (ACR) format to rank the
best compressed audio file format in table 2. The mean
opinion score (MOS) technique has been used for the QoE
assessment to get the score for each audio format.
TABLE II. QUALITY PARAMETERS
Quality
Excellent
Good
Fine
Fair
Poor
V. RESULTS AND DISCUSSION
The VLC Player version 3.0 and Windows Media Player
version 12 for audio compressions have been used.
Uncompressed WAV aid in checking the difference of audio
quality and size between compressed lossy and uncompressed
formats. The WAV format carries the original audio, and it
outclassed all other formats. However, the adverse point of the
WAV is its losses. Leveraging over-compressed audio formats
consumes storage space. This is why people go towards
compression schemes like MP3 and AAC against WAV. MP3
and AAC have compressed the WAV format. The QoEA
experiment relies on compression formats. For that purpose,
2021 IEEE URUCON
163
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on December 28,2021 at 12:20:33 UTC from IEEE Xplore. Restrictions apply.
we conducted a public or user opinion-based poll on these
compressed audio formats, where participants of this poll had
listened to these formats and gave their opinion on the best
audio format.
TABLE III. FEEDBACK FOR ACC
Advanced Audio Coding (AAC)
Quality
Participants
Excellent
59
Good
67
Fine
65
Fair
9
Poor
0
TABLE IV. FEEDBACK FOR MP3
MP3
Quality
Participants
Excellent
42
Good
40
Fine
44
Fair
63
Poor
11
Tables 3 and 4 display outcomes in absolute category
rating (ACR) for 200 participants, where 59 marked Excellent
for AAC, while 42 marked Excellent for MP3. Moreover, 67
participants rated AAC as Good, while 40 marked Good for
MP3. Likewise, for ACC, 65 marked Fine, and 9 chose Fair.
However, 44 ranked Fine, 63 marked Fair, and 11 chose Poor
for MP3.
Fig. 3. Comparison of audio formats (AAC and MP3)
Results show users' MOS in Table 5 and Fig 3 with high
scores to AAC compared to MP3. As discussed above, both
audio formats belong to the same lossy family, but the
effective compression algorithm AAC performs well than
MP3. AAC audio file format is much more effective than
MP3, but it contains more original content than MP3 while
having identical file sizes and bit rates. Plus, it is also the fact
that AAC performs much better than MP3 in low bit-rates
such as less than 128 kbps. Consequently, the above tables and
fig. 4 illustrate that how AAC is better in performance than
MP3.
Fig. 4. Audio formats' comparison
VI. CONCLUSION
This paper measured the quality of experience (QoE) of
audio file format by a public opinion poll known as mean
opinion score (MOS). The uncompressed WAV file has been
compressed into MP3 and AAC using VLC Player, Windows
Media Player, and other online platforms. QoEA involved 200
participants invited to hear audio formats and rete quality
audio. As per the QoEA, it can be said that the AAC audio
format quality is better than MP3. MP3 is more popular
among the masses. This research work provides an opinion
regarding compressed file formats with less quality than
uncompressed audio formats like WAV. However, we have
also found out that many users most commonly use
compressed audio formats. It is just because of the
compressing scheme due to which these audio formats take
less storage space.
REFERENCES
[1] N. Sadeghi, M. Fahiminia, and M. Teimouri, “Dataset for file fragment
classification of video file formats,” BMC Res. Notes, vol. 13, no. 1,
2020, doi: 10.1186/s13104-020-05037-x.
[2] J. Oh and E. S. Jang, “MP3-based point cloud compression,” 2021 Int.
Conf. Electron. Information, Commun. ICEIC 2021, 2021, doi:
10.1109/ICEIC51217.2021.9369717.
[3] G. Schuller, “Predictive Lossless Audio Coding,” Filter Banks and
Audio Coding, pp. 161166, 2020, doi: 10.1007/978-3-030-51249-
1_7.
[4] A. Alfa et al., “A Comparative Study of Methods for Hiding Large Size
Audio File in Smaller Image Carriers,” Commun. Comput. Inf. Sci.,
vol. 985, pp. 179191, 2019, doi: 10.1007/978-981-13-8300-7_15.
[5] Q. Huang, R. Wang, D. Yan, and J. Zhang, “AAC audio compression
detection based on QMDCT coefficient,” Lect. Notes Comput. Sci.
(including Subser. Lect. Notes Artif. Intell. Lect. Notes
Bioinformatics), vol. 11068 LNCS, pp. 347359, 2018, doi:
10.1007/978-3-030-00021-9_32.
[6] S. Cunningham and I. McGregor, “Subjective Evaluation of Music
Compressed with the ACER Codec Compared to AAC, MP3, and
Uncompressed PCM,” Int. J. Digit. Multimed. Broadcast., vol. 2019,
2019, doi: 10.1155/2019/8265301.
[7] K. D. Singh, Y. Hadjadj-Aoul, and G. Rubino, “Quality of experience
estimation for adaptive HTTP/TCP video streaming using
H.264/AVC,” 2012 IEEE Consum. Commun. Netw. Conf.
CCNC’2012, pp. 127–131, 2012, doi: 10.1109/CCNC.2012.6181070.
[8] A. Laghari, H. He, S. Karim, H. A. Shah, and N. K. Karn, “Quality of
Experience Assessment of Video Quality in Social Clouds,” Wirel.
Commun. Mob. Comput., vol. 2017, 2017, doi:
10.1155/2017/8313942.
2021 IEEE URUCON
164
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on December 28,2021 at 12:20:33 UTC from IEEE Xplore. Restrictions apply.
[9] F. De Turck, S. Petrangeli, J. Van Der Hooft, and T. Wauters, “Quality
of experience-centric management of adaptive video streaming
services: Status and challenges,” ACM Trans. Multimed. Comput.
Commun. Appl., vol. 14, no. 2s, 2018, doi: 10.1145/3165266.
[10] R. Yang, Y. Q. Shi, and J. Huang, “Detecting double compression of
audio signal,” Media Forensics Secur. II, vol. 7541, p. 75410K, 2010,
doi: 10.1117/12.838695.
[11] R. Rassool, “VMAF reproducibility: Validating a perceptual practical
video quality metric,” IEEE Int. Symp. Broadband Multimed. Syst.
Broadcast. BMSB, 2017, doi: 10.1109/BMSB.2017.7986143.
[12] Sloan, N. Harte, D. Kelly, A. C. Kokaram, and A. Hines, “Objective
Assessment of Perceptual Audio Quality Using ViSQOLAudio,” IEEE
Trans. Broadcast., vol. 63, no. 4, pp. 693705, 2017, doi:
10.1109/TBC.2017.2704421.
[13] A. Laghari, H. He, M. Shafiq, and A. Khan, “Assessing effect of Cloud
distance on end user’s Quality of Experience (QoE),” 2016 2nd IEEE
Int. Conf. Comput. Commun. ICCC 2016 - Proc., pp. 500505, 2017,
doi: 10.1109/CompComm.2016.7924751.
[14] S. Karim, H. He, A. R. Junejo, and M. Sattar, “Measurement of
Objective Video Quality in Social Cloud Based on Reference Metric,”
Wirel. Commun. Mob. Comput., vol. 2020, 2020, doi:
10.1155/2020/5028132.
[15] T. S. Gunawan, M. K. Mat Zain, F. A. Muin, and M. Kartiwi,
“Investigation of lossless audio compression using IEEE 1857.2
advanced audio coding,” Indones. J. Electr. Eng. Comput. Sci., vol. 6,
no. 2, pp. 422430, 2017, doi: 10.11591/ijeecs.v6.i2.pp422-430.
[16] K. Bouraqia, E. Sabir, M. Sadik, and L. Ladid, “Quality of Experience
for Streaming Services: Measurements, Challenges and Insights,”
IEEE Access, vol. 8, pp. 1334113361, 2020, doi:
10.1109/ACCESS.2020.2965099.
2021 IEEE URUCON
165
Authorized licensed use limited to: SOUTH CHINA UNIVERSITY OF TECHNOLOGY. Downloaded on December 28,2021 at 12:20:33 UTC from IEEE Xplore. Restrictions apply.
... Cloud gaming gives more GE in the multiplayer scenario with VR boxes. Yet, the CG quality hinders services due to the middle network speed and connection [81,82,83,84]. Researchers still need to address how dynamic network traffic affects CG quality. ...
Article
Full-text available
Virtual reality (VR) gaming (VRG) captures, examines, analysis, and renders the game data virtually instead of the users’ devices over efficient networks. E.g., VR is crucial in visually guided surgery and manifold therapies. Video Quality Assessment (VQA) helps VRG software and devices execute high-quality inputs/outputs. This paper debates VRG processes, environments, VQA metrics, Quality of Experience (QoE), platforms, current methods, tools, use in medicine, existing operational gains/challenges, and highlights future directions.
... Acoustic monitoring through bat detector usually requires a lot of storage space, as recording at a high sample rate results in very large audio files (Baloch et al. 2021). This issue is significantly reduced when using mobile devices, as they record at a lower sample rate. ...
Article
Full-text available
Citizen science has become a crucial tool in biodiversity monitoring, often facilitated by the diffusion of mobile devices, such as smartphones and tablets. High costs of professional equipment often limit large-scale monitoring, particularly in bat monitoring programmes based on acoustic surveys. Here we present the potential of using mobile devices for bat monitoring, allowing for large-scale, volunteer-based monitoring programmes. We initially compared mobile devices’ performance with a professional bat detector for recording low-frequency bat calls. We then conducted a citizen science pilot study to test the method’s feasibility in a real-world setting, recording echolocation and social calls of nine European bat species. We found high similarity in spectrogram quality (0.74 ± 0.09 for Samsung, 0.90 ± 0.01 for Huawei, 0.86 ± 0.09 for Xiaomi, 0.69 ± 0.09 for Apple) and average peak frequency (differences of 0.2 ± 0.5 kHz for Samsung, 0.1 ± 0.7 kHz for Huawei, 0.5 ± 1.0 kHz for Xiaomi, 0.1 ± 0.8 kHz for Apple) between calls recorded by mobile devices and professional bat detectors. The number of recorded bat calls per sampling session was also similar. However, differences in sound quality and effectiveness among mobile device brands were found. iOS devices outperformed professional detectors at recording bat calls at increasing distances. The citizen science pilot study tested 35 mobile device models, all of which effectively recorded bats. This study suggests that mobile devices could be an accessible, no-cost tool for large-scale bat monitoring. Incorporating mobile devices into existing monitoring networks or creating new dedicated programmes could not only enhance data collection, but also boost public knowledge and awareness about bats, ultimately promoting informed decision-making and better conservation strategies.
... Analysis of subjective QoE is important for an organization to differentiate between the negative and positive opinions of the user to offer services according to SLA, so still, there is research and development required to find accuracy in the negative and positive opinions of users, when they are using cloud service [142]. QoE monitoring of cloud resources such as virtual migration, CPU power, video codecs and resolution, available bandwidth in internal cloud networks, terminal capabilities, and packet loss or reordering and in a middle network environment, which include wireless or broadband traffic, SLA, and applications, which used by the client to use cloud services [143,144]. Fixed policy for the end-user from organizations is also a problem, which will be solved by designing a new mechanism that upgrades or changes SLA for the time being when resources are utilized at peak level [145]. ...
Article
Full-text available
The services are delivered to the user through a protocol interface of different resources. Service delivery conferring on quality of service (QoS) mentioned in service level agreement (SLA) and user satisfaction has been a major problem to access cloud services. To avoid such situations quality of experience (QoE) domain was added to the cloud computing environment to assess user satisfaction and the need for service delivery. In this paper, we present a review of QoE-based solutions, frameworks, and models, which have been proposed for different cloud computing environments. Finally, we discuss the advantages of frameworks and models, their limitations, and open issues for future research in cloud computing.
Article
Full-text available
Cloud gaming has rapidly transformed the gaming industry, allowing users to play games on demand from anywhere without the need for powerful hardware. Cloud service providers are striving to enhance user Quality of Experience (QoE) using traditional assessment methods. However, these traditional methods often fail to capture the actual user QoE because some users are not serious about providing feedback regarding cloud services. Additionally, some players, even after receiving services as per the Service Level Agreement (SLA), claim that they are not receiving services as promised. This poses a significant challenge for cloud service providers in accurately identifying QoE and improving actual services. In this paper, we have compared our previous proposed novel technique that utilizes a deep learning (DL) model to assess QoE through players’ facial expressions during cloud gaming sessions in a virtual reality (VR) environment. The EmotionNET model technique is based on a convolutional neural network (CNN) architecture. Later, we have compared the EmotionNET technique with three other DL techniques, namely ConvoNEXT, EfficientNET, and Vision Transformer (ViT). We trained the EmotionNET, ConvoNEXT, EfficientNET, and ViT model techniques on our custom-developed dataset, achieving 98.9% training accuracy and 87.8% validation accuracy with the EmotionNET model technique. Based on the training and comparison results, it is evident that the EmotionNET model technique predicts and performs better than the other model techniques. At the end, we have compared the EmotionNET results on two network (WiFi and mobile data) datasets. Our findings indicate that facial expressions are strongly correlated with QoE.
Conference Paper
This design handles the “Quality of Service” (QoS)/“Quality of Experience” (QoE) in transmedia ventures with fog computing services and agent technologies. A suitable deployment must check the service supplier QoE to the client and any connected hardware objectively and subjectively. Also, it lets partakers impart service feedback. The resulting structure upgrades services safeguarding QoS when a partaker does not get benefits during high-use hours and delivers improved service, thus retaining the QoE level throughout service consumption. Framework improvements for grander results in the hereafter have been scrutinized. Math is one of Brazil’s most scary disciplines in STEM education, yet it is plagued with problems. As a result, a big effort must do a noninvasive assessment, as evaluating transmedia sessions intimidates learners.
Chapter
Full-text available
The gaming industry is growing daily, and gaming traffic is now a significant part of internet traffic. Gaming is continuously moving from desktop to portable devices through cloud environments. Game development organizations are generating vast amounts of revenue through these games. PUBG is a world-renowned online game popular among all age groups, especially teenagers. People play games anywhere as per their choice and availability of time. In this research, we invite PUBG game players to play games in different physical environments, like while traveling by Car or through Public Transport and while staying at home or during a visit to the park. We assess Quality of Experience (QoE) through the International Telecommunication Union (ITU) scale. Data was collected through a survey form, and collected data was analyzed through SPSS. Two variables are the Game’s Speed and the Satisfaction Level with given physical environments. Results show that the QoE is high in the home physical environment over a WIFI internet connection for both variables. Researchers assess the user QoE in different environments, such as on Desktops, Mobile phones, or different machines of different configurations. Still, researchers have yet to work on the QoE of a game’s players who play the game in the physical environment. Users rate the game on Google Play Store. Currently, no method exists to check the user’s physical background. This research will help Game development organizations to look into the reason for a low rating due to players’ comfortable physical zone.
Article
Full-text available
This paper presents a novel web-based crowdsourcing platform for the assessment of the subjective and objective quality of experience (QoE) of the video service in the cloud-server environment. The user has the option to enter subjective QoE data for video service by filling out a web questionnaire. The objective QoE data of the cloud-server, network condition, and the user device is automatically captured by the crowdsourcing platform. Our proposed system collects both objective and subjective QoE simultaneously in real-time. The paper presents the key technologies used in the development of the platform and describes the functional requirements and design ideas of the system in detail. The system collects real-time comprehensive data to enhance the quality of the user experience to provide a valuable reference. The system is tested in a real-time environment and the test results are given in terms of the system performance. The crowdsourcing platform has new features of real-time network monitoring, the client device, and cloud monitoring, which currently has not been provided by existing web platforms and crowdsourcing frameworks. The results show that 1MB buffer is filled 100% very soon after starting watching videos from the crowdsourcing platform.
Article
Full-text available
Objectives: File fragment classification of video file formats is a topic of interest in network forensics. There are some publicly available datasets for file fragments of various file types such as textual, audio, and image file formats. However, there is no public dataset for file fragments of video file formats. So, in order to evaluate and compare the performance of the classification methods, a challenge is the need to have such datasets. Data description: In this study, we present a dataset that contains file fragments of 10 video file formats: 3GP, AVI, ASF, FLV, MKV, MOV, MP4, WebM, OGV, and RMVB. Corresponding to each format, the dataset contains the file fragments of video files with different video codec types: H.263, MPEG-4, WMV, H.264, FLV1, H.265, VP8, VP9, Theora, and RealVideo. Totally, 20 different pairs of video format and codec are employed. For each pair of video format and codec, 30,000 file fragments are provided. Totally, the dataset contains 600,000 file fragments.
Article
Full-text available
This paper explores the objective of the present video quality analysis (VQA) and measures the full reference metrics keeping in view the quality degradation. During the research work, we conduct experiments on different social clouds (SCs) and low-quality videos. Selected videos are uploaded to SC to assess differences in video service and quality. WeChat shows that the average of all videos (Avg = 100), peak signal-to-noise ratio (PSNR), has no impact on other indicators. Therefore, we believe that WeChat provides the best video quality and multimedia services to their users to meet Quality of Service (QoS)/Quality of Experience (QoE).
Article
Full-text available
Over the last few years, the evolution of network and user handsets’ technologies, have challenged the telecom industry and the Internet ecosystem. Especially, the unprecedented progress of multimedia streaming services like YouTube, Vimeo and DailyMotion resulted in an impressive demand growth and a significant need of Quality of Service (QoS) (e.g., high data rate, low latency/jitter, etc.). Mainly, numerous difficulties are to be considered while delivering a specific service, such as a strict QoS, human-centric features, massive number of devices, heterogeneous devices and networks, and uncontrollable environments. Thenceforth, the concept of Quality of Experience (QoE) is gaining visibility, and tremendous research efforts have been spent on improving and/or delivering reliable and added-value services, at a high user experience. In this paper, we present the importance of QoE in wireless and mobile networks (4G, 5G, and beyond), by providing standard definitions and the most important measurement methods developed. Moreover, we exhibit notable enhancements and controlling approaches proposed by researchers to meet the user expectation in terms of service experience.
Article
Full-text available
Audio data compression has revolutionised the way in which the music industry and musicians sell and distribute their products. Our previous research presented a novel codec named ACER (Audio Compression Exploiting Repetition), which achieves data reduction by exploiting irrelevancy and redundancy in musical structure whilst generally maintaining acceptable levels of noise and distortion in objective evaluations. However, previous work did not evaluate ACER using subjective listening tests, leaving a gap to demonstrate its applicability under human audio perception tests. In this paper, we present a double-blind listening test that was conducted with a range of listeners (N=100). The aim was to determine the efficacy of the ACER codec, in terms of perceptible noise and spatial distortion artefacts, against de facto standards for audio data compression and an uncompressed reference. Results show that participants reported no perceived differences between the uncompressed, MP3, AAC, ACER high quality, and ACER medium quality compressed audio in terms of noise and distortions but that the ACER low quality format was perceived as being of lower quality. However, in terms of participants’ perceptions of the stereo field, all formats under test performed as well as each other, with no statistically significant differences. A qualitative, thematic analysis of listeners’ feedback revealed that the noise artefacts that produced the ACER technique are different from those of comparator codecs, reflecting its novel approach. Results show that the quality of contemporary audio compression systems has reached a stage where their performance is perceived to be as good as uncompressed audio. The ACER format is able to compete as an alternative, with results showing a preference for the ACER medium quality versions over WAV, MP3, and AAC. The ACER process itself is viable on its own or in conjunction with techniques such as MP3 and AAC.
Chapter
Full-text available
The last 13 years or more has been characterized by unmatched progress in multimedia content and undercover communication. Sound file is chief among mode of transmitting classified and confidential information across the cyberspace and channels. Steganography is a traditional solution to address the problems of data security on public channels and transmitters. In general, the practice of image steganography favored smaller size secret message transmission in larger size carrier approach. However, there is a major of challenge of concealing classified large size audio data inside a smaller image cover with lesser distortions and quality retention of the cover medium and original message respectively. To achieve these, the audio file is compressed to appropriate size before embedding it into two distinct image files (grayscale and RGB color) at binary level using both Most Significant Bits (MSBs) and Least Significant Bits (LSBs) methods. This paper established that the most suitable of MSBs and LSBs in terms of robustness, payload size, and retention of perception transparency of media understudied.
Article
Full-text available
Video streaming applications currently dominate Internet traffic. Particularly, HTTP Adaptive Streaming (HAS) has emerged as the dominant standard for streaming videos over the best-effort Internet, thanks to its capability of matching the video quality to the available network resources. In HAS, the video client is equipped with a heuristic that dynamically decides the most suitable quality to stream the content, based on information such as the perceived network bandwidth or the video player buffer status. The goal of this heuristic is to optimize the quality as perceived by the user, the so-called Quality of Experience (QoE). Despite the many advantages brought by the adaptive streaming principle, optimizing users’ QoE is far from trivial. Current heuristics are still suboptimal when sudden bandwidth drops occur, especially in wireless environments, thus leading to freezes in the video playout, the main factor influencing users’ QoE. This issue is aggravated in case of live events, where the player buffer has to be kept as small as possible in order to reduce the playout delay between the user and the live signal. In light of the above, in recent years, several works have been proposed with the aim of extending the classical purely client-based structure of adaptive video streaming, in order to fully optimize users’ QoE. In this article, a survey is presented of research works on this topic together with a classification based on where the optimization takes place. This classification goes beyond client-based heuristics to investigate the usage of server- and network-assisted architectures and of new application and transport layer protocols. In addition, we outline the major challenges currently arising in the field of multimedia delivery, which are going to be of extreme relevance in future years.
Article
Full-text available
Video sharing on social clouds is popular among the users around the world. High-Definition (HD) videos have big file size so the storing in cloud storage and streaming of videos with high quality from cloud to the client are a big problem for service providers. Social clouds compress the videos to save storage and stream over slow networks to provide quality of service (QoS). Compression of video decreases the quality compared to original video and parameters are changed during the online play as well as after download. Degradation of video quality due to compression decreases the quality of experience (QoE) level of end users. To assess the QoE of video compression, we conducted subjective (QoE) experiments by uploading, sharing, and playing videos from social clouds. Three popular social clouds, Facebook, Tumblr, and Twitter, were selected to upload and play videos online for users. The QoE was recorded by using questionnaire given to users to provide their experience about the video quality they perceive. Results show that Facebook and Twitter compressed HD videos more as compared to other clouds. However, Facebook gives a better quality of compressed videos compared to Twitter. Therefore, users assigned low ratings for Twitter for online video quality compared to Tumblr that provided high-quality online play of videos with less compression.
Chapter
This chapter shows an example implementation of an LMS predictive lossless encoder and decoder, using the tools of the previous chapters, and compression tests with audio files.
Chapter
Audio compression history detection is a significant part of audio forensics, which is important to detect whether audio has been tampered or forged. When the structure of AAC audio frame is destroyed, its spectral coefficient distribution is similar to that of audio after the first compression. An algorithm of AAC audio compression detection was presented by using the statistical characteristics of QMDCT coefficients before and after removal of sampling points as the discriminative feature. Experimental results demonstrate that the proposed method can distinguish the single, double compressed AAC audios effectively, and from the low-bit-rate transcoding to high-bit-rate, the average classification accuracy achieves 99.84%, the same-bit-rate compression detection accuracy achieves 98.60%. In addition, the results of comparison experiments show that our algorithm outperforms the state-of-the-art algorithm.