W.A.C. Fernando

University of Surrey, Guilford, England, United Kingdom

Are you W.A.C. Fernando?

Claim your profile

Publications (161)70.02 Total impact

  • [Show abstract] [Hide abstract] ABSTRACT: Demand on multimedia content by consumers' handheld devices over wireless channels is on the increase. In view of the accelerated trend towards consumption of high quality video, power utilization by mobile devices is expected to excessively increase. Hence, it becomes equally important to advance more efficient power minimization techniques, in light of the short battery life in portable devices. However, power minimization algorithms that adopt consumers' perceptual quality of video have not received adequate research. This paper proposes a joint optimization of energy and quality requirements in a multiuser orthogonal frequencydivision multiplexing environment. A multi-objective optimization problem is formulated with the aim to identify bitrate allocations among users such that total power is minimized, and average quality is maximized. For this, a content-aware and energy-efficient resource allocation scheme (CaERAS) is proposed based on genetic and greedy algorithms. Simulation results show that CaERAS, as a lowcomplexity scheme, outperforms comparable methods in terms of efficiency and selectivity of suboptimal solutions. It is shown to acquire a suboptimal solution in as low as 0.0025 of the search space in previous methods. Also, a significant average saving of 85.66% in required energy is observed in broadcast transmission as opposed to unicast transmission.
    No preview · Conference Paper · Aug 2014
  • [Show abstract] [Hide abstract] ABSTRACT: For OFDM channels, we propose a content-aware subcarrier allocation scheme for multiuser video transmission that maximizes user perceived quality while minimizing power consumption. The scheme utilizes utility function-based genetic algorithms to solve the multi-objective optimization problem. It significantly outperforms existing methods in terms of perceived quality and consumed power.
    No preview · Conference Paper · Jan 2014
  • H. Malekmohamadi · W. A. C. Fernando · E. Danish · A. M. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: A neural network based technique is proposed to estimate subjective quality of stereoscopic videos. Moreover, to utilize this model for applications where availability of reference signal is not possible to receiver, it applies objective quality of video with minimum dependency on reference signal. This paper presents fast, accurate and consistent subjective quality estimation. Feasibility and accuracy of the proposed technique is thoroughly analyzed with extensive subjective experiments and simulations. Results illustrate that performance measure of 92.3% in subjective quality estimation can be achieved with the proposed technique.
    No preview · Conference Paper · Jan 2014
  • Zhiru Shi · W.A.C. Fernando · A.M. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: The H.264/AVC coding standard has achieved significant coding gain by using flexible block sizes inter prediction. However, the complexity of inter-mode decision is extremely high. In this paper, a fast inter-mode decision algorithm is presented by exploiting residual homogeneity for both macroblock and sub-macroblock levels. In this algorithm, the residual homogeneity indicators calculated in three directions are used to identify the block's homogeneous characteristic. Normalized motion vector activity is also considered to improve the performance. The experimental results illustrate that the proposed algorithm achieves about 49% of computational complexity saving in terms of encoding time, while with negligible quality degradation, compared to H.264/AVC reference software JM.
    No preview · Conference Paper · Oct 2013
  • H. Malekmohamadi · W.A.C. Fernando · A.M. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: A new reduced reference objective quality metric for stereoscopic video is proposed that incorporates spatial neighbouring information from grey-level co-occurrence matrices and edge properties. Extracted colour and depth features are combined unequally to maximise the performance of the proposed metric. The performance of this metric is validated through a series of subjective tests and the average correlation 0.82 is achieved when colour to depth importance ratio is near 0.80.
    No preview · Article · May 2013 · Electronics Letters
  • Zhiru Shi · W.A.C. Fernando · A. Kondoz
    No preview · Chapter · Oct 2012
  • Source
    Zhiru Shi · W.A.C. Fernando · A.M. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: The video coding standard H.264/AVC although possesses the advantage of higher coding efficiency compared to previous ones, it would result in a high computational complexity due to various block sizes motion estimation for multi-modes decision. In this paper, a hybrid inter-mode decision algorithm is presented, combining coded block pattern (CBP) with motion activity and Rate-Distortion (RD) cost. In this algorithm, CBP and CBP4×4, which indicate none-zero coefficient block, is used to determine candidate modes at macroblock level and sub-macroblock level. Further, the normalized motion activity is used to identify the homogeneity of the block and target the candidate modes set more accurate. An early termination is also made between macroblock level and sub-macroblock level by RD cost comparing. The experimental results show that the proposed algorithm achieves approximately 60% of computational complexity saving in terms of encoding time, while with negligible quality degradation, compared to the conventional method in H.264/AVC.
    Preview · Conference Paper · Jul 2012
  • Source
    R.M.T.P. Rajakaruna · W.A.C. Fernando · J. Calic
    [Show abstract] [Hide abstract] ABSTRACT: Compressed-domain content analysis enables fast content-based video applications. However, conventional encoder implementation, limited to optimising data compression, does not necessarily result in content representative compressed features. In this paper, we present a novel method for video encoder optimisation that enhances reliability of content representation in the compressed video. We propose a configurable encoder model to jointly optimise bit-rate, distortion and content representation of the encoded video. In encoder optimisation for motion estimation, we analyse the behaviour of motion representation and compression efficiency over a range of content complexities. A mathematical model to control the extent of noise in selected motion vectors is presented. We demonstrate that the compressed domain motion information can be enhanced without incurring a rate-distortion overhead. The computational overheads are marginal and can be eliminated by exploiting the inherently parallel nature of the proposed optimisation model.
    Full-text · Conference Paper · Jul 2012
  • Zhiru Shi · W.A.C. Fernando · A. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: The new video coding standard, H.264/AVC, has achieved significant compression gain by utilizing several advanced techniques. Motion estimation (ME) is one of the most important elements to reduce the motion residual. However it results in heavy computational burden. In this paper, a novel fast ME search algorithm, which exposits the correlation character between predicted and optimal motion vectors (MVs), is presented for H.264/AVC. According to motion vector prediction, search window is adaptively divided into several regions with calculation of its motion correlation probability. Specifically, it combines MVs' direction correlation and locality correlation. A novel Priority Region List, which indicates search process and selects the number of search points, is dynamically updated. The simulation results show that the proposed algorithm reduces 23% of motion estimation encoding time and 60% of average search points while maintaining the same RD performance compared with that of UMHexagonS algorithm, which is adopted in H.264 reference software.
    No preview · Conference Paper · May 2012
  • Source
    [Show abstract] [Hide abstract] ABSTRACT: In this paper, the sensitivity of the HVS towards binocular disparity depth cue, which is the most important cue in modern stereoscopic 3D displays, is investigated under different ambient illumination conditions. The experimental results indicate that as ambient illumination increases the sensitivity of the HVS for depth details increases or the Just Noticeable Difference in Depth (JNDD) decreases. It is expected that the investigation results will have important use cases in designing D display setups and 3D content production.
    Full-text · Article · Jan 2012
  • [Show abstract] [Hide abstract] ABSTRACT: Wyner-Ziv video coding presents a new paradigm which offers low-complexity video encoding. However, the Wyner-Ziv paradigm accumulates high complexity at the decoder side and this could involve difficulties for applications which have delay requisites. On the other hand, technological advances provide us with new hardware which supports parallel data processing. In this paper, a faster Wyner-Ziv video decoding scheme based on multicore processors is proposed. In this way, each frame is decoded by means of the collaboration between several processing units, achieving a time reduction up to 71% without significant rate-distortion drop penalty.
    No preview · Conference Paper · Oct 2011
  • Source
    [Show abstract] [Hide abstract] ABSTRACT: Interest in 3D video has surged in recent years. However, efforts to improve the quality of compression and transmission schemes are severely hampered by a lack of effective quality evaluation metrics. This is a particularly severe problem for researchers trying to improve the robustness of video transmission to packet loss. Subjective tests for evaluating error robustness present huge requirements in terms of time and resources. To solve this problem, this paper presents a quality metric for 3D video, and evaluates its effectiveness for the measurement of quality in the presence of packet loss. A key feature of the work is the use of depth planes to enable the metric to better model how the Human Visual System (HVS) perceives 3D video. The quality metric results are compared with subjective test results. The correlation between the proposed quality metric and the subjective test results is shown to be stronger than standard quality metrics, such as Video Quality Metric (VQM).
    Full-text · Conference Paper · Aug 2011
  • Zhiru Shi · W. A. C. Fernando · Ahme Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: The new video coding standard, H.264/AVC, has achieved significant compression gain by utilizing several advanced techniques. Block-Matching Motion Estimation is one of the most important elements to reduce the motion residual. However, it results in heavy computational burden and limits the application for real-time video service. In this paper, a fast motion estimation algorithm called Simulated Annealing Adaptive Search (SAAS) is proposed to reduce the computational load. The basic idea of the proposed scheme is based on adjusting search pattern not only for each frame, but also for each block. Initially, the adaptive search pattern is performed by statistical analysis of previous frame's Motion Vector Correlation. Then the search pattern is adjusted for each block according to Predicted Motion Vector. According to motion vector correlation statistics information, search region is adaptively divided and Simulated Annealing (SA) mechanism is adopted to select search power for each region and to avoid trapping into local minima. Experimental results indicate that the proposed algorithm offers considerable improvement in computing time and motion search points at the same rate-distortion performance, compared to the conventional fast motion estimation algorithm. 1
    No preview · Article · Aug 2011 · IEEE Transactions on Consumer Electronics
  • R. M. T. P. Rajakaruna · W. A. C. Fernando · J. Calic
    [Show abstract] [Hide abstract] ABSTRACT: Performance of real-time video processing appli- cations such as surveillance systems, content-based search, is limited by the complexity of video content analysis in the pixel domain. A low complex alternative is to analyse the video in the compressed domain, where content features already available in the compressed video are directly used in the analysis. However, this is achieved at the expense of output precision and reliability, due to compression-efficiency driven feature selection at the encoder. Therefore, video applications could benefit from enhanced reliability of data embedded in the compressed video. In this paper, we present a scalable optimization model that addresses the accuracy of content features in parallel with the conventional rate-distortion optimization criterion. We analyse and optimize rate-distortion performance of video encoder under content description accuracy constrain, using a motion calibrated synthetic data set containing a range of scene and motion complexity levels. Finally, using a natural video data set, we demonstrate that the proposed optimization framework can be used to enhance compressed feature accuracy without incurring a rate-distortion overhead.
    No preview · Article · Aug 2011
  • Source
    R. M. T. P. Rajakaruna · W. A. C. Fernando · J. Calic
    [Show abstract] [Hide abstract] ABSTRACT: The proliferation of video consumption, especially over mobile devices, has created a demand for efficient inter- active video applications and high-level video analysis. This is particularly significant in real-time applications and resource- limited scenarios. Pixel-domain video processing is often inef- ficient for many of these applications due to its complexity, whereas compressed domain processing offer fast but unreliable results. In order to achieve fast and effective video processing, this paper proposes a novel video encoding architecture that facilitate efficient compressed domain processing, while maintaining com- pliance with the mainstream coding standards. This is achieved by optimizing the accuracy of motion information embedded in the compressed video, in addition to compression efficiency. In a motion detection application, we demonstrate that the motion estimated by the proposed encoder can be directly used to extract object information, as opposed to conventionally coded video. The incurred rate distortion overheads can be weighed against the reduced processing required for video analysis targeting a wide spectrum of computer vision applications.
    Full-text · Article · Aug 2011
  • Source
    D.V.S.X. De Silva · E. Ekmekcioglu · W.A.C. Fernando · S.T. Worrall
    [Show abstract] [Hide abstract] ABSTRACT: This paper addresses the sensitivity of human vision to spatial depth variations in a 3-D video scene, seen on a stereoscopic display, based on an experimental derivation of a just noticeable depth difference (JNDD) model. The main target is to exploit the depth perception sensitivity of humans in suppressing the unnecessary spatial depth details, hence reducing the transmission overhead allocated to depth maps. Based on the JNDD model derived, depth map sequences are preprocessed to suppress the depth details that are not perceivable by the viewers and to minimize the rendering artefacts that arise due to optical noise, where the optical noise is triggered by the inaccuracies in the depth estimation process. Theoretical and experimental evidences are provided to illustrate that the proposed depth adaptive preprocessing filter does not alter the 3-D visual quality or the view synthesis quality for free-viewpoint video applications. Experimental results suggest that the bit rate for depth map coding can be reduced up to 78% for the depth maps captured with depth-range cameras and up to 24% for the depth maps estimated with computer vision algorithms, without affecting the 3-D visual quality or the arbitrary view synthesis quality.
    Full-text · Article · May 2011 · IEEE Journal of Selected Topics in Signal Processing
  • D. V. S. X. De Silva · W. A. C. Fernando · S. T. Worrall · A. M. Kondoz
    [Show abstract] [Hide abstract] ABSTRACT: While the depth maps of D video are represented as luminance images, they are used to aid rendering of novel views and are not viewed by an end user. Therefore, metrics that measure the quality of images that are for end user viewing does not necessarily reflect the quality of depth maps in terms of its ability to render views. This paper investigates the relationship between the quality of the rendered views and different quality measures of the depth map. A novel depth map quality metric is proposed based on a distortion model that approximates rendering errors due to pixel errors in the depth map. The proposed depth map quality metric correlates very well with the quality of the rendered views, as compared to the PSNR and SSIM of the depth map. The application of the proposed depth map quality metric is further illustrated by incorporating the metric at the encoding mode selection stage of a video encoder. Experimental results suggest that with the proposed encoding mode selection scheme bit rate savings of up to 30% can be achieved compared to traditional encoding mode selection scheme based on sum of squared errors.
    No preview · Article · May 2011
  • Source
    D. Varuna S. X. De Silva · Erhan Ekmekcioglu · Warnakulasuriya Anil Chandana Fernando · Stewart T. Worrall
    Full-text · Article · Jan 2011
  • [Show abstract] [Hide abstract] ABSTRACT: Depth maps are used for rendering novel views in 3-Dimensional (3D) Television systems. When depth maps are compressed using existing codecs, the compression artifacts will cause undesirable distortions in the rendered views. This paper proposes an adaptive bilateral filtering technique to eliminate such artifacts at the receiver end. The experimental results demonstrate that the proposed method significantly improves the quality of rendered views up to 1.5dB, with minimal increase in complexity.
    No preview · Article · Jan 2011
  • [Show abstract] [Hide abstract] ABSTRACT: Depth maps, which can be represented as greyscale images, are used to aid rendering of novel views in three-dimensional (3D) video systems. However, compressing them using existing video codecs, such as H.264/AVC, leads to low quality rendered views. Presented is a sharpening method based on adaptive bilateral filtering to eliminate certain artifacts observed in compressed depth maps to improve the quality of rendered views. Experimental results demonstrate that significant rendering quality improvements of up to 1.9 dB can be achieved with the proposed method.
    No preview · Article · Dec 2010 · Electronics Letters

Publication Stats

1k Citations
70.02 Total Impact Points


  • 2007-2014
    • University of Surrey
      • • Department of Electronic Engineering
      • • Centre for Vision, Speech and Signal Processing (CVSSP)
      • • Faculty of Engineering and Physical Sciences
      • • Centre for Communication Systems Research (CCSR)
      Guilford, England, United Kingdom
  • 2011
    • Multimedia Communications Research Laboratory
      Ottawa, Ontario, Canada
  • 2008
    • University of Castilla-La Mancha
      • Instituto de Investigación en Informática de Albacete
      Ciudad Real, Castille-La Mancha, Spain
  • 2006-2007
    • Brunel University London
      • • School of Engineering and Design
      • • Department of Electronic and Computer Engineering
      अक्सब्रिज, England, United Kingdom
  • 2003
    • University of Saskatchewan
      • Department of Electrical and Computer Engineering
      Saskatoon, Saskatchewan, Canada
  • 1998-2002
    • Asian Institute of Technology
      • School of Engineering and Technology (SET)
      Krung Thep, Bangkok, Thailand
  • 2001
    • University of Bristol
      • Department of Electrical and Electronic Engineering
      Bristol, ENG, United Kingdom