Marios S Pattichis

Marios S Pattichis
University of New Mexico | UNM · Department of Electrical and Computer Engineering

Professor
My research is currently focused on the development of video processing, communications, and architecture methods.

About

388
Publications
72,300
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
5,122
Citations
Citations since 2017
67 Research Items
2036 Citations
20172018201920202021202220230100200300
20172018201920202021202220230100200300
20172018201920202021202220230100200300
20172018201920202021202220230100200300
Additional affiliations
September 2003 - February 2005
University of Cyprus
Position
  • Visiting Assistant Professor
August 1999 - July 2022
University of New Mexico
Position
  • Professor
Description
  • ivpcl.unm.edu
September 1998 - July 1999
Washington State University

Publications

Publications (388)
Article
Full-text available
Machine learning promises to assist physicians with predictions of mortality and of other future clinical events by learning complex patterns from historical data, such as longitudinal electronic health records. Here we show that a convolutional neural network trained on raw pixel data in 812,278 echocardiographic videos from 34,362 individuals pro...
Article
Full-text available
The manuscript describes fast and scalable architecturesand associated algorithms for computing convolutions and cross-correlations. The basic idea is to map 2D convolutions and cross-correlations to a collection of 1D convolutions and cross-correlations in the transform domain. This is accomplished through the use of the Discrete Periodic Radon Tr...
Preprint
Full-text available
By 2022, we expect video traffic to reach 82% of the total internet traffic. Undoubtedly, the abundance of video-driven applications will likely lead internet video traffic percentage to a further increase in the near future, enabled by associate advances in video devices' capabilities. In response to this ever-growing demand, the Alliance for Open...
Article
Medical image analysis methods require the use of effective representations for differentiating between lesions, diseased regions, and normal structure. Amplitude Modulation - Frequency Modulation (AM-FM) models provide effective representations through physically meaningful descriptors of complex non-stationary structures that can differentiate be...
Article
Full-text available
In this paper, we propose the use of multiscale amplitude-modulation-frequency-modulation (AM-FM) methods for discriminating between normal and pathological retinal images. The method presented in this paper is tested using standard images from the early treatment diabetic retinopathy study. We use 120 regions of 40 x 40 pixels containing four type...
Conference Paper
The objective of this work was the investigation of multiscale Amplitude Modulation-Frequency Modulation (AM-FM) analysis based on Difference of Gaussians (DoG) filterbanks representations in order to predict the risk of stroke by analysing carotid plaques ultrasound images of individuals with asymp-tomatic carotid stenosis. We computed the instant...
Article
The papers in this special section focus on large-scale medical imaging and video analytics for clinical decision support systems. Biomedical images and videos are ubiquitous and overwhelming in volume, amounting to a database that can be measured in zettabytes.With increased access to open image and video datasets and the recent development of eff...
Article
Full-text available
italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Background : Monitoring disease evolution in Multiple sclerosis (MS) subjects may aid in decision making for personalizing treatment and disease evolution prediction. We investigate the use of disability progression, using clinical features, the expand...
Article
Full-text available
This review of our experience in computer-assisted tissue image analysis (CATIA) research shows that significant information can be extracted and used to diagnose and distinguish normal from abnormal endometrium. CATIA enabled the evaluation and differentiation between the benign and malignant endometrium during diagnostic hysteroscopy. The efficac...
Conference Paper
Full-text available
The present study proposes an adaptive video delivery methodology that enhances quality of experience (QoE) and aids towards the wider adoption of wireless medical video communication systems in standard clinical practice.
Conference Paper
At present, there is no established set of ultrasonic features that can identify all the potentially unstable and high risk atherosclerotic carotid plaques in asymptomatic patients. The degree of stenosis is still the main criterion used to decide whether carotid endarterectomy is needed, but it has now been shown to be inaccurate. The overall obje...
Patent
Full-text available
SINGLE -STAGE HARDWARE SORTING BLOCKS AND ASSOCIATED MULTIWAY MERGE SORTING NETWORK
Preprint
Full-text available
Large-scale training of Convolutional Neural Networks (CNN) is extremely demanding in terms of computational resources. Also, for specific applications, the standard use of transfer learning also tends to require far more resources than what may be needed. This work examines the impact of using AM-FM representations as input images for CNN classifi...
Article
Background / Context: After-school programs that focus on integrating computer programming and mathematics in authentic environments are seldomly accessible to students from culturally and linguistically diverse students, in particular bilingual Latina students in rural contexts. Providing a context that broadens Latina students’ participation in m...
Article
Background / Context: Computer Programming is rarely accessible to K-12 students, especially for those from culturally and linguistically diverse backgrounds. Middle school age is a transitioning time when adolescents are more likely to make long-term decisions on their academic choices and interests. Having access to productive and positive knowle...
Preprint
Full-text available
p>In hardware such as FPGAs, Kenneth Batcher’s Odd-Even Merge Sort and Bitonic Merge Sort are the state-of-the-art methodologies used to quickly sort a list of more than 16 input values. Both sorting networks feature merges of 2 sorted input lists into a single sorted output list. For both, a full sort of 64 and 512 input values requires 21 and...
Preprint
Full-text available
p>In hardware such as FPGAs, Kenneth Batcher’s Odd-Even Merge Sort and Bitonic Merge Sort are the state-of-the-art methodologies used to quickly sort a list of more than 16 input values. Both sorting networks feature merges of 2 sorted input lists into a single sorted output list. For both, a full sort of 64 and 512 input values requires 21 and...
Preprint
Full-text available
We have recently introduced a general system for building fast single-stage hardware N-sorters and N-filters, with N>=3. When N-sorter/N-filters designed with this system are constructed using a design logic block in the FPGA central to the Amazon AWS EC2 F1 instance, they were shown to be significantly faster than the prior state-of-the-art sortin...
Preprint
Full-text available
The paper presents the results from a multi-year effort to develop and validate image processing methods for selecting the best physical models based on solar image observations. The approach consists of selecting the physical models based on their agreement with coronal holes extracted from the images. Ultimately, the goal is to use physical model...
Article
Full-text available
Speaker diarization refers to methods for identifying speakers from audio recordings. An important application comes from the need to assess student interactions in collaborative learning environments. Diarization is very difficult in these environments where a single microphone is used to record multiple voices. Although there have been advancemen...
Article
Full-text available
In hardware such as FPGAs, Kenneth Batcher’s Odd-Even Merge Sort and Bitonic Merge Sort are the state-of-the-art methodologies used to quickly sort a list of more than 16 input values. Both sorting networks feature merges of 2 sorted input lists into a single sorted output list. For both, a full sort of 64 and 512 input values requires 21 and 45 se...
Article
Full-text available
The authors’ recently published design system for the creation of single-stage N-sorter/N-filter sorting devices, which were implemented using a particular example hardware block, is here expanded and applied to a second hardware type, FPGA carry chain logic. Although several researchers have published applications which use FPGA carry chain logic,...
Preprint
Full-text available
Speech recognition is very challenging in student learning environments that are characterized by significant cross-talk and background noise. To address this problem, we present a bilingual speech recognition system that uses an interactive video analysis system to estimate the 3D speaker geometry for realistic audio simulations. We demonstrate th...
Preprint
Full-text available
The Discrete Periodic Radon Transform (DPRT) has been extensively used in applications that involve image reconstructions from projections. This manuscript introduces a fast and scalable approach for computing the forward and inverse DPRT that is based on the use of: (i) a parallel array of fixed-point adder trees, (ii) circular shift registers to...
Preprint
Full-text available
The manuscript describes fast and scalable architectures and associated algorithms for computing convolutions and cross-correlations. The basic idea is to map 2D convolutions and cross-correlations to a collection of 1D convolutions and cross-correlations in the transform domain. This is accomplished through the use of the Discrete Periodic Radon T...
Preprint
Full-text available
We introduce the problem of detecting a group of students from classroom videos. The problem requires the detection of students from different angles and the separation of the group from other groups in long videos (one to one and a half hours). We use multiple image representations to solve the problem. We use FM components to separate each group...
Chapter
Face recognition in collaborative learning videos presents many challenges. In collaborative learning videos, students sit around a typical table at different positions to the recording camera, come and go, move around, get partially or fully occluded. Furthermore, the videos tend to be very long, requiring the development of fast and accurate meth...
Chapter
We study the problem of detecting talking activities in collaborative learning videos. Our approach uses head detection and projections of the log-magnitude of optical flow vectors to reduce the problem to a simple classification of small projection images without the need for training complex, 3-D activity classification systems. The small project...
Chapter
Long-term object detection requires the integration of frame-based results over several seconds. For non-deformable objects, long-term detection is often addressed using object detection followed by video tracking. Unfortunately, tracking is inapplicable to objects that undergo dramatic changes in appearance from frame to frame. As a related exampl...
Chapter
Speech recognition is very challenging in student learning environments that are characterized by significant cross-talk and background noise. To address this problem, we present a bilingual speech recognition system that uses an interactive video analysis system to estimate the 3D speaker geometry for realistic audio simulations. We demonstrate th...
Preprint
Full-text available
Face recognition in collaborative learning videos presents many challenges. In collaborative learning videos, students sit around a typical table at different positions to the recording camera, come and go, move around, get partially or fully occluded. Furthermore, the videos tend to be very long, requiring the development of fast and accurate meth...
Preprint
Full-text available
We study the problem of detecting talking activities in collaborative learning videos. Our approach uses head detection and projections of the log-magnitude of optical flow vectors to reduce the problem to a simple classification of small projection images without the need for training complex, 3-D activity classification systems. The small project...
Preprint
Full-text available
Long-term object detection requires the integration of frame-based results over several seconds. For non-deformable objects, long-term detection is often addressed using object detection followed by video tracking. Unfortunately, tracking is inapplicable to objects that undergo dramatic changes in appearance from frame to frame. As a related exampl...
Patent
Full-text available
System and methods for the joint control of reconstructed video quality , computational complexity and compression rate for intra-mode and inter-mode video encoding in HEVC. The invention provides effective methods for (i) generating a Pareto front for intra-coding by varying CTU parameters and the QP , (ii) generating a Pareto front for inter-codi...
Article
Full-text available
By 2022, we expect video traffic to reach 82% of the total internet traffic. Undoubtedly, the abundance of video-driven applications will likely lead internet video traffic percentage to a further increase in the near future, enabled by associate advances in video devices' capabilities. In response to this ever-growing demand, the Alliance for Open...
Article
Full-text available
There is strong interest in developing high-performance hardware sorting systems which can sort a set of elements as quickly as possible. The fastest of the current FPGA systems are sorting networks, in which sets of 2-sorters operate in parallel in each series stage of a multi-stage sorting process. A 2-sorter is a single-stage hardware block whic...
Article
Unsupervised latent variable models—blind source separation (BSS) especially—enjoy a strong reputation for their interpretability. But they seldom combine the rich diversity of information available in multiple datasets, even though multidatasets yield insightful joint solutions otherwise unavailable in isolation. We present a direct, principled ap...
Conference Paper
Full-text available
The prevalence of video driven applications, leveraging over the top video on demand services as well as live video streaming applications, dominate network traffic over today’s internet landscape. As such, they necessitate efficient video compression methods to accommodate the desired quality of service and hence user experience. In this study, we...
Article
Full-text available
Video compression is the core technology in mobile (mHealth) and electronic (eHealth) health video streaming applications. With global video traffic projected to reach 82% of all Internet traffic by 2022, there is a strong need to develop efficient compression algorithms to accommodate expected future growth. For the first time in decades, and espe...
Preprint
In the last two decades, unsupervised latent variable models---blind source separation (BSS) especially---have enjoyed a strong reputation for the interpretable features they produce. Seldom do these models combine the rich diversity of information available in multiple datasets. Multidatasets, on the other hand, yield joint solutions otherwise una...
Article
Full-text available
The paper presents the results from a multi-year effort to develop and validate image processing methods for selecting the best physical models based on solar image observations. The approach consists of selecting the physical models based on their agreement with coronal holes extracted from the images. Ultimately, the goal is to use physical model...
Chapter
Full-text available
Video compression is the core technology in mobile (mHealth) and electronic (eHealth) health video streaming applications. With global video traffic projected to reach 82 % of all internet traffic by 2022, both industry and academia are struggling to develop efficient compression algorithms to match these unprecedented needs. To the best of our kno...
Article
Full-text available
Precision medicine promises better healthcare delivery by improving clinical practice. Using evidence-based sub-stratification of patients, the objective is to achieve better prognosis, diagnosis, and treatment that will transform existing clinical pathways towards optimizing care for the specific needs of each patient. The wealth of today's health...
Conference Paper
This study investigates the compression efficiency of well established (H.265 and VP9), recently standardized (AV1), and emerging (VVC) video codecs and their applicability in the healthcare domain. Preliminary results using an ultrasound video dataset show that VVC achieves the best encoding performance, closely followed by AV1 and HM. To the best...
Chapter
Full-text available
Whilst Science, Technology, Engineering and Mathematics (STEM) interdisciplinary teaching and learning in the USA K-12 education still needs greater promotion, middle school students demonstrated that they can, using low-cost, single board computers that promote the teaching of computer science (in this case Raspberry Pis), successfully engage with...
Article
Full-text available
Over the solar-activity cycle, there are extended periods where significant discrepancies occur between the spacecraft-observed total (unsigned) open magnetic flux and that determined from coronal models. In this article, the total open heliospheric magnetic flux is computed using two different methods and then compared with results obtained from i...
Preprint
Full-text available
We present an interpretable neural network for predicting an important clinical outcome (1-year mortality) from multi-modal Electronic Health Record (EHR) data. Our approach builds on prior multi-modal machine learning models by now enabling visualization of how individual factors contribute to the overall outcome risk, assuming other factors remai...
Article
Full-text available
Cyprus’s goal is to deliver high quality cross border healthcare for both its citizens living abroad as well as visiting citizens from other EU Member States.
Conference Paper
Full-text available
This study proposes an adaptive video encoding framework based on multi-objective optimization that jointly maximizes the encoded video’s (clinical) quality and encoding rate (in frames per second) while minimizing bitrate demands.
Article
Full-text available
The wider adoption of mobile Health (mHealth) video communication systems in standard clinical practice requires real-time control to provide for adequate levels of clinical video quality to support reliable diagnosis. The latter can only be achieved with real-time adaptation to time-varying wireless networks’ state to guarantee clinically acceptab...
Conference Paper
We present a framework for adaptive video encoding based on video content. The basic idea is to analyze the video to determine camera activity (tracking, stationary, or zooming) and then associate each activity with adaptive video quality constraints. We demonstrate our approach on the UT LIVE video quality assessment database. We show that effecti...
Poster
We present a framework for adaptive video encoding based on video content. The basic idea is to analyze the video to determine camera activity (tracking, stationary, or zooming) and then associate each activity with adaptive video quality constraints. We demonstrate our approach on the UT LIVE video quality assessment database. We show that effecti...
Conference Paper
Full-text available
We present a framework for adaptive video encoding based on video content. The basic idea is to analyze the video to determine camera activity (tracking, stationary, or zooming) and then associate each activity with adaptive video quality constraints. We demonstrate our approach on the UT LIVE video quality assessment database. We show that effecti...
Article
Full-text available
This paper presents an electronic registry system for the purposes of the rare congenital conditions that require lifelong follow up and treatment. The main objective of the eRegistry focuses on the prevention of major rare anaemias (RAs) by facilitating the access, at a European level, to the best genetic counselling, diagnosis and clinical manage...
Conference Paper
Full-text available
There is a growing need for developing fast and scalable methods for storing and processing large databases of healthcare images and videos. The paper reviews current medical image analysis techniques and the recent emergence, promise, and challenges associated with large scale video analysis methods. Furthermore, the paper describes large-scale vi...
Conference Paper
The paper proposes an open-source, maintainable system for detecting human activity in video datasets using scalable hardware architectures. The system is validated by detecting writing and typing activities that were collected as part of the Advancing Out of School Learning in Mathematics and Engineering (AOLME) project. The implementation of the...
Conference Paper
We present a new approach for motion estimation from digital videos based on the use of 2D amplitude-modulation frequency-modulation (AM-FM) models. The proposed approach uses an AM-FM representation to derive AM and FM based equations that can be applied to two consecutive frames to derive motion estimates. We test the proposed method using comple...
Article
Full-text available
This study proposes an end-to-end mobile tele-echography platform using a portable robot for remote cardiac ultrasonography. Performance evaluation investigates the capacity of LTE wireless networks to facilitate responsive robot tele-manipulation and real-time ultrasound video streaming that qualifies for clinical practice. Within this context, a...
Article
Full-text available
In the past decade, numerous advances in the study of the human brain were fostered by successful applications of blind source separation (BSS) methods to a wide range of imaging modalities. The main focus has been on extracting “networks” represented as the underlying latent sources. While the broad success in learning latent representations from...
Conference Paper
Full-text available
The wider adoption of mHealth video communication systems in standard clinical practice requires adequate levels of clinical video quality to support reliable diagnosis. The latter dictates that real-time adaptation to time-varying wireless networks’ state to guarantee clinically acceptable performance throughout the streaming session, while confor...