Emma Beauxis-Aussalet

Emma Beauxis-Aussalet
Vrije Universiteit Amsterdam | VU · Department of Computer Science

PhD
Assistant Professor of Ethical Computing (VU), Lab Manager (Civic AI Lab)

About

43
Publications
7,982
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
148
Citations
Additional affiliations
February 2011 - present
Centrum Wiskunde & Informatica
Position
  • PhD Student

Publications

Publications (43)
Conference Paper
Full-text available
Classifiers can provide counts of items per class, but systematic classification errors yield biases (e.g., if a class is often misclassified as another, its size may be under-estimated). To handle classification biases, the statistics and epidemiology domains devised methods for estimating unbiased class sizes (or class probabilities) without iden...
Thesis
Full-text available
Handling classification uncertainty is a crucial challenge for supporting efficient and ethical classification systems. This thesis addresses uncertainty issues from the perspective of end-users with limited expertise in machine learning. We focus on uncertainties that pertain to estimating class sizes, i.e., numbers of objects per class. We aim at...
Article
Full-text available
Classifiers are applied in many domains where classification errors have significant implications. However, end-users may not always understand the errors and their impact, as error visualizations are typically designed for experts and for improving classifiers. We discuss the specific needs of classifiers' end-users and a simplified visualization,...
Conference Paper
Full-text available
[Problem Introduction paper accepted at ICML 2019 workshop on AI for Social Good - https://aiforsocialgood.github.io/icml2019/index.htm] Abusive online behaviors occur at a large scale on all social media, and have dire consequences for their victims. Although the problem is largely acknowledged , technological solutions remain limited to detecting...
Presentation
Full-text available
Keynote at Amsterdam Data Science
Article
The increasing use of artificial intelligence (AI) technologies across application domains has prompted our society to pay closer attention to AI's trustworthiness, fairness, interpretability, and accountability. In order to foster trust in AI, it is important to consider the potential of interactive visualization, and how such visualizations help...
Conference Paper
Full-text available
Machine learning systems can make more errors for certain populations and not others, and thus create discriminations. To assess such fairness issue, errors are typically compared across populations. We argue that we also need to account for the variability of errors in practice , as the errors measured in test data may not be exactly the same in r...
Presentation
Full-text available
Recommender systems greatly impact the scope of information we are exposed to. For critical topics like the climate crisis, the filter bubbles and echo chambers of recommendation systems can shape our opinion, and perhaps our future. Recommendation systems can become instruments of social engineering, in ways we might not wish, nor control. We will...
Presentation
Full-text available
Tips for future PhD students, presented at TECH020 an event of Amsterdam Data Science.
Presentation
Full-text available
What is classification bias? How to measure it? How to visualize it? ...and how to make it understandable.
Article
Classifiers are applied in many domains where classification errors have significant implications. However, end-users may not always understand the errors and their impact, as error visualizations are typically designed for experts and for improving classifiers. We discuss the specific needs of classifiers' end-users, and a simplified visualization...
Thesis
Full-text available
Presentation
Full-text available
Re-submission of the eponym paper published at DSAA 2017
Presentation
Full-text available
We designed a system that supports eXplainable AI (XAI) by letting users explore the specific features that have led an AI to make a specific classification. Users can also explore how these specific features would impact the end-results if different forms of AI were used. Team members: Emma Beauxis-Aussalet, Dan Xu, Abdelrahman Hassan, Aysenur Bi...
Conference Paper
Full-text available
Classifiers are applied in many domains where classification errors have significant implications. However, end-users may not always understand the errors and their impact, as error visualizations are typically designed for experts and for improving classifiers. We discuss a visualization design that addresses the specific needs of classifiers' end...
Technical Report
Full-text available
Classifiers are applied in many domains where classification errors have significant implications. However, end-users may not always understand the errors and their impact, as error visualizations are typically designed for experts and for improving classifiers. We discuss a visualization design that addresses the specific needs of classifiers' end...
Article
Full-text available
Most multi-class classifiers make their prediction for a test sample by scoring the classes and selecting the one with the highest score. Analyzing these prediction scores is useful to understand the classifier behavior and to assess its reliability. We present an interactive visualization that facilitates per-class analysis of these scores. Our sy...
Chapter
Uncertainty impacts many crucial issues the world is facing today – from climate change prediction, to scientific modelling, to the interpretation of medical data. Decisions typically rely on data which can be aggregated from different sources and further transformed using a variety of algorithms and models. Such data processing pipelines involve d...
Conference Paper
Uncertainty impacts many crucial issues the world is facing today – from climate change prediction, to scientific modelling, to the interpretation of medical data. Decisions typically rely on data which can be aggregated from different sources and further transformed using a variety of algorithms and models. Such data processing pipelines involve d...
Code
Visualization components and online user interface for exploring classification errors. The D3 components provide both expert visualizations and simplified visualization for end-users who are not experts in machine learning.
Conference Paper
Full-text available
Computer Vision is a promising technique for in-situ monitoring of ecosystems. It is non-intrusive and cost-effective compared to sending human observers. Automatic animal detection and species recognition support the study of population dynamics and species composition, i.e., the evolution of species populations' size. Fixed cameras support contin...
Chapter
Computer vision technology has been considered in marine ecology research as a innovative, promising data collection method. It contrasts with traditional practices in the information that is collected, and its inherent errors and biases. Ecology research is based on the analysis of biological characteristics (e.g., species, size, age, distribution...
Chapter
Several data analysis steps are required for understanding computer vision results and drawing conclusions about the actual trends in the fish populations. Particular attention must be drawn to the potential errors that can impact the scientific validity of end-results. This chapter discusses the means for ecologists to investigate the uncertainty...
Presentation
Full-text available
Machine Learning classifiers can be used to analyze trends in counts of items per class, e.g., over time or location. This counting task is a basis for a variety of data analysis use cases, such as the study of species populations living in an ecosystem, or the profiling of customers using a service or a product. Classification results are inherent...
Presentation
Full-text available
Machine Learning classifiers can be used to analyze trends in counts of items per class, e.g., over time or location. This counting task is a basis for a variety of data analysis use cases, such as the study of species populations living in an ecosystem, or the profiling of customers using a service or a product. Classification results are inherent...
Article
Full-text available
Machine learning is widely used for mining collections, such as images, sounds, or texts, by classifying their elements into categories. Automatic classification based on supervised learning requires groundtruth datasets for modeling the elements to classify, and for testing the quality of the classification. Because collecting groundtruth is tedio...
Conference Paper
Full-text available
Computer vision enables in-situ monitoring of animal populations at a lower cost and with less ecosystem disturbance than with human observers. However, computer vision uncertainty may not be fully understood by end-users, and the uncertainty assessments performed by technology experts may not fully address end-user needs. This knowledge gap can yi...
Conference Paper
Full-text available
Video analysis tools can provide valuable datasets for a wide range of applications, such as monitoring animal populations for ecology research, while reducing human efforts for collecting information. Transferring such technology to novel application domains implies exposing non-expert users to unfamiliar datasets and technical concepts. Existing...
Poster
Full-text available
Machine Learning techniques for automatic classification have reached a broad range of applications. But the technology transfers face issues with user trust and acceptance, as classification results inherently contain errors. Machine Learning experts rely on widely-established error measurement methods and uncertainty visualizations. However end-u...
Presentation
Full-text available
We introduce an online user interface for visualizing multidimensional data, with a simple and highly flexible interaction design. The multidimensional data of the demo describes fish populations. It is extracted from continuous video footage, collected from 9 fixed cameras that observed coral reef ecosystems during 3 years. Computer Vision algorit...
Poster
Full-text available
We present an interface for eliciting sets of acceptable gambles on a three-outcome possibility space, discuss an experiment conducted for testing this interface, and present the results of this experiment.
Conference Paper
Full-text available
----------------------------------------------------------------------------------------------------------------------------------------- A more recent & complete publication is available: https://www.researchgate.net/publication/325958191_Supporting_End-User_Understanding_of_Classification_Errors -------------------------------------------------...
Poster
Full-text available
Machine Learning uncertainty is not easily understood by end-users. It yields trust issues impeding technology transfer, or potentially critical misinterpretations of data. The state-of-the-art visualizations are complex, and potentially misleading. Our novel design reduces complexity and limits misinterpretations of the uncertainty in classificati...
Conference Paper
Full-text available
Applying computer vision for monitoring ecosystems opens new perspectives for marine ecology research. Information on fish populations can be automatically extracted from underwater cameras. As scientists, ecologists need to understand the uncertainty in computer vision results. We present an interface for studying fish populations that addresses t...
Conference Paper
Full-text available
The recent use of computer vision techniques for monitoring ecosystems has opened new perspectives for marine ecology research. These techniques can extract information about fish populations from in-situ cameras, without requiring ecologists to watch the videos. However, they inherently introduce uncertainty since automatic information extraction...
Conference Paper
In-situ video recording of underwater ecosystems is able to provide valuable information for biology research and natural resources management, e.g. changes in species abundance. Searching the videos manually, however, requires costly human effort. Our video analysis tool supports the key task of counting different species of fish, allowing marine...
Conference Paper
Full-text available
In this work we present a framework for fish population monitoring through the analysis of underwater videos. We specifically focus on the user information needs, and on the dynamic data extraction and retrieval mechanisms that support them. Sophisticated though a software tool may be, it is ultimately important that its interface satisfies users'...
Article
Understanding and analyzing fish behaviour is a fundamental task for biologists that study marine ecosystems because the changes in animal behaviour reflect environmental conditions such as pollution and climate change. To support investigators in addressing these complex questions, underwater cameras have been recently used. They can continuously...
Conference Paper
Full-text available
Long-term monitoring of the underwater environment is still labour intensive work. Using underwater surveillance cameras to monitor this environment has the potential advantage to make the task become less labour intensive. Also, the obtained data can be stored making the research reproducible. In this work, a system to analyse long-term underwater...

Projects

Projects (4)
Project
Statistical methods to estimate classification bias and error in machine learning systems.
Project
Presentation of our end-to-end solution to personalization, exploration and visualization. Collaboration with Abdelrahman (Abdo) Hassan, Dan Xu, Aysenur (Ays) Bilgi, Ahmed Mohamed (from CWI, ABN Amro, and HvA Digital Society Shool).