Michał Bień

Michał Bień
École Polytechnique Fédérale de Lausanne | EPFL

Master of Science

About

6
Publications
6,810
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
13
Citations
Introduction
Working on language models training, management and use cases in big data and distributed systems.
Additional affiliations
January 2021 - present
École Polytechnique Fédérale de Lausanne
Position
  • Research Assistant
Description
  • Working in EPFL Data Science Lab on ACCOMOJI project.
June 2019 - August 2019
CERN
Position
  • Openlab Summer Student
Education
September 2020 - July 2022
École Polytechnique Fédérale de Lausanne
Field of study
  • Digital Humanities
October 2016 - February 2020
Poznan University of Technology
Field of study
  • Computer Science

Publications

Publications (6)
Preprint
Full-text available
Emojis come with prepacked semantics making them great candidates to create new forms of more accessible communications. Yet, little is known about how much of this emojis semantic is agreed upon by humans, outside of textual contexts. Thus, we collected a crowdsourced dataset of one-word emoji descriptions for 1,289 emojis presented to participant...
Conference Paper
Full-text available
Semi-structured text generation is a non-trivial problem. Although last years have brought lots of improvements in natural language generation , thanks to the development of neural models trained on large scale datasets, these approaches still struggle with producing structured, context-and commonsense-aware texts. Moreover, it is not clear how to...
Thesis
Full-text available
Cooking recipes are a very specific type of text, that allows to share culinary ideas between people by providing an algorithm for their realization. Creating a recipe requires a certain dose of creativity and often some of the best ones are crafting unlikely ingredient combinations. By providing an automatic recipe generator we can allow the creat...
Preprint
Full-text available
This work has successfully deployed two different use cases of interest for High Energy Physics using cloud resources: • CMS Big data reduction: This use case consists in running a data reduction workloads for physics data. The code and implementation has originally been developed by CERN openlab in collaboration with CMS and Intel in 2017-2018. It...
Presentation
Full-text available
Lightning talk presentation that summarizes my work at CERN

Network

Cited By

Projects

Projects (2)
Project
A cooking recipes dataset for semi-structured text generation
Project
Openlab Summer Student Programme project