Soumya Paul’s research while affiliated with University of Luxembourg and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (15)


Limits for Learning with Language Models
  • Preprint
  • File available

June 2023

·

94 Reads

·

·

·

[...]

·

Soumya Paul

With the advent of large language models (LLMs), the trend in NLP has been to train LLMs on vast amounts of data to solve diverse language understanding and generation tasks. The list of LLM successes is long and varied. Nevertheless, several recent papers provide empirical evidence that LLMs fail to capture important aspects of linguistic meaning. Focusing on universal quantification, we provide a theoretical foundation for these empirical findings by proving that LLMs cannot learn certain fundamental semantic properties including semantic entailment and consistency as they are defined in formal semantics. More generally, we show that LLMs are unable to learn concepts beyond the first level of the Borel Hierarchy, which imposes severe limits on the ability of LMs, both large and small, to capture many aspects of linguistic meaning. This means that LLMs will continue to operate without formal guarantees on tasks that require entailments and deep linguistic understanding.

Download


Fully specified SDRS for the first three sentences of example (2)
Underspecified SDRS for example (5)
A partial ME evaluation game tree for (5)
The progressive reinforcement of bias
Bias in Semantic and Discourse Interpretation

June 2022

·

140 Reads

·

4 Citations

Linguistics and Philosophy

In this paper, we show how game theoretic work on conversation combined with a theory of discourse structure provides a framework for studying interpretive bias and how bias affects the production and interpretation of linguistic content. We model the influence of author bias on the discourse content and structure of the author’s linguistic production and interpreter bias on the interpretation of ambiguous or underspecified elements of that content and structure. Interpretive bias is an essential feature of learning and understanding but also something that can be exploited to pervert or subvert the truth. We develop three types of games to understand and to analyze a range of interpretive biases, the factors that contribute to them, and their strategic effects.


Figure 1. A counterfactual space aroundˆxaroundˆ aroundˆx.
Figure 2. Causal Graph satisfying RE.
Counterfactual Models for Fair and Adequate Explanations

March 2022

·

64 Reads

·

9 Citations

Machine Learning and Knowledge Extraction

Recent efforts have uncovered various methods for providing explanations that can help interpret the behavior of machine learning programs. Exact explanations with a rigorous logical foundation provide valid and complete explanations, but they have an epistemological problem: they are often too complex for humans to understand and too expensive to compute even with automated reasoning methods. Interpretability requires good explanations that humans can grasp and can compute. We take an important step toward specifying what good explanations are by analyzing the epistemically accessible and pragmatic aspects of explanations. We characterize sufficiently good, or fair and adequate, explanations in terms of counterfactuals and what we call the conundra of the explainee, the agent that requested the explanation. We provide a correspondence between logical and mathematical formulations for counterfactuals to examine the partiality of counterfactual explanations that can hide biases; we define fair and adequate explanations in such a setting. We provide formal results about the algorithmic complexity of fair and adequate explanations. We then detail two sophisticated counterfactual models, one based on causal graphs, and one based on transport theories. We show transport based models have several theoretical advantages over the competition as explanation frameworks for machine learning algorithms.


Fair and Adequate Explanations

August 2021

·

37 Reads

·

12 Citations

Lecture Notes in Computer Science

Recent efforts have uncovered various methods for providing explanations that can help interpret the behavior of machine learning programs. Exact explanations with a rigorous logical foundation provide valid and complete explanations, but they have an epistemological problem: they may be too complex for humans to understand and too expensive to compute even with automated reasoning methods. Interpretability requires good explanations that humans can grasp and can compute.


Adequate and fair explanations

January 2020

·

88 Reads

Explaining sophisticated machine-learning based systems is an important issue at the foundations of AI. Recent efforts have shown various methods for providing explanations. These approaches can be broadly divided into two schools: those that provide a local and human interpreatable approximation of a machine learning algorithm, and logical approaches that exactly characterise one aspect of the decision. In this paper we focus upon the second school of exact explanations with a rigorous logical foundation. There is an epistemological problem with these exact methods. While they can furnish complete explanations, such explanations may be too complex for humans to understand or even to write down in human readable form. Interpretability requires epistemically accessible explanations, explanations humans can grasp. Yet what is a sufficiently complete epistemically accessible explanation still needs clarification. We do this here in terms of counterfactuals, following [Wachter et al., 2017]. With counterfactual explanations, many of the assumptions needed to provide a complete explanation are left implicit. To do so, counterfactual explanations exploit the properties of a particular data point or sample, and as such are also local as well as partial explanations. We explore how to move from local partial explanations to what we call complete local explanations and then to global ones. But to preserve accessibility we argue for the need for partiality. This partiality makes it possible to hide explicit biases present in the algorithm that may be injurious or unfair.We investigate how easy it is to uncover these biases in providing complete and fair explanations by exploiting the structure of the set of counterfactuals providing a complete local explanation.



Strategic Conversations Under Imperfect Information: Epistemic Message Exchange Games

December 2018

·

38 Reads

·

21 Citations

Journal of Logic Language and Information

This paper refines the game theoretic analysis of conversations in Asher et al. (J Philos Logic 46:355–404, 2017) by adding epistemic concepts to make explicit the intuitive idea that conversationalists typically conceive of conversational strategies in a situation of imperfect information. This ‘epistemic’ turn has important ramifications for linguistic analysis, and we illustrate our approach with a detailed treatment of linguistic examples. © 2018 Springer Science+Business Media B.V., part of Springer Nature


Bias in Semantic and Discourse Interpretation

June 2018

·

85 Reads

In this paper, we show how game-theoretic work on conversation combined with a theory of discourse structure provides a framework for studying interpretive bias. Interpretive bias is an essential feature of learning and understanding but also something that can be used to pervert or subvert the truth. The framework we develop here provides tools for understanding and analyzing the range of interpretive biases and the factors that contribute to them.


Conversation and Games

December 2017

·

20 Reads

·

1 Citation

Lecture Notes in Computer Science

In this paper we summarize concepts from earlier work and demonstrate how infinite sequential games can be used to model strategic conversations. Such a model allows one to reason about the structure and complexity of various kinds of winning goals that conversationalists might have. We show how to use tools from topology, set-theory and logic to express such goals. We then show how to tie down the notion of a winning condition to specific discourse moves using techniques from Mean Payoff games and discounting. We argue, however, that this still requires another addition from epistemic game theory to define appropriate solution and rationality underlying a conversation.


Citations (10)


... A growing body of literature links the lack of coherence in neural models in natural language generation to AI hallucination 17 (Ji et al., 2023b). Self-contradiction in generated text is a major form of hallucination, separate from inconsistencies with external sources Asher et al., 2023;Mündler et al., 2024). Interestingly, the application of 'hallucination' in AI has led someone to a sort of inversion. ...

Reference:

The bewitching AI: The Illusion of Communication with Large Language Models
Limits for learning with language models

... Deletion and negation interventions allow us to study the models' behavior in a counterfactual scenario. Such counterfactual scenarios are crucial to understanding the causal efficacy of the rationale in the models' inferring of the ground truth answer for a given question (Schölkopf 2019;Kusner et al. 2017;Asher, Paul, and Russell 2021). Scientific experiments establish or refute causal links between A and B by seeing what happens when A holds and what happens when ¬A holds. ...

Fair and Adequate Explanations
  • Citing Chapter
  • August 2021

Lecture Notes in Computer Science

... In the case of Trump's attacks on the media as in (1), it is that he is being insincere so as to gain advantage. The inference could also incorporate Bayesian reasoning to account for inferential bias arising from the degree to which the speaker is trusted by the listener (Asher, Hunter, and Paul, 2021). ...

Bias in Semantic and Discourse Interpretation

Linguistics and Philosophy

... The advantage of relying on QUDs as an established pragmatic model of communication is the model's relationship with (overt) questions. Linguistically, bullshit detection can also be based on other frameworks, which may be more apt to model uncooperative dialog (such as Asher et al., 2017;Asher and Paul, 2018). However, if it is possible to identify underlying questions and their answers in bullshitting texts, we may benefit from such NLP tasks as question answering or answer quality estimation. ...

Strategic Conversations Under Imperfect Information: Epistemic Message Exchange Games

Journal of Logic Language and Information

... In stochastic games, or multi-stage games generally, participants reason about how to act by attaching rewards to those long-term goals and back-propagating those to estimate the values of actions they can take now. Computational linguists have employed this kind of game-based decision framework to model strategic dialogue planning (Asher and Paul, 2017). These formalisms can provide a framework to explain how long-term motivations are turned into decisions about what to do immediately. ...

Conversation and Games
  • Citing Conference Paper
  • December 2017

Lecture Notes in Computer Science

... Humans face many game problems that are too large for the whole game tree to be used in their deliberations about action, and very little is understood about how they cope in such scenarios. However, when a human player's chosen strategy is conditioned on her limited perspective of how the game might progress (Degremont et al. 2016), then it should be possible to manipulate her into changing her planned move by mentioning a possible outcome of an alternative move. This paper demonstrates that human players can be manipulated this way: in the game The Settlers of Catan, where negotiation is only a small part of what one must do to win the game thereby generating uncertainty about which outcomes to the negotiation are good and which are bad, the likelihood that a player accepts a trade offer that deviates from their declared preferred strategy is higher if it is accompanied by a description of what that trade offer can lead to. ...

A Logic of Sights

Journal of Logic and Computation

... Inspired by this earlier literature, Asher, Paul, and Venant (2017) provide a model of language in terms of a space of finite and infinite strings. Many of these strings are just non-meaningful sequences of words but the space also includes coherent and consistent strings that form meaningful texts and conversations. ...

Message Exchange Games in Strategic Contexts

Journal of Philosophical Logic

... In (T, B) both players gain something; S because H has acquired some content he was willing to transmit, and perhaps S has also gained some "face" from the transaction (cf. McCready, Asher, & Soumya, 2013), and H because she has learned something true. In (F, D) S has said something false and H has ignored him, meaning that both players have wasted their time and must pay some penalty. ...

Winning Strategies in Politeness
  • Citing Conference Paper
  • November 2012

Lecture Notes in Computer Science