Christopher J. Quinn’s research while affiliated with Purdue University West Lafayette and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (23)


Issues and Methods for Access, Storage, and Analysis of Data From Online Social Communities
  • Chapter

January 2018

·

14 Reads

Christopher John Quinn

·

Matthew James Quinn

·

·

John Thomas Quinn

This chapter provides an overview for a number of important issues related to studying user interactions in an online social network. The approach of social network analysis is detailed along with important basic concepts for network models. The different ways of indicating influence within a network are provided by describing various measures such as degree centrality, betweenness centrality and closeness centrality. Network structure as represented by cliques and components with measures of connectedness defined by clustering and reciprocity are also included. With the large volume of data associated with social networks, the significance of data storage and sampling are discussed. Since verbal communication is significant within networks, textual analysis is reviewed with respect to classification techniques such as sentiment analysis and with respect to topic modeling specifically latent semantic analysis, probabilistic latent semantic analysis, latent Dirichlet allocation and alternatives. Another important area that is provided in detail is information diffusion.


Combining human and machine intelligence to derive agents’ behavioral rules for groundwater irrigation

September 2017

·

121 Reads

·

27 Citations

Advances in Water Resources

For agent-based modeling, the major challenges in deriving agents’ behavioral rules arise from agents’ bounded rationality and data scarcity. This study proposes a “gray box” approach to address the challenge by incorporating expert domain knowledge (i.e., human intelligence) with machine learning techniques (i.e., machine intelligence). Specifically, we propose using directed information graph (DIG), boosted regression trees (BRT), and domain knowledge to infer causal factors and identify behavioral rules from data. A case study is conducted to investigate farmers' pumping behavior in the Midwest, U.S.A. Results show that four factors identified by the DIG algorithm- corn price, underlying groundwater level, monthly mean temperature and precipitation- have main causal influences on agents’ decisions on monthly groundwater irrigation depth. The agent-based model is then developed based on the behavioral rules represented by three DIGs and modeled by BRTs, and coupled with a physically-based groundwater model to investigate the impacts of agents’ pumping behavior on the underlying groundwater system in the context of coupled human and environmental systems.


Robust Sparse Approximations for Stochastic Dynamical Systems

July 2017

·

27 Reads

IFAC-PapersOnLine

Inferring the exact topology of the interactions in a large, stochastic dynamical system from time-series data can often be prohibitive computationally and statistically without strong side information. One alternative is to seek approximations of the system topology that nonetheless describe the data well. In recent works, algorithms were proposed to identify sparse approximations which are optimal in terms of Kullback-Leibler divergence. Those algorithms relied on point estimates of statistics from the data. In this work, we investigate the more practical setting where point estimates are not reliable. We propose an algorithm to identify sparse, connected approximations that are robust to estimation error.


How Information Spreads in Online Social Networks

July 2016

·

231 Reads

·

1 Citation

Online social networks are increasingly important venues for businesses to promote their products and image. However, information propagation in online social networks is significantly more complicated compared to traditional transmission media such as newspaper, radio, and television. In this chapter, we will discuss research on modeling and forecasting diffusion of virally marketed content in social networks. Important aspects include the content and its presentation, the network topology, and transmission dynamics. Theoretical models, algorithms, and case studies of viral marketing will be explored.



Crowdsourcing High Quality Labels with a Tight Budget

February 2016

·

41 Reads

·

50 Citations

·

·

·

[...]

·

Christopher J. Quinn

In the past decade, commercial crowdsourcing platforms have revolutionized the ways of classifying and annotating data, especially for large datasets. Obtaining labels for a single instance can be inexpensive, but for large datasets, it is important to allocate budgets wisely. With limited budgets, requesters must trade-off between the quantity of labeled instances and the quality of the final results. Existing budget allocation methods can achieve good quantity but cannot guarantee high quality of individual instances under a tight budget. However, in some scenarios, requesters may be willing to label fewer instances but of higher quality. Moreover, they may have different requirements on quality for different tasks. To address these challenges, we propose a flexible budget allocation framework called Requallo. Requallo allows requesters to set their specific requirements on the labeling quality and maximizes the number of labeled instances that achieve the quality requirement under a tight budget. The budget allocation problem is modeled as a Markov decision process and a sequential labeling policy is produced. The proposed policy greedily searches for the instance to query next as the one that can provide the maximum reward for the goal. The Requallo framework is further extended to consider worker reliability so that the budget can be better allocated. Experiments on two real-world crowdsourcing tasks as well as a simulated task demonstrate that when the budget is tight, the proposed Requallo framework outperforms existing state-of-the-art budget allocation methods from both quantity and quality aspects.


Figure 7: The ratio (17) for all approximations in the order selected by Algorithm 5, averaged over 150 trials. The black and green curves depict the mean standard deviation respectively. 
Figure 8: The estimate values used by Algorithm 5 and a modified Algorithm 3 to rank unconstrained approximations using optimal and greedy search. One trial is shown. Estimates are scaled to actual ratio values (17). The smooth light blue curve corresponds to ordering from the optimal search. The black curve corresponds to the greedy search. 
Figure 9: The difference between the ratio (17) for the actual optimal ordering of unconstrained approximations and the ordering returned by Algorithm 3 (using estimates). Results are averaged over 150 trials. The black curve depicts the mean. The green curves depict one standard deviation. 
Bounded Degree Approximations of Stochastic Networks
  • Article
  • Full-text available

June 2015

·

54 Reads

·

5 Citations

We propose algorithms to approximate directed information graphs. Directed information graphs are probabilistic graphical models that depict causal dependencies between stochastic processes in a network. The proposed algorithms identify optimal and near-optimal approximations in terms of Kullback-Leibler divergence. The user-chosen sparsity trades off the quality of the approximation against visual conciseness and computational tractability. One class of approximations contains graphs with specified in-degrees. Another class additionally requires that the graph is connected. For both classes, we propose algorithms to identify the optimal approximations and also near-optimal approximations, using a novel relaxation of submodularity. We also propose algorithms to identify the r-best approximations among these classes, enabling robust decision making.

Download

Dynamic and Succinct Statistical Analysis of Neuroscience Data

May 2014

·

53 Reads

·

26 Citations

Proceedings of the IEEE

Modern neuroscientific recording technologies are increasingly generating rich, multimodal data that provide unique opportunities to investigate the intricacies of brain function. However, our ability to exploit the dynamic, interactive interplay among neural processes is limited by the lack of appropriate analysis methods. In this paper, some challenging issues in neuroscience data analysis are described, and some general-purpose approaches to address such challenges are proposed. Specifically, we discuss statistical methodologies with a theme of loss functions, and hierarchical Bayesian inference methodologies from the perspective of constructing optimal mappings. These approaches are demonstrated on both simulated and experimentally acquired neural data sets to assess causal influences and track time-varying interactions among neural processes on a fine time scale.


Optimal bounded-degree approximations of joint distributions of networks of stochastic processes

July 2013

·

23 Reads

·

2 Citations

We propose two algorithms to identify approximations for joint distributions of networks of stochastic processes. The approximations correspond to low-complexity network structures - connected, directed graphs with bounded indegree. The first algorithm identifies an optimal approximation in terms of KL divergence. The second efficiently finds a near-optimal approximation. Sufficient conditions are introduced to guarantee near-optimality.


Robust directed tree approximations for networks of stochastic processes

July 2013

·

23 Reads

·

3 Citations

We develop low-complexity algorithms to robustly identify the best directed tree approximation for a network of stochastic processes in the finite-sample regime. Directed information is used to quantify influence between stochastic processes and identify the best directed tree approximation in terms of Kullback-Leibler (KL) divergence. We provide finite-sample complexity bounds for confidence intervals of directed information estimates. We use these confidence intervals to develop a minimax framework to identify the best directed tree that is robust to point estimation errors. We provide algorithms for this minimax calculation and describe the relationships between exactness and complexity.


Citations (17)


... In this case, interpretability of the model has to be ensured which excludes many of the named machine-learning approaches. Regression trees, however, have been proven to be interpretable and hence can be used to derive rules of decision making (Hu et al., 2017). ...

Reference:

Impact of catchment and climate attributes on flood generating processes and their effect on flood statistics
Combining human and machine intelligence to derive agents’ behavioral rules for groundwater irrigation
  • Citing Article
  • September 2017

Advances in Water Resources

... Currently, these data sets are annotated post hoc by manual annotators (for example, Mturkers) with tools such as Mechanical Turk [12], Sagemaker Ground Truth [13], Supervisely [14], and Anolytics [15]. Data set annotations are often time, money, and labor-intensive endeavors [16]. This bottleneck inhibits users from quickly and efficiently creating customized data sets for end-user applications [17]. ...

Crowdsourcing High Quality Labels with a Tight Budget
  • Citing Conference Paper
  • February 2016

... In addition, social networks, exploiting computer-mediated technologies, facilitate the creation and sharing of information, ideas, and other forms of expression via virtual communities and networks (Hermkens and Kietzmann, 2011). As a result, the vast majority of the population can directly diffuse information, via short messages, posts in social networks, etc, which can then spread extremely fast (Favre, Abdelkader Zighed, Guille, Hacid, 2013, Olinsky, Quinn, Quinn, Quinn, 2016. Furthermore, the emergence of special-purpose social networks and services can certainly provide ability and motivate willingness for special forms of information to be shared, thus, translating awareness to action. ...

How Information Spreads in Online Social Networks
  • Citing Chapter
  • July 2016

... Sensory stimuli can evoke a cascade of oscillatory interactions, which often reflect tight excitatory and inhibitory control of spiking in piriform cortex during the presentation of odors (47), and produce exquisite gamma oscillatory dynamics in visual cortices in response to attended stimuli (48,49). In motor systems, tight coordination can also be observed throughout the basal ganglia in response to utilized cues (50) but also in traveling waves through motor cortices during movement preparation and execution (51)(52)(53). Aberrant oscillatory patterns have been observed in a number of neural disorders [e.g., disrupted gamma in Alzheimer's disease (54,55), disrupted b in Parkinson's disease (56)] and are commonly used as markers of altered interactions and disease progression. However, in all cases, the relationship between these oscillatory currents and cell spiking activity has been poorly understood due to a limited ability to assess spiking relationships to ongoing oscillations at the time scales over which they evolve (as opposed to averages over seconds-long epochs). ...

Dynamic and Succinct Statistical Analysis of Neuroscience Data
  • Citing Article
  • May 2014

Proceedings of the IEEE

... In our preliminary work Quinn et al. (2013b), we developed an algorithm to identify the optimal bounded in-degree approximation containing a directed spanning tree subgraph. This appears here as Algorithm 2. Also, a sufficient condition for a greedy search to return near-optimal approximations was identified in Quinn et al. (2013b), presented here as Definition 11. ...

Optimal bounded-degree approximations of joint distributions of networks of stochastic processes
  • Citing Conference Paper
  • July 2013

... Some of the keywords considered were: "graphic methods", "mathematical models", "problem solving", and "data processing". For example, the works developed by Quinn et al. [14] and Blasco et al. [15]. In the first work, authors developed an efficient method, based on Kullback-Leibler divergence minimization, to obtain an optimal approximate joint distribution with at most one parent for any node in the graph, facilitating more straightforward computation and visual extraction of relevant information. ...

Efficient Methods to Compute Optimal Tree Approximations of Directed Information Graphs
  • Citing Article
  • June 2013

IEEE Transactions on Signal Processing

... It is impossible to list all applications of Theorem 1.1. A few are: in the study of root-mean-square (RMS) absolute cross relation of unit vectors [73], frame potential [10,15,19], correlations [72], codebooks [30], 1 code division multiple access (CDMA) systems [54,55], wireless systems [70], compressed/compressive sensing [2,7,32,35,74,80,81,83], 'game of Sloanes' [48], equiangular tight frames [79], equiangular lines [24,34,47,63], digital fingerprinting [62] etc. Theorem 1.1 has been improved/different proofs were given in [20,25,26,31,45,71,78,84,85]. In 2021 M. ...

Fingerprinting With Equiangular Tight Frames
  • Citing Article
  • March 2013

IEEE Transactions on Information Theory

... The pairwise directed information I(X i → X k ) is a functional of only the pairwise distribution P Xi,X k and is much easier to estimate. However, in general computing pairwise directed information is not sufficient: there are cases when I(X → Y) > 0 but I(X → Y Z) = 0, and others when I(X → Y) = 0 but I(X → Y Z) > 0. Some initial work to achieve lower complexity methods include [12], which shows that if for each process, an upperbound on its indegree is known, then causally conditioning on those numbers of processes is sufficient (and in general necessary) to recover the minimal generative model. Future directions of this research include focusing on cases where the processes are from a certain family of distributions to achieve even lower complexity methods. ...

A Minimal Approach to Causal Inference on Topologies with Bounded Indegree
  • Citing Article
  • December 2011

... To be precise, in DIG, we determine the causality by comparing two conditional probabilities in KLdivergence sense: one is the conditional probability of N i t+dt given full history, and the other one is the conditional probability of N i t+dt given full history except that of type-j event. Last but not least, both Granger causal graph and DIG are equivalent to minimal generative model graphs [Quinn et al., 2011] and therefore can be used for causal inference in the same manner Bayesian networks are used for correlative statistical inference. ...

Equivalence between minimal generative model graphs and directed information graphs
  • Citing Conference Paper
  • September 2011