arxiv-summary: AI-summarized AI papers

Unsupervised Manifold Linearizing and Clustering

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Clustering data close to a union of low-dimensional manifolds is a problem in machine learning. Low-rank and sparse priors have been studied for linear subspaces. Real-world datasets cannot be approximated by linear subspaces. Works have proposed to identify the manifolds by learning a feature map. This paper proposes to simultaneously perform clustering and learn a union-of-subspace representation....

PACO: Parts and Attributes of Common Objects

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Object models are moving from predicting category labels to providing detailed descriptions of object instances. PACO is a dataset that provides part masks, attributes, and object categories across image and video datasets. PACO contains 641K part masks, 260K object boxes, and 55 attributes. Evaluation metrics and benchmark results are provided for 3 tasks on the dataset....

Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Language models can be used for complex reasoning either end-to-end or compositionally. Iterated decomposition is a workflow for developing and refining compositional LM programs. ICE is an open-source tool for visualizing the execution traces of LM programs. Iterated decomposition is applied to three real-world tasks and improves accuracy of LM programs. Paper Content Introduction Language models are often trained using feedback on outcomes Good outputs can be distinguished from bad ones As model capabilities and task complexities scale up, outcome-based evaluation may run into alignment problems Process supervision is an alternative to outcome-based training Process supervision promises increased interpretability, trust, and alignment Process supervision Process supervision is a way to train and deploy machine learning models....

A compositional account of motifs, mechanisms, and dynamics in biochemical regulatory networks

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Regulatory networks depict interactions between molecules in a biochemical system Signed graphs and signed functors are used to model and describe regulatory networks Functorial mappings are established between regulatory networks and other mathematical models in biochemistry Reaction networks modeled as Petri nets with signed links can be used to define a physical mechanism underlying a regulatory network Regulatory networks can be associated with a Lotka-Volterra system of differential equations Paper Content Introduction Living cells are made up of genes, proteins, and RNA molecules These molecules interact in complex ways to sustain the cell Regulatory networks are a directed graph that represent these interactions Edges are labeled with a positive or negative sign to indicate if the interaction is activating or inhibiting Network motifs are simple patterns that recur frequently in regulatory networks Category theory is a natural tool to study regulatory networks Sign-preserving functors can express indirect occurrences Regulatory networks can be viewed as signed graphs and signed categories Functorality is a safeguard for model transformation Regulatory networks can be connected to biochemical reaction networks Functorial assignment of continuous dynamics to regulatory networks is possible Lotka-Volterra systems of equations can be used to model complex biological systems Category theory is used to extend the constructions from closed systems to open ones Double categories are used to compose open systems together Qualitative analysis: motifs and mechanisms Regulatory networks as signed graphs Definition of a graph: a functor from Sch(Graph) to Set with two parallel morphisms Graph consists of a set of vertices and edges, and functions assigning source and target to each edge Signed graph is a graph with a function assigning a sign to each edge Regulatory networks are signed graphs with vertices representing components and edges representing interactions Sign-valued matrices are a special case of signed graphs Morphisms of signed graphs can embed or collapse multiple vertices/edges Signed graphs form a well-behaved category Signed graph morphisms factor essentially uniquely as an epimorphism followed by a monomorphism Refining regulatory networks using signed categories and functors Signed morphisms do not capture the idea of refining regulatory networks Signed categories are needed to express refinement Signed categories have objects and morphisms with signs Morphisms of signed categories are called signed functors Signed functors preserve signs Signed path categories are generated by signed graphs Signed functors between signed graphs are determined by morphisms of signed graphs Positive autoregulation is an example of a network motif Instances of motifs are monic signed functors A functor maps regulatory networks to instances of motifs A commutative square of functors is used to extend the signed path category functor Mechanistic models as petri nets with links Regulatory networks summarize how components of a complex biochemical system interact Regulatory networks include only a subset of the system’s components Regulatory networks do not model individual reactions and processes, only pairwise interactions Regulatory networks are not fully mechanistic models Mechanistic models in biochemistry model individual reactions Petri nets with signed links have two types of vertices and signed edges Signed edges represent reactions with multiple inputs/outputs, consumption, and promotion/inhibition Petri nets with signed links can be approximated as signed graphs A mechanistic model for a regulatory network is a Petri net with signed links and a monic signed functor Parameterized dynamical systems Baez and Pollard extended the mass-action model of reaction networks to a functor from the category of Petri nets with rates into a category of dynamical systems Rate coefficients are often unknown and must be extracted from existing literature or estimated from experimental data The dynamics functor is nearly identical to Baez-Pollard’s The dynamics functor is the main building block in constructing a category of parameterized dynamical systems Many dynamical models depend linearly on their parameters The law of mass action and Lotka-Volterra equations are linear in the rate and affinity parameters To express important physical constraints and to define a semantics for signed graphs, the dynamical system and its parameters are restricted to be nonnegative There is a functor Dynam + : FinSet → Con that sends a finite set S to the conical space of essentially nonnegative, algebraic vector fields A conically parameterized nonnegative dynamical system consists of finite sets P and S together with a conic-linear map v : R P + → Dynam + (S) The categories of linearly and conically parameterized dynamical systems are finitely cocomplete The initial linearly parameterized dynamical system has no parameter variables, no state variables, and the unique (trivial) vector field on the zero vector space The coproduct of two linearly parameterized dynamical systems has parameter variables P 1 + P 2 , state variables S 1 + S 2 , and parameterized vector field The lotka-volterra dynamical model A Lotka-Volterra system with n species has a vector field with state vector x ∈ R n and parameters ρ ∈ R n and β ∈ R n×n The parameter ρ i sets the rate of growth or decay for species i The parameter β i,j defines a promoting or inhibiting effect of species j on species i A functor from finite graphs to linearly parameterized dynamical systems gives a semantics for unlabeled graphs A functor from finite signed graphs to conically parameterized nonnegative dynamical systems gives a semantics for regulatory networks The functor LV preserves finite colimits The morphism LV(p) sends the parameterized dynamical system with state variables {R, * } and parameters ρ, β ∈ R {R, * } + The first way sets the system’s coefficients equal to sums of the former’s coefficients The second way substitutes x * for each x i , i ∈ S, in the first system and then takes the vector field v * to be the sum of the v i ’s Composing lotka-volterra models Extending Lotka-Volterra dynamics functors between graphs and parameterized dynamical systems Vertical composition is by composition in FinSet and in Para(Dynam) Horizontal composition and monoidal products are by pushouts and coproducts in Para(Dynam) Finite sets in the feet of the cospans interpreted as linearly parameterized dynamical systems with no parameter variables and identically zero vector fields Symmetric monoidal double category Open(Para(Dynam + )) of open conically parameterized nonnegative dynamical systems Projection functor π S : Para(Dynam) → FinSet Left adjoint Z : FinSet → Para(Dynam) Symmetric monoidal double category of Z-structured cospans Double functor between open graphs and open parameterized dynamical systems Coproduct of the parameter variables for identified vertices Lax double functor LV : Open(FinGraph) → Open(Para(Dynam)) Comparison cells defined using the morphisms of linearly parameterized dynamical systems Natural transformation α S : Z(S) → LV(Disc S) Conclusion Regulatory networks are a tool to describe interactions between molecules in biochemical systems We studied regulatory networks, reaction networks, and parameterized dynamical systems We used signed graphs, Petri nets with signed links, and Lotka-Volterra dynamics We aimed to systematize the language and methods of describing, composing, and transforming scientific models We studied four different motifs in regulatory networks We discussed feedback loop analysis in system dynamics We studied open signed graphs and open parameterized dynamical systems We constructed a lax double functor between open signed graphs and open parameterized dynamical systems

A Succinct Summary of Reinforcement Learning

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Reviews key results in single-agent reinforcement learning Intended audience are those with some familiarity with RL Paper Content Fundamentals 2.1 the rl paradigm Reinforcement learning (RL) is a field of machine learning RL has a history in psychology, neuroscience, economics, engineering, and mathematics RL is an interdisciplinary field Agent and environment Observability Agent receives observation identical to environment state Environment is partially observable Markov processes and markov reward processes Markov process is a sequence of random states with the Markov property Defined in terms of a finite set of states and a state transition probability kernel Markov Reward Process (MRP) extends the Markov process by including a reward function and a discount factor Immediate expected reward in a given state is defined as a product of the state transition probability and the reward function Discount factor determines the present value of future rewards Cumulative sum of discounted rewards is a quantity RL agents often seek to maximize Markov decision processes Single-agent RL can be formalized using Markov decision processes (MDPs)....

Identifying Exoplanets with Deep Learning. V. Improved Light Curve Classification for TESS Full Frame Image Observations

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract TESS mission produces large amount of time series data Deep learning techniques used to differentiate promising astrophysical eclipsing candidates from other phenomena Dataset curated using manual review process and used to train neural network Neural network achieves 99.6% recall and 75.7% precision Neural network able to recover 3577 out of 4140 TOIs Paper Content Introduction Human judgement has been used to detect exoplanets for 30 years Exoplanets are hard to detect due to their size and faintness Historically, humans have been used to classify planet signals as either false positives or viable planet candidates Humans are slow and inconsistent when classifying planet signals Machine learning has become a popular tool for identifying planet candidates Astronet-Triage was used in the TESS Quick-Look Pipeline to triage planet candidates Astronet-Triage-v2 was created to reduce the number of lost planet candidates while throwing out more false positives Input transit signals and corresponding light curves were used for training and testing the classifier Data was processed before being input to the neural network classifier Neural network architecture and training process were described Results of the classifier were quantified and presented Implications of the results were discussed Data Used 25000 human vetted transit signals for training and testing model Signals detected by Quick-Look Pipeline (QLP) Tces from tess ffis TESS collected full-frame images every 30 minutes for 2 years FFI cadence was updated to 10 minutes for 1st Extended Mission QLP produces light curves from images for targets in TIC with TESS-band magnitude brighter than 13....

Large Language Models as Corporate Lobbyists

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Demonstrated proof-of-concept of large language model conducting corporate lobbying activities Model determines relevance of proposed U.S. Congressional bills to public companies and provides explanations and confidence levels Model drafts letters to sponsor of bill to persuade congressperson to make changes to proposed legislation Used hundreds of novel ground-truth labels to benchmark model performance Model outperforms baseline of predicting irrelevance Performance of previous model (text-davinci-002) worse than always predicting irrelevance AI used to augment human lobbyists, but may lead to less human oversight over automated assessments of policy ideas Paper Content I....

Language Models are Drummers: Drum Composition with Natural Language Pre-Training

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Automatic music generation with AI requires a lot of data which is hard to obtain for less common genres and instruments. Deep models can transfer knowledge from language to music by finetuning large language models pre-trained on a massive text corpus. GPT3 is capable of generating reasonable drum grooves, while models not pre-trained show no such ability....

Deep Learning and Computational Physics (Lecture Notes)

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Compiled as lecture notes for a course at the University of Southern California Accessible to engineering graduate students with a strong background in Applied Mathematics Introduce student to topics in deep learning Exploit connections between deep learning algorithms and conventional techniques of computational physics Use concepts from computational physics to develop understanding of deep learning algorithms Novel deep learning algorithms can be used to solve challenging problems in computational physics Paper Content Computational physics Computational physics solves problems in science and engineering Involves collecting measurements of an observable Postulating a physical law based on observations Writing a mathematical description of the law Solving the system using exact or approximate methods Machine learning ML does not require physical laws Collect data from physical phenomena, measurements, or numerical solvers Train algorithm to discover patterns or relations Use ML algorithm to make predictions and validate with data Examples of ml Regression algorithms are used to approximate a function given a set of pairwise data Decision trees can be used to predict the probability of a new individual owning a house Clustering algorithms are used to find patterns in a set of data Types of ml algorithms based leaning task Supervised learning: Predicting labels for new data based on existing data Unsupervised learning: Finding relations among different regions of data Artificial intelligence, machine learning and deep learning AI, ML and DL are related but different concepts AI refers to a system with human-like intelligence ML is a key component of an AI system Self-driving cars are an example of AI ML algorithms are trained using data DL is a subset of ML algorithms DL architecture is loosely motivated by how signals are transmitted by the central nervous system Machine learning and computational physics Combining computational physics and ML can provide an alternate route to representing mathematical laws....

Causal Inference in Recommender Systems: A Survey of Strategies for Bias Mitigation, Explanation, and Generalization

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Recommender systems (RSs) are used to estimate user interests and predict their future behaviors. Traditional RSs do not consider the causal reasons that lead to observed user behaviors, leading to biases in generated recommendations. Recent years have seen an upsurge of interest in enhancing traditional RSs with causal inference techniques. This survey provides an overview of causal RSs and discusses how different causal inference techniques can be introduced to address challenges....