arxiv-summary: AI-summarized AI papers

GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities

Abstract
- The global economy is increasingly dependent on knowledge workers
- The AICPA developed the Uniform CPA Examination to measure the capability readiness of knowledge workers
- OpenAI’s text-davinci-003 and prior versions of GPT were evaluated on a sample Regulation (REG) exam and an assessment of multiple-choice questions
- text-davinci-003 achieves a 14.4% correct rate on the sample REG exam section, underperforming human capabilities
- text-davinci-003 approaches human-level performance on the Remembering & Understanding and Application skill levels
- Recent generations of GPT-3 demonstrate material improvements, rising from 30% to 57%
Paper Content
Introduction
- Knowledge work is an important part of the global economy
- Leading management theorists have studied knowledge workers for nearly seven decades
- Hundreds of millions to billions of people are considered knowledge workers
- Organizations require knowledge workers to demonstrate their preparedness through assessments
- Public accounting is a multidisciplinary practice that requires legal, financial, accounting, auditing, technology, and ethical knowledge and skills
- The CPA Exam is the most comprehensive assessment of knowledge work readiness
- The CPA Exam is divided into four sections: Auditing and Attestation, Business Environment and Concepts, Financial Accounting and Reporting, and Regulation
- AI has not been able to perform knowledge work
- Recent research has shown potential to address capability gaps
- GPT-3 has demonstrated state-of-the-art performance on a wide range of tasks
- GPT-3 was evaluated on the Bar Exam and achieved near-parity with human test-takers
- GPT-3 was evaluated on the CPA Exam to assess its usefulness for knowledge work
- Analysis suggests areas where GPT-3 may be useful and areas where research is still required
AICPA exam
- The Uniform CPA Examination is a computerized assessment based on psychometric and statistical techniques....

Data Distillation: A Survey

Abstract
- Deep learning has led to the curation of many datasets
- Training parameter-hungry models on large datasets poses problems
- Data distillation approaches aim to create data summaries
- A formal framework and taxonomy of existing approaches are presented
- Data distillation approaches for images, graphs, and user-item interactions are discussed
- Current challenges and future research directions are identified
Paper Content
Introduction
- Data distillation is a task that aims to create tiny, high-fidelity summaries of data....

Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models

Abstract
- Language models learn a lot of information during pretraining
- Facts can be stored in different locations than previously thought
- Past work on model editing methods relies on Causal Tracing to select which model layers to edit
- Experiments show that editing performance is largely unrelated to localization results from representation denoising
- Better understanding of language models may not always translate to insights about how to best change their behavior
Paper Content
Introduction
- Language models learn facts during pretraining
- Recent work explores how facts are stored in model weights
- Model editing methods can be used to inject new facts into model weights
- The connection between localization and editing rests on the assumption that one should edit a model by localizing a behavior and then editing that component
- Causal Tracing measures the information content of hidden representations and is used to motivate the ROME and MEMIT model-editing methods
- The correlation between Causal Tracing results and edit success is near zero
- It is better to ignore tracing results and always choose an early-to-mid-layer MLP weight for editing
- Tracing effects explain only a small fraction of the variance in editing performance
Related work
- Localization methods focus on model components such as layers, neurons, and weight matrices
- MLP layers are studied for their role in factual association
- Localization is validated by editing neuron activations or layer weights
- Editing suggested locations does not show whether it is necessary or the best option
- Recent work has looked into editing success across layers
- Investigating edit success at the datapoint level reveals unexpected results
Notation and background
Data notation
- Consider facts of the form (s, r, o)
- Prompt P for some fact (s, r, o)
- Variations of the data for the fact (s, r, o):
- s* is a “neighboring” entity to the subject s
- r* is a paraphrase of the relation r
- s_noise is a noised representation of the subject s
- o_false is an object that incorrectly completes the tuple (s, r, •)
- o_true is the object that correctly completes the fact (s, r, •)
Causal tracing
- Causal Tracing is a method for localizing information in the forward pass of an autoregressive Transformer....
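The data notation in this summary can be made concrete with a small sketch. The class name `Fact`, its field names, and the example strings below are all illustrative assumptions, not taken from the paper; the sketch only shows how a prompt for (s, r, o) and its s* and r* variations relate to each other.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Fact:
    """Hypothetical container for a fact (s, r, o) in the summary's notation."""
    subject: str    # s
    relation: str   # r, written as a template with one {} slot for the subject
    obj_true: str   # o_true: correctly completes (s, r, .)
    obj_false: str  # o_false: incorrectly completes (s, r, .)

    def prompt(self, subject: Optional[str] = None, relation: Optional[str] = None) -> str:
        """Build the prompt P, optionally swapping in s* or r*."""
        return (relation or self.relation).format(subject or self.subject)


# Illustrative values only; none of these strings come from the paper.
fact = Fact("Paris", "{} is the capital of", "France", "Germany")
base_prompt = fact.prompt()                                                # uses s and r
neighbor_prompt = fact.prompt(subject="Berlin")                            # s* replaces s
paraphrase_prompt = fact.prompt(relation="The country whose capital is {} is")  # r* replaces r
```

The noised subject s_noise has no string form here, because it is a perturbation of the subject's hidden representation rather than of the prompt text.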

Mastering Diverse Domains through World Models

Abstract
- General intelligence requires solving tasks across many domains.
- DreamerV3 is a general and scalable algorithm based on world models that outperforms previous approaches.
- DreamerV3 is able to work across a wide range of domains with fixed hyperparameters.
- DreamerV3 has favorable scaling properties, with larger models leading to higher data-efficiency and performance.
- DreamerV3 is the first algorithm to collect diamonds in Minecraft from scratch without human data or curricula....

Neural Radiance Field Codebooks

Abstract
- Compositional representations of the world are a promising step towards enabling high-level scene understanding.
- Learning such representations for complex scenes and tasks is an open challenge.
- Neural Radiance Field Codebooks (NRC) is a method for learning object-centric representations through novel view reconstruction.
- NRC representations transfer well to object navigation in THOR.
- NRC performs unsupervised segmentation better than prior methods....

On the Robustness of AlphaFold: A COVID-19 Case Study

Abstract
- AlphaFold is a protein folding neural network (PFNN) that predicts accurate protein structures
- AlphaFold’s robustness has not been explored
- The robustness of the predicted structures is measured using the RMSD and GDT similarity measures
- Minimally perturbing protein sequences to fool protein folding neural networks is NP-complete
- Adversarial protein sequences can lead to a large RMSD between the structures predicted for the adversarial and original sequences
Paper Content
Introduction
- Proteins are essential for life and reproduction
- Proteins are built from 20 standard amino acids
- It is important to understand the 3D structure of proteins
- High-throughput sequencing techniques have helped to understand the primary sequence of proteins
- AlphaFold has achieved success in predicting protein structures using neural networks
- Adversarial sequences can result in very different 3D protein structures
Summary and related work
- PFNNs should obey the observation that small changes in protein sequence usually don’t lead to drastic changes in structure
- Studies have shown that two proteins with 50% sequence identity align within 1Å RMSD
- Exceptions exist where small perturbations can alter the entire fold of a protein
- Stein and Mchaourab 2021 used in silico mutagenesis to enhance AlphaFold prediction
- Del Alamo et al....
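The two similarity measures named in this summary can be sketched as follows. This is a simplified illustration, not the paper's evaluation code: it assumes both structures are given as equal-length lists of (x, y, z) coordinates in Å that are already superimposed, and the `gdt_ts` function uses the common 1, 2, 4, and 8 Å cutoffs without searching over superpositions as a full GDT implementation would. The function names and the toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-aligned coordinate lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("structures must be non-empty and the same length")
    squared = sum(math.dist(p, q) ** 2 for p, q in zip(coords_a, coords_b))
    return math.sqrt(squared / len(coords_a))


def gdt_ts(coords_a, coords_b, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT_TS: mean fraction of residues within each distance cutoff."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("structures must be non-empty and the same length")
    dists = [math.dist(p, q) for p, q in zip(coords_a, coords_b)]
    fractions = [sum(d <= c for d in dists) / len(dists) for c in cutoffs]
    return sum(fractions) / len(cutoffs)


# Toy three-residue structures; the last residue is displaced by 3 Å.
original = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
perturbed = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 3.0, 0.0)]

rmsd_value = rmsd(original, perturbed)    # sqrt(9 / 3), about 1.73 Å
gdt_value = gdt_ts(original, perturbed)   # (2/3 + 2/3 + 1 + 1) / 4, about 0.83
```

A low RMSD and a GDT score near 1 indicate similar structures, so a robust predictor should keep both close to those values under small sequence perturbations.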

RedMule: A Mixed-Precision Matrix-Matrix Operation Engine for Flexible and Energy-Efficient On-Chip Linear Algebra and TinyML Training Acceleration

Abstract
- Increasing interest in TinyML is pushing toward enabling TinyML-class training.
- Current training algorithms rely on floating-point matrix operations.
- This paper presents RedMulE, a low-power specialized accelerator for multi-precision floating-point operations.
- RedMulE-augmented PULP cluster achieves high GFLOPS/W and TFLOPS/W.
- RedMulE consumes less than 60 mW on average, enabling on-device training of deep learning models....

How Data Scientists Review the Scholarly Literature

Abstract
- Keeping up with research literature is important for scientists
- This paper examines literature review practices of data scientists
- Data science is a field with an exponential rise in papers
- Tools have been developed to help data scientists cope with the deluge of research
- Interviews and think-aloud protocols were conducted to uncover challenges faced by data scientists
- Challenges include seeking and sensemaking of papers, understanding papers with missing details, and grappling with the deluge
- Data scientists rely on peers online and in-person for help
Paper Content
Introduction
- Literature reviews are important for scientists....

Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling

Abstract
- The convolution operation cannot handle irregular, random-masked input images
- The single-scale nature of BERT pre-training is inconsistent with a convnet’s hierarchical structure
- Sparse convolution is used to encode unmasked pixels as sparse voxels of 3D point clouds
- A hierarchical decoder is developed to reconstruct images from multi-scale encoded features
- Sparse masKed modeling (SparK) can be used directly on any convolutional model without backbone modifications
- Surpasses state-of-the-art contrastive learning and transformer-based masked modeling by similarly large margins
- Improves object detection and instance segmentation up to +3....

Doc2Query--: When Less is More

Abstract
- Doc2Query is a technique used to improve the first-stage retrieval effectiveness of search engines.
- Sequence-to-sequence models are known to “hallucinate” content that is not present in the source text.
- This work explores techniques for filtering out these harmful queries prior to indexing.
- Using a relevance model to remove poor-quality queries can improve the retrieval effectiveness of Doc2Query....
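The filtering idea in this summary can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the paper's method: `filter_queries` and `overlap_score` are hypothetical names, the per-document `keep_fraction` rule is an assumed filtering policy, and the token-overlap scorer is only a toy stand-in for a learned relevance model.

```python
import math


def overlap_score(query, document):
    """Toy stand-in for a relevance model: fraction of query tokens found in the document."""
    doc_tokens = set(document.lower().split())
    query_tokens = query.lower().split()
    if not query_tokens:
        return 0.0
    return sum(token in doc_tokens for token in query_tokens) / len(query_tokens)


def filter_queries(document, queries, score_fn, keep_fraction=0.5):
    """Keep the highest-scoring fraction of the generated queries for one document."""
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    ranked = sorted(queries, key=lambda q: score_fn(q, document), reverse=True)
    n_keep = math.ceil(len(ranked) * keep_fraction)
    return ranked[:n_keep]


# Hypothetical document and generated queries; the second query is "hallucinated".
document = "the eiffel tower is in paris"
generated = [
    "where is the eiffel tower",
    "who built the statue of liberty",
    "eiffel tower paris",
]

# Only the surviving queries would be appended to the document before indexing.
kept = filter_queries(document, generated, overlap_score, keep_fraction=0.5)
```

Swapping `overlap_score` for a stronger scorer changes only the `score_fn` argument, which keeps the filtering policy separate from the choice of relevance model.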