Rethinking with Retrieval: Faithful Large Language Model Inference
December 31, 2022 · 834 words · Hangfeng He, Hongming Zhang, Dan Roth
DensePose From WiFi
December 31, 2022 · 1224 words · Jiaqi Geng, Dong Huang, Fernando De la Torre
Nowcasting Stock Implied Volatility with Twitter
December 31, 2022 · 1201 words · Thomas Dierckx, Jesse Davis, Wim Schoutens
Design on Matroids: Diversity vs. Meritocracy
December 31, 2022 · 818 words · Isa E. Hafalir, Fuhito Kojima, M. Bumin Yenmez, Koji Yokote
A Survey for In-context Learning
December 31, 2022 · 1051 words · Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Zhiyong Wu and 5 others
Efficient Market Design with Distributional Objectives
December 31, 2022 · 927 words · Isa E. Hafalir, Fuhito Kojima, M. Bumin Yenmez
Effective Brain Connectome: the whole-brain effective connectivity from neural perturbational inference
December 31, 2022 · 1890 words · Zixiang Luo, Zhichao Liang, Chenyu Xu, Changsong Zhou, Quanying Liu
Integrated information theory (IIT) 4.0: Formulating the properties of phenomenal existence in physical terms
December 30, 2022 · 1188 words · Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso, Andrew M Haun and 11 others
MAUVE Scores for Generative Models: Theory and Practice
December 30, 2022 · 1786 words · Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta and 4 others
Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats
December 29, 2022 · 999 words · István Sárándi, Alexander Hermans, Bastian Leibe
GPT Takes the Bar Exam
December 29, 2022 · 737 words · Michael Bommarito II, Daniel Martin Katz
Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks
December 29, 2022 · 699 words · Vincent Herrmann, Louis Kirsch, Jürgen Schmidhuber
‘Real Attackers Don’t Compute Gradients’: Bridging the Gap Between Adversarial ML Research and Practice
December 29, 2022 · 1955 words · Giovanni Apruzzese, Hyrum S. Anderson, Savino Dambra, David Freeman, Fabio Pierazzi and 1 other
What Estimators Are Unbiased For Linear Models?
December 29, 2022 · 1734 words · Lihua Lei, Jeffrey Wooldridge
Cramming: Training a Language Model on a Single GPU in One Day
December 28, 2022 · 1055 words · Jonas Geiping, Tom Goldstein
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
December 28, 2022 · 912 words · Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang and 2 others
A System-Level View on Out-of-Distribution Data in Robotics
December 28, 2022 · 849 words · Rohan Sinha, Apoorva Sharma, Somrita Banerjee, Thomas Lew, Rachel Luo and 4 others
Feature learning in neural networks and kernel machines that recursively learn features
December 28, 2022 · 1479 words · Adityanarayanan Radhakrishnan, Daniel Beaglehole, Parthe Pandit, Mikhail Belkin
Sparse Coding in a Dual Memory System for Lifelong Learning
December 28, 2022 · 1021 words · Fahad Sarfraz, Elahe Arani, Bahram Zonooz
NeRN – Learning Neural Representations for Neural Networks
December 27, 2022 · 1042 words · Maor Ashkenazi, Zohar Rimon, Ron Vainshtein, Shir Levi, Elad Richardson and 2 others
Building a Culture of Reproducibility in Academic Research
December 27, 2022 · 766 words · Jimmy Lin
A Generalization of ViT/MLP-Mixer to Graphs
December 27, 2022 · 907 words · Xiaoxin He, Bryan Hooi, Thomas Laurent, Adam Perold, Yann LeCun and 1 other
The Forward-Forward Algorithm: Some Preliminary Investigations
December 27, 2022 · 1284 words · Geoffrey Hinton
Structure-based drug discovery with deep learning
December 26, 2022 · 765 words · Rıza Özçelik, Derek van Tilborg, José Jiménez-Luna, Francesca Grisoni
Fully Differentiable RANSAC
December 26, 2022 · 820 words · Tong Wei, Yash Patel, Jiri Matas, Daniel Barath
Large Language Models Encode Clinical Knowledge
December 26, 2022 · 733 words · Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei and 25 others
TextBox 2.0: A Text Generation Library with Pre-trained Language Models
December 26, 2022 · 553 words · Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu and 7 others
Sitting Posture Recognition Using a Spiking Neural Network
December 25, 2022 · 988 words · Jianquan Wang, Basim Hafidh, Haiwei Dong, Abdulmotaleb El Saddik
Closed-form control with spike coding networks
December 25, 2022 · 1106 words · Filip S. Slijkhuis, Sander W. Keemink, Pablo Lanillos
GraphCast: Learning skillful medium-range global weather forecasting
December 24, 2022 · 918 words · Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato and 13 others
Detecting Objects with Graph Priors and Graph Refinement
December 23, 2022 · 727 words · Aritra Bhowmik, Martin R. Oswald, Yu Wang, Nora Baka, Cees G. M. Snoek
SuperGF: Unifying Local and Global Features for Visual Localization
December 23, 2022 · 1070 words · Wenzheng Song, Ran Yan, Boshu Lei, Takayuki Okatani
Stop using the elbow criterion for k-means and how to choose the number of clusters instead
December 23, 2022 · 715 words · Erich Schubert
The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes
December 23, 2022 · 1174 words · Alexander Atanasov, Blake Bordelon, Sabarish Sainathan, Cengiz Pehlevan
Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
December 23, 2022 · 1333 words · William Brannon, Yogesh Virkar, Brian Thompson
Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?
December 23, 2022 · 810 words · Byung-Doh Oh, William Schuler
How different are self and nonself?
December 22, 2022 · 93 words · Andreas Mayer, Christopher J. Russo, Quentin Marcou, William Bialek, Benjamin D. Greenbaum
Deep learning for size-agnostic inverse design of random-network 3D printed mechanical metamaterials
December 22, 2022 · 1165 words · Helda Pahlavani, Kostas Tsifoutis-Kazolis, Prerak Mody, Jie Zhou, Mohammad J. Mirzaali and 1 other
Scalable Adaptive Computation for Iterative Generation
December 22, 2022 · 746 words · Allan Jabri, David Fleet, Ting Chen
Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized Photography
December 22, 2022 · 943 words · Ilya Chugunov, Yuxuan Zhang, Felix Heide
Beyond SOT: It’s Time to Track Multiple Generic Objects at Once
December 22, 2022 · 1283 words · Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool and 1 other
Impossibility Theorems for Feature Attribution
December 22, 2022 · 1610 words · Blair Bilodeau, Natasha Jaques, Pang Wei Koh, Been Kim
GOOD: Exploring Geometric Cues for Detecting Objects in an Open World
December 22, 2022 · 577 words · Haiwen Huang, Andreas Geiger, Dan Zhang
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
December 22, 2022 · 704 words · Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu and 4 others
Local Policy Improvement for Recommender Systems
December 22, 2022 · 787 words · Dawen Liang, Nikos Vlassis
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
December 21, 2022 · 1041 words · Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi
Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning
December 21, 2022 · 1458 words · Chris Lengerich, Gabriel Synnaeve, Amy Zhang, Hugh Leather, Kurt Shuster and 2 others
Generalized Decoding for Pixel, Image, and Language
December 21, 2022 · 1077 words · Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li and 9 others
Training language models for deeper understanding improves brain alignment
December 21, 2022 · 867 words · Khai Loong Aw, Mariya Toneva
From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models
December 21, 2022 · 914 words · Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li and 2 others
Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing
December 21, 2022 · 588 words · Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao and 4 others
Hierarchically branched diffusion models for efficient and interpretable multi-class conditional generation
December 21, 2022 · 990 words · Alex M. Tseng, Tommaso Biancalani, Max Shen, Gabriele Scalia
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
December 21, 2022 · 923 words · Zhiyang Xu, Ying Shen, Lifu Huang
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
December 21, 2022 · 1232 words · John Wieting, Jonathan H. Clark, William W. Cohen, Graham Neubig, Taylor Berg-Kirkpatrick
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks
December 21, 2022 · 990 words · Jimmy Z. Di, Jack Douglas, Jayadev Acharya, Gautam Kamath, Ayush Sekhari
There’s Plenty of Room Right Here: Biological Systems as Evolved, Overloaded, Multi-scale Machines
December 20, 2022 · 1525 words · Joshua Bongard, Michael Levin
Does unsupervised grammar induction need pixels?
December 20, 2022 · 601 words · Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty and 5 others
Debiasing NLP Models Without Demographic Information
December 20, 2022 · 980 words · Hadas Orgad, Yonatan Belinkov
Character-Aware Models Improve Visual Text Rendering
December 20, 2022 · 1045 words · Rosanne Liu, Dan Garrette, Chitwan Saharia, William Chan, Adam Roberts and 5 others
Parsel: A Unified Natural Language Framework for Algorithmic Reasoning
December 20, 2022 · 1610 words · Eric Zelikman, Qian Huang, Gabriel Poesia, Noah D. Goodman, Nick Haber
Self-Instruct: Aligning Language Model with Self Generated Instructions
December 20, 2022 · 771 words · Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith and 2 others
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers
December 20, 2022 · 637 words · Damai Dai, Yutao Sun, Li Dong, Yaru Hao, Zhifang Sui and 1 other
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
December 20, 2022 · 884 words · Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella and 4 others
PairReranker: Pairwise Reranking for Natural Language Generation
December 20, 2022 · 707 words · Dongfu Jiang, Bill Yuchen Lin, Xiang Ren
A Length-Extrapolatable Transformer
December 20, 2022 · 644 words · Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang and 4 others
RangeAugment: Efficient Online Augmentation with Range Learning
December 20, 2022 · 1157 words · Sachin Mehta, Saeid Naderiparizi, Fartash Faghri, Maxwell Horton, Lailin Chen and 3 others
Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts
December 20, 2022 · 610 words · Skyler Hallinan, Alisa Liu, Yejin Choi, Maarten Sap
A Survey of Deep Learning for Mathematical Reasoning
December 20, 2022 · 1382 words · Pan Lu, Liang Qiu, Wenhao Yu, Sean Welleck, Kai-Wei Chang
Trustworthy Social Bias Measurement
December 20, 2022 · 982 words · Rishi Bommasani, Percy Liang
Is GPT-3 a Psychopath? Evaluating Large Language Models from a Psychological Perspective
December 20, 2022 · 951 words · Xingxuan Li, Yutong Li, Linlin Liu, Lidong Bing, Shafiq Joty
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
December 20, 2022 · 790 words · Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training
December 20, 2022 · 826 words · Kelly Marchisio, Patrick Lewis, Yihong Chen, Mikel Artetxe
A Measure-Theoretic Characterization of Tight Language Models
December 20, 2022 · 1204 words · Li Du, Lucas Torroba Hennigen, Tiago Pimentel, Clara Meister, Jason Eisner and 1 other
Precise Zero-Shot Dense Retrieval without Relevance Labels
December 20, 2022 · 785 words · Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan
LAMBADA: Backward Chaining for Automated Reasoning in Natural Language
December 20, 2022 · 892 words · Seyed Mehran Kazemi, Najoung Kim, Deepti Bhatia, Xin Xu, Deepak Ramachandran
Controllable Text Generation with Language Constraints
December 20, 2022 · 860 words · Howard Chen, Huihan Li, Danqi Chen, Karthik Narasimhan
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
December 20, 2022 · 804 words · Hyunwoo Kim, Jack Hessel, Liwei Jiang, Ximing Lu, Youngjae Yu and 6 others
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
December 20, 2022 · 888 words · Nikita Moghe, Evgeniia Razumovskaia, Liane Guillou, Ivan Vulić, Anna Korhonen and 1 other
Recycling diverse models for out-of-distribution generalization
December 20, 2022 · 913 words · Alexandre Ramé, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, Léon Bottou and 1 other
HouseCat6D – A Large-Scale Multi-Modal Category Level 6D Object Pose Dataset with Household Objects in Realistic Scenarios
December 20, 2022 · 1381 words · HyunJun Jung, Shun-Cheng Wu, Patrick Ruhkamp, Hannah Schieber, Pengyuan Wang and 6 others
Settling the Reward Hypothesis
December 20, 2022 · 1049 words · Michael Bowling, John D. Martin, David Abel, Will Dabney
Quantifying Local Extrinsic Curvature in Neural Manifolds
December 20, 2022 · 844 words · Francisco E. Acosta, Sophia Sanborn, Khanh Dao Duc, Manu Madhav, Nina Miolane
Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations
December 20, 2022 · 914 words · Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei Jiang and 2 others
Towards Reasoning in Large Language Models: A Survey
December 20, 2022 · 790 words · Jie Huang, Kevin Chen-Chuan Chang
Extrinsic Evaluation of Machine Translation Metrics
December 20, 2022 · 1106 words · Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch
High-resolution canopy height map in the Landes forest (France) based on GEDI, Sentinel-1, and Sentinel-2 data with a deep learning approach
December 20, 2022 · 2157 words · Martin Schwartz, Philippe Ciais, Catherine Ottlé, Aurelien De Truchis, Cedric Vega and 9 others
ReCode: Robustness Evaluation of Code Generation Models
December 20, 2022 · 914 words · Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang and 9 others
On the Role of Parallel Data in Cross-lingual Transfer Learning
December 20, 2022 · 635 words · Machel Reid, Mikel Artetxe
Multi-asset market making under the quadratic rough Heston
December 20, 2022 · 1218 words · Mathieu Rosenbaum, Jianfei Zhang
Goal-oriented Autonomous Driving
December 20, 2022 · 766 words · Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima and 11 others
Large Language Models Are Reasoning Teachers
December 20, 2022 · 833 words · Namgyu Ho, Laura Schmid, Se-Young Yun
Language Modeling with Latent Situations
December 20, 2022 · 968 words · Belinda Z. Li, Maxwell Nye, Jacob Andreas
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
December 20, 2022 · 870 words · Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati and 3 others
(QA)$^2$: Question Answering with Questionable Assumptions
December 20, 2022 · 885 words · Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty
Defending Against Poisoning Attacks in Open-Domain Question Answering
December 20, 2022 · 676 words · Orion Weller, Aleem Khan, Nathaniel Weir, Dawn Lawrie, Benjamin Van Durme
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
December 20, 2022 · 760 words · Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu and 2 others
Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
December 19, 2022 · 527 words · Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Yang Wang and 1 other
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
December 19, 2022 · 1093 words · Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar and 2 others
Policy learning ‘without’ overlap: Pessimism and generalized empirical Bernstein’s inequality
December 19, 2022 · 2355 words · Ying Jin, Zhimei Ren, Zhuoran Yang, Zhaoran Wang
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
December 19, 2022 · 1489 words · Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts
Training Trajectories of Language Models Across Scales
December 19, 2022 · 679 words · Mengzhou Xia, Mikel Artetxe, Chunting Zhou, Xi Victoria Lin, Ramakanth Pasunuru and 3 others
Scalable Diffusion Models with Transformers
December 19, 2022 · 797 words · William Peebles, Saining Xie
Evaluating Human-Language Model Interaction
December 19, 2022 · 1257 words · Mina Lee, Megha Srivastava, Amelia Hardy, John Thickstun, Esin Durmus and 13 others
DSI++: Updating Transformer Memory with New Documents
December 19, 2022 · 1325 words · Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran and 4 others
One Embedder, Any Task: Instruction-Finetuned Text Embeddings
December 19, 2022 · 1312 words · Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu and 5 others
Speaking Style Conversion With Discrete Self-Supervised Units
December 19, 2022 · 730 words · Gallil Maimon, Yossi Adi
KNIFE: Knowledge Distillation with Free-Text Rationales
December 19, 2022 · 1053 words · Aaron Chan, Zhiyuan Zeng, Wyatt Lake, Brihi Joshi, Hanjie Chen and 1 other
The case for 4-bit precision: k-bit Inference Scaling Laws
December 19, 2022 · 1046 words · Tim Dettmers, Luke Zettlemoyer
Continual Learning for Instruction Following from Realtime Feedback
December 19, 2022 · 992 words · Alane Suhr, Yoav Artzi
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
December 19, 2022 · 1265 words · Or Honovich, Thomas Scialom, Omer Levy, Timo Schick
A Natural Bias for Language Generation Models
December 19, 2022 · 726 words · Clara Meister, Wojciech Stokowiec, Tiago Pimentel, Lei Yu, Laura Rimell and 1 other
Multilingual Sequence-to-Sequence Models for Hebrew NLP
December 19, 2022 · 513 words · Matan Eyal, Hila Noga, Roee Aharoni, Idan Szpektor, Reut Tsarfaty
MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering
December 19, 2022 · 755 words · Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee and 4 others
Visconde: Multi-document QA with GPT-3 and Neural Reranking
December 19, 2022 · 567 words · Jayr Pereira, Robson Fidalgo, Roberto Lotufo, Rodrigo Nogueira
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
December 19, 2022 · 1090 words · Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie and 42 others
Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation
December 19, 2022 · 1091 words · Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, André F. T. Martins
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
December 19, 2022 · 1056 words · Yong Cheng, Yu Zhang, Melvin Johnson, Wolfgang Macherey, Ankur Bapna
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
December 19, 2022 · 1128 words · Zheng-Xin Yong, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani and 9 others
Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization
December 19, 2022 · 773 words · Dustin Wright, Isabelle Augenstein
StyleTRF: Stylizing Tensorial Radiance Fields
December 19, 2022 · 1142 words · Rahul Goel, Sirikonda Dhawal, Saurabh Saini, P. J. Narayanan
Transferring General Multimodal Pretrained Models to Text Recognition
December 19, 2022 · 477 words · Junyang Lin, Xuancheng Ren, Yichang Zhang, Gao Liu, Peng Wang and 2 others
APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning
December 19, 2022 · 855 words · Soumya Sanyal, Yichong Xu, Shuohang Wang, Ziyi Yang, Reid Pryzant and 3 others
Multi hash embeddings in spaCy
December 19, 2022 · 1029 words · Lester James Miranda, Ákos Kádár, Adriane Boyd, Sofie Van Landeghem, Anders Søgaard and 1 other
Discovering Language Model Behaviors with Model-Written Evaluations
December 19, 2022 · 1169 words · Ethan Perez, Sam Ringer, Kamilė Lukošiūtė, Karina Nguyen, Edwin Chen and 58 others
Natural Language to Code Generation in Interactive Data Science Notebooks
December 19, 2022 · 1069 words · Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen and 7 others
Emergent Analogical Reasoning in Large Language Models
December 19, 2022 · 1428 words · Taylor Webb, Keith J. Holyoak, Hongjing Lu
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
December 18, 2022 · 1133 words · Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff and 1 other
Language model acceptability judgements are not always robust to context
December 18, 2022 · 706 words · Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes and 2 others
Beyond the C: Retargetable Decompilation using Neural Machine Translation
December 17, 2022 · 1533 words · Iman Hosseini, Brendan Dolan-Gavitt
Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark
December 17, 2022 · 975 words · Xiaofeng Wang, Zheng Zhu, Yunpeng Zhang, Guan Huang, Yun Ye and 3 others
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy
December 17, 2022 · 943 words · Long Lian, Zhirong Wu, Stella X. Yu
Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
December 17, 2022 · 678 words · Jifan Chen, Yuhao Zhang, Lan Liu, Rui Dong, Xinchi Chen and 3 others
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
December 16, 2022 · 1192 words · Alex Nichol, Heewoo Jun, Prafulla Dhariwal, Pamela Mishkin, Mark Chen
Neural Story Planning
December 16, 2022 · 1050 words · Anbang Ye, Christopher Cui, Taiwei Shi, Mark O. Riedl
‘Rarely’ a problem? Language models exhibit inverse scaling in their predictions following ‘few’-type quantifiers
December 16, 2022 · 471 words · James A. Michaelov, Benjamin K. Bergen
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
December 16, 2022 · 969 words · Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui and 4 others
Attentive Mask CLIP
December 16, 2022 · 964 words · Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang and 6 others
Connecting Permutation Equivariant Neural Networks and Partition Diagrams
December 16, 2022 · 2488 words · Edward Pearce-Crump
Efficient Conditionally Invariant Representation Learning
December 16, 2022 · 985 words · Roman Pogodin, Namrata Deka, Yazhe Li, Danica J. Sutherland, Victor Veitch and 1 other
Brauer’s Group Equivariant Neural Networks
December 16, 2022 · 962 words · Edward Pearce-Crump
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
December 16, 2022 · 1185 words · Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
December 16, 2022 · 885 words · David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà
Biomedical image analysis competitions: The state of current participation practice
December 16, 2022 · 988 words · Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee and 350 others
Fake it till you make it: Learning(s) from a synthetic ImageNet clone
December 16, 2022 · 1136 words · Mert Bulent Sariyildiz, Karteek Alahari, Diane Larlus, Yannis Kalantidis
Teaching Small Language Models to Reason
December 16, 2022 · 864 words · Lucie Charlotte Magister, Jonathan Mallinson, Jakub Adamek, Eric Malmi, Aliaksei Severyn
How to disagree well: Investigating the dispute tactics used on Wikipedia
December 16, 2022 · 985 words · Christine de Kock, Tom Stafford, Andreas Vlachos
ALERT: Adapting Language Models to Reasoning Tasks
December 16, 2022 · 780 words · Ping Yu, Tianlu Wang, Olga Golovneva, Badr Alkhamissy, Gargi Ghosh and 2 others
SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation
December 16, 2022 · 608 words · Jee Seok Yoon, Chenghao Zhang, Heung-Il Suk, Jia Guo, Xiaoxiao Li
Economic impacts of AI-augmented R&D
December 15, 2022 · 1977 words · Tamay Besiroglu, Nicholas Emery-Xu, Neil Thompson
Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines
December 15, 2022 · 1083 words · Andrew Lee, David Wu, Emily Dinan, Mike Lewis
FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference
December 15, 2022 · 864 words · Michiel de Jong, Yury Zemlyanskiy, Joshua Ainslie, Nicholas FitzGerald, Sumit Sanghai and 2 others
Efficient Long Sequence Modeling via State Space Augmented Transformer
December 15, 2022 · 967 words · Simiao Zuo, Xiaodong Liu, Jian Jiao, Denis Charles, Eren Manavoglu and 2 others
On Second Thought, Let’s Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
December 15, 2022 · 959 words · Omar Shaikh, Hongxin Zhang, William Held, Michael Bernstein, Diyi Yang
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
December 15, 2022 · 1632 words · Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang and 5 others
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
December 15, 2022 · 954 words · William Held, Christopher Hidey, Fei Liu, Eric Zhu, Rahul Goel and 2 others
Objaverse: A Universe of Annotated 3D Objects
December 15, 2022 · 901 words · Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel and 5 others
Image-and-Language Understanding from Pixels Only
December 15, 2022 · 1127 words · Michael Tschannen, Basil Mustafa, Neil Houlsby
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
December 15, 2022 · 971 words · Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor and 15 others
FlexiViT: One Model for All Patch Sizes
December 15, 2022 · 1204 words · Lucas Beyer, Pavel Izmailov, Alexander Kolesnikov, Mathilde Caron, Simon Kornblith and 5 others
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
December 15, 2022 · 1307 words · Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan and 6 others
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
December 15, 2022 · 553 words · Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer and 2 others
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers
December 15, 2022 · 777 words · Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker and 20 others
Multimodal Teacher Forcing for Reconstructing Nonlinear Dynamical Systems
December 15, 2022 · 609 words · Manuel Brenner, Georgia Koppe, Daniel Durstewitz
Manifestations of Xenophobia in AI Systems
December 15, 2022 · 1606 words · Nenad Tomasev, Jonathan Leader Maynard, Iason Gabriel
Protein Structure Prediction until CASP15
December 15, 2022 · 467 words · Arne Elofsson
Transformers learn in-context by gradient descent
December 15, 2022 · 952 words · Johannes von Oswald, Eyvind Niklasson, Ettore Randazzo, João Sacramento, Alexander Mordvintsev and 2 others
RT-1: Robotics Transformer for Real-World Control at Scale
December 13, 2022 · 1320 words · Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis and 46 others
Multi-Concept Customization of Text-to-Image Diffusion
December 8, 2022 · 745 words · Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu
ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation
December 7, 2022 · 1561 words · Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao
Robust Speech Recognition via Large-Scale Weak Supervision
December 6, 2022 · 1590 words · Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey and 1 other
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
December 6, 2022 · 898 words · Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang and 12 others
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
December 3, 2022 · 1494 words · Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, Risheng Yu and 2 others
Scaling Language-Image Pre-training via Masking
December 1, 2022 · 449 words · Yanghao Li, Haoqi Fan, Ronghang Hu, Christoph Feichtenhofer, Kaiming He
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
December 1, 2022 · 853 words · Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich