DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields

DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Novel optimization algorithm called DroNeRF for autonomous positioning of monocular camera drones Used to generate new views of an object or scene from a set of input images Calculates optimized pose for individual drones without using external localization system Evaluates quality of generated novel views using perceptual metrics like PSNR and SSIM Paper Content Iii....

March 8, 2023 · 609 words · Dipam Patel, Phu Pham, Aniket Bera
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Denoising Diffusion models are proficient for generative sampling. Techniques such as BTD have been proposed to reduce the number of network calls. TRACT is a new method that extends BTD. TRACT improves FID by up to 2.4x on the same architecture. TRACT achieves new single-step Denoising Diffusion Implicit Models state-of-the-art FID. PyTorch implementation will be released soon....

March 7, 2023 · 737 words · David Berthelot, Arnaud Autef, Jierui Lin, Dian Ang Yap, Shuangfei Zhai and 4 others
How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding

How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Transformers have been successful across many domains, but their learning mechanics are not well understood. Recent research has begun to study the representational aspects of transformers, but there is no guarantee the learning dynamics will converge. This paper provides a mechanistic understanding of how transformers learn “semantic structure” by combining experiments on synthetic data, Wikipedia data, and mathematical analysis....

March 7, 2023 · 1421 words · Yuchen Li, Yuanzhi Li, Andrej Risteski
Computing with Categories in Machine Learning

Computing with Categories in Machine Learning

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Category theory has been applied to various scientific domains. DisCoPyro combines categorical structures with amortized variational inference. DisCoPyro can be applied to program learning for variational autoencoders. DisCoPyro provides mathematical foundations and concrete applications. DisCoPyro is compared to other models (e.g. neuro-symbolic models). DisCoPyro could contribute to the development of artificial general intelligence....

March 7, 2023 · 545 words · Eli Sennesh, Tom Xu, Yoshihiro Maruyama
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Semantic occupancy perception is essential for autonomous driving. Existing benchmarks lack diversity in urban scenes and only evaluate front-view predictions. OpenOccupancy is the first surrounding semantic occupancy perception benchmark. Annotations rely on LiDAR points superimposition, which can miss some occupancy labels. Augmenting And Purifying (AAP) pipeline is used to densify the annotations....

March 7, 2023 · 911 words · Xiaofeng Wang, Zheng Zhu, Wenbo Xu, Yunpeng Zhang, Yi Wei and 5 others
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Need for large-scale high-quality text datasets BigScience workshop formed to research and train large language models ROOTS corpus created, 1.6TB dataset spanning 59 languages BLOOM language model trained using ROOTS corpus Large initial subset of corpus released with processing tools Paper Content Introduction BigScience1 is a one-year open collaborative research initiative Goal was to train an open-access, massively multilingual language model Engaged in ethical, sociopolitical, and data governance issues Four working groups: Data Governance, Data Sourcing and Preparation, Privacy, Legal Scholarship Released a large subset of ROOTS Released data tools used to curate, source, clean and inspect constituent datasets Outline of the paper Collected a web-scale dataset covering 59 languages 46 natural languages and 13 programming languages 62% of text from community-selected and documented list of language data sources 38% of text from pre-processed web crawl, OSCAR Filtered with help of native speakers Related work Pre-trained models are used in natural language processing Performance is based on model size and dataset size/quality Recent models trained on up to 1....

March 7, 2023 · 968 words · Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral and 49 others
New Perspectives on Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization

New Perspectives on Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Optimal transport-based distributionally robust optimization problems studied Fictitious adversary (nature) can choose distribution of uncertain problem parameters Robustification related to various forms of variation and Lipschitz regularization Conditions for existence and computability of Nash equilibrium between decision-maker and nature Paper Content Introduction Stochastic optimization methods are becoming popular in machine learning. A stochastic optimization problem aims to minimize the expected value of an uncertainty-affected loss function....

March 7, 2023 · 1535 words · Soroosh Shafieezadeh-Abadeh, Liviu Aolaritei, Florian Dörfler, Daniel Kuhn
Selecting Features for Markov Modeling: A Case Study on HP35

Selecting Features for Markov Modeling: A Case Study on HP35

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Markov state models are used to interpret molecular dynamics trajectories. Structurally distinct conformations are needed to understand the biomolecular process. Dihedral angles and interresidue distances are used as input coordinates. Contacts are used to define and select contact distances. Low-pass filtering and correlation-based characterization of states are used. States of the Markov model are discriminated by the features....

March 7, 2023 · 1219 words · Daniel Nagel, Sofia Sartore, Gerhard Stock
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles

Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Focuses on a novel optimization problem where the objective function can only be evaluated through a ranking oracle. RLHF is an example of this problem, used to improve the quality of LLMs with human guidance. Proposes ZO-RankSGD, a zeroth-order optimization algorithm with a theoretical guarantee. Can be applied to policy search problems in reinforcement learning with only a ranking oracle of the episode reward....

March 7, 2023 · 1239 words · Zhiwei Tang, Dmitry Rybin, Tsung-Hui Chang
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Link to paper The full paper is available here. You can also find the paper on PapersWithCode here. Abstract Many works have focused on reducing the number of FLOPs to design fast neural networks. Reducing FLOPs does not necessarily lead to a similar level of reduction in latency. Low FLOPS is mainly due to frequent memory access of operators, especially the depthwise convolution. Proposed a novel partial convolution (PConv) to extract spatial features more efficiently....

March 7, 2023 · 1090 words · Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo, Song Wen and 2 others