ML Arxiv Haul #8

Aug 28, 2022

These have become useful to me as a mental bookmarking exercise and as a summary for interesting papers.

A Dyson Sphere around a black hole

The search for extraterrestrial intelligence (SETI) has been conducted for nearly 60 years. A Dyson Sphere, a spherical…

arxiv.org

This paper is a year old, but I think it resurfaced in a weird Twitter or podcast deep dive. Previous work had considered whether a Dyson sphere could be built around a supermassive black hole and powered by cosmic microwave background (CMB) radiation. The authors consider other sources of energy which would make a black hole appealing, and how these might be detectable with our telescopes.

Adversarial Attacks on Image Generation With Made-Up Words

Text-guided image generation models can be prompted to generate images using nonce words adversarially designed to…

arxiv.org

After June’s Discovering the Hidden Vocabulary of DALLE-2, it’s interesting to see someone else develop their own take. Millière describes two types of nonsense prompts: “macaronic”, which mixes tokens from multiple languages (uccoisegeljaros to mean birds), and “evocative”, which mimics scientific names (ceralineus rabaventis to mean insects).

Challenges and Pitfalls of Bayesian Unlearning

Machine unlearning refers to the task of removing a subset of training data, thereby removing its contributions to a…

arxiv.org

In June I was in a Probabilistic AI class, so I have been trying to pick up some more knowledge about Bayes + ML. The paper discusses how Bayesian methods have been proposed to mathematically remove a record from a trained model (pitched as a way to comply with GDPR). These removal processes can have major negative effects; the authors point to hyperparameters as making or breaking a successful removal.

Efficient Training of Language Models to Fill in the Middle

We show that autoregressive language models can learn to infill text after we apply a straightforward transformation to…

arxiv.org

Work from OpenAI on setting up generative models to not just append text to the end, but to fill in text in the middle of a context. This would be super-cool because existing models and methods could be tuned to do this new task.

A Microsoft project (CodeT) specifically studies this in code generation. I’m particularly interested in it because you could write a comment to be infilled, then the code which you’d like to generate, and see what type of comment would have prompted the model.
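To make the "straightforward transformation" concrete, here is a minimal sketch of how a training document could be rearranged so that an ordinary left-to-right model learns to infill. The sentinel strings `<PRE>`, `<SUF>` and `<MID>` and the function names are assumptions for illustration; a real tokenizer would use its own special tokens, and this is not the paper's code.

```python
import random

# Hypothetical sentinel strings; real models define their own special tokens.
PRE, SUF, MID = "<PRE>", "<SUF>", "<MID>"


def to_fim(document: str, rng: random.Random) -> str:
    """Cut the document at two random points and move the middle to the end."""
    i, j = sorted(rng.randint(0, len(document)) for _ in range(2))
    prefix, middle, suffix = document[:i], document[i:j], document[j:]
    return f"{PRE}{prefix}{SUF}{suffix}{MID}{middle}"


def from_fim(example: str) -> str:
    """Undo to_fim, to check that the rearrangement loses no text.

    Assumes the document itself does not contain the sentinel strings.
    """
    rest = example[len(PRE):]
    prefix, rest = rest.split(SUF, 1)
    suffix, middle = rest.split(MID, 1)
    return prefix + middle + suffix


def infill_prompt(prefix: str, suffix: str) -> str:
    """Prompt for inference: the model continues after MID with the missing middle."""
    return f"{PRE}{prefix}{SUF}{suffix}{MID}"


doc = "def add(a, b):\n    return a + b\n"
example = to_fim(doc, random.Random(0))
print(from_fim(example) == doc)

# The comment-infilling idea: leave the comment blank and supply the code.
print(infill_prompt("# ", "\ndef add(a, b):\n    return a + b\n"))
```

Because the middle ends up last, the usual next-token objective covers it without any change to the model architecture.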

ferret: a Framework for Benchmarking Explainers on Transformers

Many interpretability tools allow practitioners and researchers to explain Natural Language Processing systems…

arxiv.org

ferret is a new Python library to take in HuggingFace models and run tons of different explanation methods.

Improved Text Classification via Test-Time Augmentation

Test-time augmentation — the aggregation of predictions across transformed examples of test inputs — is an…

arxiv.org

Hadn’t thought about this idea before. The experiments use the WILDS domain-shift dataset; at test time, copies of the input are noised through nlpaug and the predictions on those copies are aggregated.
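A toy sketch of test-time augmentation, assuming majority voting as the aggregation rule and a simple character swap as the noise. Both are stand-ins chosen to keep the example self-contained: `swap_adjacent_chars`, `predict_with_tta` and `toy_classifier` are hypothetical names, and the sketch does not call nlpaug or reproduce the paper's setup.

```python
import random
from collections import Counter
from typing import Callable


def swap_adjacent_chars(text: str, rng: random.Random) -> str:
    """Toy noise: swap one random pair of neighbouring characters."""
    if len(text) < 2:
        return text
    i = rng.randrange(len(text) - 1)
    return text[:i] + text[i + 1] + text[i] + text[i + 2:]


def predict_with_tta(
    classify: Callable[[str], str],
    text: str,
    n_copies: int = 8,
    seed: int = 0,
) -> str:
    """Classify the original input plus noisy copies, then take a majority vote."""
    rng = random.Random(seed)
    variants = [text] + [swap_adjacent_chars(text, rng) for _ in range(n_copies)]
    votes = Counter(classify(v) for v in variants)
    return votes.most_common(1)[0][0]


def toy_classifier(text: str) -> str:
    """Stand-in for a trained model."""
    return "question" if "?" in text else "statement"


print(predict_with_tta(toy_classifier, "is this a question?"))
```

With a real model, averaging predicted probabilities instead of voting on labels is another common way to aggregate.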

Improving Wikipedia Verifiability with AI

Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by…

openreview.net

Facebook/Meta’s plan to match all English Wikipedia articles to their cited sources. This is getting tech press as an ‘accurate’ Wikipedia, as though a super-intelligent no-nonsense AI will be checking our work, but if a bad article has a bad source, the retrieval model should agree with it. I also wonder about all of the articles which have non-web sources. This is covered in the paper’s discussion section:

we only considered references corresponding to web pages, but Wikipedia also cites books, scientific articles and other kind of documents. These include other modalities than just text, such as images and videos. To fully assess the quality of Wikipedia references, Side needs to become multi-modal

Language Models Can Teach Themselves to Program Better

This work shows how one can use large-scale language models (LMs) to synthesize programming problems with verified…

arxiv.org

After my AI Village talk, I’m continuing to read up on code-generation models. This paper generates new programming problems, and this could expand on the existing handful of open source code-generation problems and prompts.

Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human…

Frozen models trained to mimic static datasets can never improve their performance. Models that can employ…

arxiv.org

Facebook/Meta dialogue model related to their ‘BlenderBot’ project. Incorporates retrieval from the web and human feedback.

Model Zoo: A Growing “Brain” That Learns Continually

This paper argues that continual learning methods can benefit by splitting the capacity of the learner across multiple…

arxiv.org

Builds an ensemble of smaller models and weights their accuracy on different tasks.

Perspectives on Incorporating Expert Feedback into Model Updates

Machine learning (ML) practitioners are increasingly tasked with developing models that are aligned with non-technical…

arxiv.org

A taxonomy for improving ML datasets and models. I think maybe this could help people outside of ML understand where data augmentation etc. fall in this world.

Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators

Deep neural network (DNN) accelerators received considerable attention in recent years due to the potential to save…

arxiv.org

This is a peculiar sub-field which I had not heard about before, where they’ve developed a model which can handle imprecise or low-voltage hardware.
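For a feel of what "imprecise hardware" means here, the sketch below flips bits in 8-bit quantized weights at random, which is one simple way to simulate memory errors. The function name, the 8-bit width and the independent per-bit error rate `p` are assumptions for illustration, not details taken from the paper.

```python
import random
from typing import List


def inject_bit_errors(
    weights: List[int], p: float, rng: random.Random, bits: int = 8
) -> List[int]:
    """Flip each bit of each quantized weight independently with probability p."""
    noisy = []
    for w in weights:
        mask = 0
        for b in range(bits):
            if rng.random() < p:
                mask |= 1 << b  # mark bit b for flipping
        noisy.append(w ^ mask)  # XOR flips exactly the marked bits
    return noisy


quantized = [0, 17, 128, 255]  # hypothetical 8-bit weights
rng = random.Random(0)
print(inject_bit_errors(quantized, p=0.01, rng=rng))
```

A single flip in a high-order bit changes a weight by up to 128, which hints at why a model needs to be trained for this kind of noise rather than simply exposed to it.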

Scalable Interpretability via Polynomials

Generalized Additive Models (GAMs) have quickly become the leading choice for fully-interpretable machine learning…

arxiv.org

Facebook/Meta paper about a new approach to polynomial models, which are somewhat interpretable and perform better than other interpretable methods, even if they don’t match neural networks.

SKILL: Structured Knowledge Infusion for Large Language Models

Large language models (LLMs) have demonstrated human-level performance on a vast spectrum of natural language tasks…

arxiv.org

Improves language models by training them on a WikiData knowledge graph. The very largest models benefit much more from this process.

Survey of NLP in Pharmacology: Methodology, Tasks, Resources, Knowledge, and Tools

Natural language processing (NLP) is an area of artificial intelligence that applies information technologies to…

arxiv.org

Introduces NLP concepts and popular biology/medical BERT models to people in the pharma world.

Why do tree-based models still outperform deep learning on tabular data?

While deep learning has enabled tremendous progress on text and image datasets, its superiority on tabular data is not…

arxiv.org

This has been linked from multiple places in the past month. Tabular data (and time-series data, not part of this paper) are still difficult for neural networks. The pro-NN view is that the datasets and models just need to be bigger. This paper covers three things which are interesting about tabular data, and how different NN architectures handle them.

Written by Nick Doiron
