Paper: Controlling Neural Networks with Rule Representations (NeurIPS, 2021)

1: Controlling Neural Networks with Rule Representations (Presented on 10/05/2023)

Presenter: James Kelly

Code

Paper: InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

2: Understanding InstaFlow/Rectified Flow (Presented on 10/11/2023)

Presenter: Isamu Isozaki

Papers: Text Embeddings Reveal (Almost) As Much As Text + NEFTune: Noisy Embeddings Improve Instruction Finetuning

3: Mysteries of Text Embeddings (Presented on 10/19/2023)

Presenter: Isamu Isozaki

Paper: Training Image Derivatives: Increased Accuracy and Universal Robustness

4: Training Image Derivatives: Increased Accuracy and Universal Robustness (Presented on 11/08/2023)

Presenter: Vsevolod I. Avrutskiy. Author of the paper

5: Understanding Zephyr (Presented on 11/16/2023)

Presenter: Isamu Isozaki

Paper: Zephyr: Direct Distillation of LM Alignment

Papers: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks + Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering + RA-DIT: Retrieval-Augmented Dual Instruction Tuning

6: Literature Review on RAG (Retrieval Augmented Generation) for Custom Domains (Presented on 11/29/2023)

Presenter: Isamu Isozaki

Paper: Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

7: Understanding MagVIT2: Language Model Beats Diffusion: Tokenizer is Key to Visual Generation (Presented on 12/13/2023)

Presenter: Isamu Isozaki

8: Understanding Common Diffusion Noise Schedules and Sample Steps are Flawed (Presented on 12/21/2023)

Presenter: Isamu Isozaki

Paper: Common Diffusion Noise Schedules and Sample Steps are Flawed

Paper: The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey

9: The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey (Presented on 1/5/2024)

Presenter: Dhruv Dhamani. Author of the paper

Paper: Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

10: Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation (Presented on 1/12/2024)

Presenter: Phil Butler

Papers: On the acceptability of arguments and its fundamental role in non-monotonic reasoning, logic programming, and n-person games + An Answer Set Programming Approach to Argumentative Reasoning in the ASPIC+ Framework + HYPO’s legacy: introduction to the virtual special issue + Induction of Defeasible Logic Theories in the Legal Domain + Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset + Large Language Models in Law: A Survey + The Smart Court - A New Pathway to Justice in China?

Unfortunately, there is no recording, but a coauthor attended.

11: Literature Review on AI in Law (Presented on 2/2/2024)

Presenter: Isamu Isozaki

12: A forthcoming decoder-only foundation model for time-series forecasting & further research (Presented on 2/9/2024)

Presenter: Tonic

Paper: A decoder-only foundation model for time-series forecasting

Paper: Mamba: Linear-Time Sequence Modeling with Selective State Spaces

13: Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Presenter: Eric Auld

Paper: Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures

14: Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures

Presenter: Vincent Abbott. Author of the paper

Papers: TIES-Merging: Resolving Interference When Merging Models + Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch + ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization + Learning to Route Among Specialized Experts for Zero-Shot Generalization

15: SOTA on Model Merging

Presenter: Prateek Yadav. Author of TIES-Merging and ComPEFT

Papers: Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context + Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference + Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity + Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts

16: Gemini 1.5 Pro: Unlock reasoning and knowledge from entire books and movies in a single prompt

Presenter: Shashank Shekhar

Paper: HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction

17: HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction

Presenter: Harvie Zhang. Author of the paper

Papers: ProteinBERT: A universal deep-learning model of protein sequence and function + Detecting anomalous proteins using deep representations + Protein Language Models Expose Viral Mimicry and Immune Escape

18: ProteinBERT: A universal deep-learning model of protein sequence and function

Presenter: Dan Ofer. Author of the papers

Paper: Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

19: Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

I was absent from this meeting, so if anyone has the details, please let me know or open a PR to fill in this part!

20: Graph Machine Learning in the Era of Large Language Models (LLMs)

Presenter: Isamu Isozaki

Papers: Graph Machine Learning in the Era of Large Language Models (LLMs) + Large Language Models on Graphs: A Comprehensive Survey + House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation

Papers: GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence + Creating Suspenseful Stories: Iterative Planning with Large Language Models + Improving Pacing in Long-Form Story Planning + Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives + Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers + DOC: Improving Long Story Coherence With Detailed Outline Control + End-to-end Story Plot Generator + Weaver: Foundation Models for Creative Writing

21: Story Generation with AI

Presenter: Isamu Isozaki

Papers: Accurate structure prediction of biomolecular interactions with AlphaFold 3 + Highly accurate protein structure prediction with AlphaFold

22: AlphaFold 3

Presenter: starrynightdev

Write ups: Hugging Face blog + GitHub blog

23: AI for Physics. Hamiltonian Neural Networks/Lagrangian Neural Networks

Presenter: PS_Venom

Papers: Hamiltonian Neural Networks + Lagrangian Neural Networks