- LLMs Research
- Posts
- Research papers published on 29th Feb'24
Research papers published on 29th Feb'24
Daily papers from top journals & conferences
Note: I sent out the last stack yesterday morning (EST), so some publications from the 28th weren't included. Here are the ones missing from the 28th, followed by the complete list for February 29th.
28th February 2024
Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness
Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing
Less is More: Fewer Interpretable Region via Submodular Subset Selection
On the Decision-Making Abilities in Role-Playing using Large Language Models
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
How Much Annotation is Needed to Compare Summarization Models?
Simple linear attention language models balance the recall-throughput tradeoff
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
BFRFormer: Transformer-based generator for Real-World Blind Face Restoration
Cooperative Open-ended Learning Framework for Zero-shot Coordination
29th February 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
ManiFPT: Defining and Analyzing Fingerprints of Generative Models
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
PRSA: Prompt Reverse Stealing Attacks against Large Language Models
Teaching Large Language Models an Unseen Language on the Fly
Typographic Attacks in Large Multimodal Models Can be Alleviated by More Informative Prompts
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
Syntactic Ghost: An Imperceptible General-purpose Backdoor Attacks on Pre-trained Language Models
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
End-to-End Quantum Vision Transformer: Towards Practical Quantum Speedup in Large-Scale Models
Language-specific LLM research
Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*
Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
PeLLE: Encoder-based language models for Brazilian Portuguese based on open data
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
Reply