Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards Paper • 2306.04488 • Published Jun 7, 2023
eP-ALM: Efficient Perceptual Augmentation of Language Models Paper • 2303.11403 • Published Mar 20, 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning Paper • 2310.00647 • Published Oct 1, 2023