BAIR

article thumbnail

Modeling Extremely Large Images with xT

BAIR

These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative image (not GIF) that we use for each post for tweeting (see below as well) and for the emails to subscribers. Please provide this image (and any other images and GIFs) in the blog to the BAIR Blog editors directly.

article thumbnail

On the Stepwise Nature of Self-Supervised Learning

BAIR

Figure 1: stepwise behavior in self-supervised learning. When training common SSL algorithms, we find that the loss descends in a stepwise fashion (top left) and the learned embeddings iteratively increase in dimensionality (bottom left). Direct visualization of embeddings (right; top three PCA directions shown) confirms that embeddings are initially collapsed to a point, which then expands to a 1D manifold, a 2D manifold, and beyond concurrently with steps in the loss.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Rethinking the Role of PPO in RLHF

BAIR

Rethinking the Role of PPO in RLHF TL;DR : In RLHF, there’s tension between the reward learning phase, which uses human preference in the form of comparisons, and the RL fine-tuning phase, which optimizes a single, non-comparative reward. What if we performed RL in a comparative way? Figure 1: This diagram illustrates the difference between reinforcement learning from absolute feedback and relative feedback.

Algorithm 100
article thumbnail

Goal Representations for Instruction Following

BAIR

Goal Representations for Instruction Following page width. --> A longstanding goal of the field of robot learning has been to create generalist agents that can perform tasks for humans. Natural language has the potential to be an easy-to-use interface for humans to specify arbitrary tasks, but it is difficult to train robots to follow language instructions.

Robotics 100
article thumbnail

Asymmetric Certified Robustness via Feature-Convex Neural Networks

BAIR

Asymmetric Certified Robustness via Feature-Convex Neural Networks TLDR : We propose the asymmetric certified robustness problem, which requires certified robustness for only one class and reflects real-world adversarial scenarios. This focused setting allows us to introduce feature-convex classifiers, which produce closed-form and deterministic certified radii on the order of milliseconds.

article thumbnail

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

BAIR

These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative image (not GIF) that we use for each post for tweeting (see below as well) and for the emails to subscribers. Please provide this image (and any other images and GIFs) in the blog to the BAIR Blog editors directly.

article thumbnail

The Shift from Models to Compound AI Systems

BAIR

These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative image (not GIF) that we use for each post for tweeting (see below as well) and for the emails to subscribers. Please provide this image (and any other images and GIFs) in the blog to the BAIR Blog editors directly.

LLM 100