Posts by Tags

AutoML

Automated Data Labeling

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

6 minute read

Authors: Tzu-Heng Huang

Large pretrained models like GPT-4, Gemini, and Claude 3 are fantastic at labeling data, whether it’s spam detection in YouTube comments or classifying topics in medical documents. But there’s a drawback: querying these models for every single data point via API calls gets expensive fast.
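
To make the cost concern concrete, here is a hedged back-of-envelope sketch contrasting per-example API labeling with asking the model once to write a labeling program, which is the direction the post’s title points to. Every number, price, and name below is an illustrative assumption, not a figure from the post.

```python
# Back-of-envelope comparison of two labeling strategies. Every number
# below (dataset size, token counts, price) is an illustrative assumption.

N_EXAMPLES = 1_000_000           # data points to label (assumed)
TOKENS_PER_CALL = 500            # prompt + completion per example (assumed)
PRICE_PER_1K_TOKENS = 0.01       # USD per 1K tokens, assumed blended price

# Strategy 1: one API call per data point.
per_example_cost = N_EXAMPLES * (TOKENS_PER_CALL / 1000) * PRICE_PER_1K_TOKENS

# Strategy 2: a handful of calls that ask the model to *write* a small
# labeling program (e.g., a function flagging spammy YouTube comments),
# then run that program locally over all examples for roughly free.
PROGRAM_CALLS = 10               # prompts plus retries (assumed)
TOKENS_PER_PROGRAM_CALL = 2_000  # longer prompts with examples (assumed)
program_cost = PROGRAM_CALLS * (TOKENS_PER_PROGRAM_CALL / 1000) * PRICE_PER_1K_TOKENS

print(f"per-example labeling: ${per_example_cost:,.2f}")        # $5,000.00
print(f"program generation:   ${program_cost:,.2f}")            # $0.20
print(f"cost ratio: {per_example_cost / program_cost:,.0f}x")   # 25,000x here
# The ratio depends entirely on these assumed prices and token counts;
# the post itself reports savings on the order of 500x.
```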

Data Source Selection

Data-Centric AI

Tabby: Tabular Data Synthesis With Large Language Models

6 minute read

Authors: Sonia Cromp

While impressive examples of AI-generated art and dialogue have captured the public’s attention in recent years, one of the most fundamental data formats, tabular data, still lacks specialized, high-performing models. Tables are ubiquitous in modern life, but they are not modeled well by off-the-shelf models intended for other data types. Given the central role of tabular data in everything from global economic forecasts and astronomical observations to classroom gradebooks and household budgets, the lack of deep learning methods tailored for tables is quite surprising. To address this table synthesis gap, we introduce Tabby: a foundation model designed specifically for tabular data. Tabby introduces the inductive biases necessary to represent tabular data into a pre-trained large language model, avoiding the costly process of training a foundation model from scratch. Read on to discover how Tabby generates synthetic data that is nearly indistinguishable from real-world datasets!
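
As rough background for how a language model can be pointed at tables at all, here is a minimal sketch of the row-to-text serialization that LLM-based table synthesis generally builds on. The schema, format, and helper names are invented for illustration; this is the generic recipe, not Tabby’s actual architecture, which the full post describes.

```python
# Minimal sketch of the row <-> text serialization that LLM-based table
# synthesis builds on: each row becomes a short "column is value" string
# that a model can be fine-tuned on and sampled from. The columns below
# are illustrative assumptions, not a real dataset.

import re

COLUMNS = ["age", "occupation", "income"]  # assumed schema

def row_to_text(row: dict) -> str:
    """Serialize one table row into a flat string for a language model."""
    return ", ".join(f"{col} is {row[col]}" for col in COLUMNS)

def text_to_row(text: str) -> dict:
    """Parse a generated string back into a validated table row."""
    row = dict(re.findall(r"(\w+) is ([^,]+)", text))
    assert set(row) == set(COLUMNS), "model produced a malformed row"
    return row

sample = {"age": 37, "occupation": "teacher", "income": 52000}
encoded = row_to_text(sample)
print(encoded)               # age is 37, occupation is teacher, income is 52000
print(text_to_row(encoded))  # round-trips back to a dict of column values
```

Serialization alone treats a row as ordinary prose; the excerpt’s point is that Tabby instead builds table-specific inductive biases into the pre-trained model itself.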

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

Diverse Tasks

Foundation Models

Tabby: Tabular Data Synthesis With Large Language Models

Inference-time steering

LLM-as-data-annotators

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

Label Distribution Adaptation

Language Models

Multi-modal Models

Non-Euclidean ML

Optimal Transport

Overlap Density

Research

Robust ML

Self-Alignment

Structured Data

Tabby: Tabular Data Synthesis With Large Language Models

Structured Prediction

Tabular Data

Tabby: Tabular Data Synthesis With Large Language Models

Tensor Decomposition

Weak Supervision

Weak-to-Strong Generalization

Zero-Shot Models

Zero-shot inference