beans-ai (alpha)

Chess engines like Stockfish are the best in the world for playing chess, but unlike the best humans, engines are black boxes that can't explain their thinking process.

But what if you could have an AI with state-of-the-art chess capabilities, a transparent thinking process, and the ability to have conversations?

Introducing beans-0-1.5B—an LLM trained to reason about chess.

beans-0-1.5B

The ultimate goal of beans-ai is to master chess, but before doing so, it must learn to make legal moves consistently.

When prompting chess puzzles, a 1.5B parameter distilled DeepSeek R1 outputs legal moves only about 4% of the time.

I wanted to see if I could improve that.

So, I prompted full R1 to solve several thousand chess puzzles and used the generations that produced a legal next move to create a dataset of 2,159 reasoning examples.

I then used LLaMA-Factory to fine-tune DeepSeek-R1-Distill-Qwen-1.5B on the dataset, creating beans-0-1.5B.

Evals

To evaluate how well beans-0-1.5B outputs legal moves, I randomly sampled 100 chess puzzles from the EleutherAI/lichess-puzzles dataset and prompted beans to solve each puzzle.

Answers were evaluated as follows:

1: The next move exists on the board.
0: The next move doesn't exist on the board.
-1: Invalid format (no <answer></answer>).

Results

Model	Parsing Failures (-1)	Invalid Moves (0)	Legal Moves (1)	Accuracy	Expected value
beans-0-1.5B	62	16	22	0.22	-0.40
DeepSeek-R1-Distill-Qwen-1.5B	75	21	4	0.04	-0.71

beans-0-1.5B demonstrated significant improvement over its baseline DeepSeek-R1-Distill-Qwen-1.5B at generating legal moves for chess puzzles!

Future plan

Build a reward function to evaluate chess move quality.
Implement GRPO for RL training.
Scale training data with >> 2,159 samples.
Fine-tune using only good moves.
Explore self-improvement via synthetic data generation.

Acknowledgements

Thank you,

Startup Shell for access to 2 NVIDIA 3090s.
Bobby George for setting up SSH for the GPUs.
DeepSeek for open-source AI.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LICENSE		LICENSE
README.md		README.md
prompt_example.txt		prompt_example.txt
prompt_template.txt		prompt_template.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

beans-ai (alpha)

beans-0-1.5B

Evals

Results

Future plan

Acknowledgements

About

Releases

Packages

License

sshkeda/beans-ai

Folders and files

Latest commit

History

Repository files navigation

beans-ai (alpha)

beans-0-1.5B

Evals

Results

Future plan

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages