Update README.md
parent 985973ca6b
commit 9e026c7f14
@@ -2,6 +2,8 @@
 
 This project demonstrates an implementation that orchestrates a Chain-of-Thought process, enabling LLMs to solve complex reasoning problems by thinking step-by-step. It's designed to improve problem-solving accuracy through deliberate, structured reasoning managed externally.
 
+This implementation is based on: [Build Chain-of-Thought From Scratch - Tutorial for Dummies](https://zacharyhuang.substack.com/p/build-chain-of-thought-from-scratch).
+
 ## Features
 
 - Improves model reasoning on complex problems.
@@ -80,8 +82,8 @@ This problem demonstrates why structured Chain-of-Thought is valuable:
 For comparison:
 - [Claude 3.7 Sonnet (single prompt)](https://claude.ai/share/da139326-42fe-42d9-9d7b-35870daa5c1b): Wrong answer
 - [Claude 3.7 Sonnet (using built-in thinking)](https://claude.ai/share/6f4140ed-f33c-4949-8778-a57719498e40): Correct answer after 3m, 45s
-- [GPT-4o (using built-in thinking)](https://chatgpt.com/share/67fee0fd-2600-8000-bcdf-76e40a986ee4): Correct answer after 2m, 0s
-- [GPT-4 Turbo (using built-in thinking)](https://chatgpt.com/share/67fee11b-530c-8000-92d1-609b6ca49c9c): Correct answer after 4m, 24s
+- [GPT-o1 (using built-in thinking)](https://chatgpt.com/share/67fee0fd-2600-8000-bcdf-76e40a986ee4): Correct answer after 2m, 0s
+- [GPT-o1 pro (using built-in thinking)](https://chatgpt.com/share/67fee11b-530c-8000-92d1-609b6ca49c9c): Correct answer after 4m, 24s
 
 Below is an example output trace showing how **this implementation guides Claude 3.7 Sonnet** through the problem-solving process: