Hey r/MachineLearning! Last week, Microsoft released Phi-4, a 14B open-source model that rivals OpenAI's GPT-4o-mini. I managed to find & fix 4 bugs impacting its output quality. You might remember me from previously fixing 8 bugs in Google's Gemma model! :)
I'm going to walk you through how I found & fixed the bugs. Phi-4's benchmarks were amazing, but many users reported weird or just plain wrong outputs. Since I maintain the open-source project 'Unsloth' (fine-tuning LLMs 2x faster with 70% less VRAM) with my brother, I first tested Phi-4 for inference and found many errors. Our GitHub repo: https://github.com/unslothai/unsloth
This time, the model had no implementation issues (unlike Gemma 2) but did have problems in its uploaded tokenizer and chat template configs. On my first inference run, I found an extra <|endoftext|> token appended after <|im_end|>, which is obviously incorrect (two EOS tokens are never a good idea). During more runs, I found an extra assistant prompt being added by default, which is once again incorrect. And lastly, from past experience with Unsloth's bug fixes, I already knew fine-tuning would break when I read the padding setup.
These bugs caused a drop in Phi-4's accuracy and also broke fine-tuning runs. Our fixes are now under review by Microsoft to be officially added to the Hugging Face repo. We uploaded the fixed versions to https://huggingface.co/unsloth/phi-4-GGUF
Here’s a breakdown of the bugs and their fixes:
1. Tokenizer bug fixes
The Phi-4 tokenizer interestingly uses <|endoftext|> as the BOS (beginning of sequence), EOS (end of sequence) and PAD (padding) token. The main issue is that the EOS token is wrong - it should be <|im_end|>. Otherwise, you will get <|im_end|><|endoftext|> in generations.
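If you want to apply this fix locally before pulling our uploads, here's a minimal sketch using the standard transformers tokenizer API. The actual fix lives in the uploaded tokenizer_config.json; this is just the runtime equivalent, and the repo name below is the original upload:

```python
# Minimal sketch: override Phi-4's EOS token at load time with transformers.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-4")
print(tokenizer.eos_token)        # "<|endoftext|>" in the original upload

# EOS should be the chat turn terminator, not the end-of-text token
tokenizer.eos_token = "<|im_end|>"
print(tokenizer.eos_token_id)     # now resolves to <|im_end|>'s id

# Pass the new id to generate() so decoding actually stops there, e.g.
# model.generate(..., eos_token_id=tokenizer.eos_token_id)
```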
2. Fine-tuning bug fixes
The padding token should be a designated pad token like in Llama (<|finetune_right_pad_id|>), or we can use an untrained token - for example we use <|dummy_87|> - which fixes infinite generations and broken outputs.
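A minimal sketch of the pad token change, again assuming the transformers tokenizer API (<|dummy_87|> is simply one of Phi-4's unused placeholder tokens):

```python
# Minimal sketch: give Phi-4 a dedicated, untrained pad token.
# If pad == eos, data collators mask every EOS in the labels as padding,
# so the model never learns to stop - hence the infinite generations.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-4")
print(tokenizer.pad_token)             # "<|endoftext|>" == EOS in the original upload

tokenizer.pad_token = "<|dummy_87|>"   # any unused, untrained token works
print(tokenizer.pad_token_id)
```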
3. Chat template issues
The Phi-4 tokenizer's chat template always adds an assistant prompt - it should only do this when add_generation_prompt is set. Most LLM serving libraries expect the assistant prompt not to be added automatically, so this can cause issues during serving.
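To see the difference, here's a minimal sketch of the expected behaviour with transformers' apply_chat_template (the repo name is the original upload; with the fixed template the assistant header only appears when you ask for it):

```python
# Minimal sketch: the assistant header should only appear when requested.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-4")
messages = [{"role": "user", "content": "Hello!"}]

# Training / eval data: no trailing assistant header should be appended
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=False
)

# Inference: explicitly ask for the generation prompt
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

print(text.endswith("<|im_start|>assistant<|im_sep|>"))    # should be False after the fix
print(prompt.endswith("<|im_start|>assistant<|im_sep|>"))  # should be True
```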
We dive deeper into the bugs in our blog: https://unsloth.ai/blog/phi4
Do our Fixes Work?
Yes! Our fixed Phi-4 uploads show clear performance gains, with even better scores than Microsoft's original uploads on the Open LLM Leaderboard.
https://preview.redd.it/d8hew26e06ce1.png?width=2366&format=png&auto=webp&s=173c23feacc625566271470839fe7a5e25eb860e
Some redditors even tested our fixes and shared greatly improved results:
https://preview.redd.it/qx50pkq706ce1.png?width=1579&format=png&auto=webp&s=437da2cabdbf98ef5a8b8cbdc5592907a20e2316
https://preview.redd.it/sw1o3a3yt4de1.png?width=2326&format=png&auto=webp&s=fc6bfc45d14134d45f332ba58bbd1de049f5776b
We also made a Colab notebook to fine-tune Phi-4 completely for free using Google's free Tesla T4 (16GB) GPUs: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4-Conversational.ipynb
Thank you for reading this long post and I hope you all found this insightful! If you have any questions, please feel free to ask! :)
How I found the bugs:
- I first downloaded the original Phi-4 from https://huggingface.co/microsoft/phi-4 and tested inference. Weirdly, I found <|im_start|>assistant<|im_sep|> being appended at the end even with add_generation_prompt = False in Hugging Face, so I theorized there was a chat template problem. Adding assistant prompts by default can break serving libraries.
- And yes, https://huggingface.co/microsoft/phi-4/blob/f957856cd926f9d681b14153374d755dd97e45ed/tokenizer_config.json#L774 did add the assistant prompt by default - I fixed this first!
- I then found <|endoftext|> being used for the BOS, EOS and PAD tokens, which is a common issue amongst models. I ignored the BOS, since Phi-4 did not have one anyway, but changed the PAD token to <|dummy_87|>. You can select any of the unused dummy tokens since they're untrained. This counteracts infinite generations during finetuning.
- For Llama-fication, I used torch.allclose to confirm all tensors are in fact equivalent. I also ran some fake random data through both models to check that all activations are mostly similar (see the sketch after this list). I also uploaded the model to the HF Open LLM Leaderboard to confirm the original Phi-4 arch and the new Llama-fied model are equivalent.
- Finally I verified all finetuning runs with Unsloth in a Colab Notebook to confirm all runs were correct.
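For the activation check in the Llama-fication bullet above, here's a rough sketch of the idea, assuming both checkpoints load through transformers. Repo names, sequence length and tolerance are illustrative, not the exact script I ran:

```python
# Rough sketch: run identical random token ids through both checkpoints
# and compare the logits with torch.allclose.
import torch
from transformers import AutoModelForCausalLM

orig = AutoModelForCausalLM.from_pretrained("microsoft/phi-4", torch_dtype=torch.float32)
llamafied = AutoModelForCausalLM.from_pretrained("unsloth/phi-4", torch_dtype=torch.float32)
orig.eval(); llamafied.eval()

torch.manual_seed(0)
fake_ids = torch.randint(0, orig.config.vocab_size, (1, 64))

with torch.no_grad():
    logits_a = orig(fake_ids).logits
    logits_b = llamafied(fake_ids).logits

# Per-tensor weight checks work the same way with torch.allclose, once fused
# projections (e.g. qkv_proj) are split to match the Llama layout.
print(torch.allclose(logits_a, logits_b, atol=1e-4))
print((logits_a - logits_b).abs().max())
```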