izard

You're viewing

izard's journal
Create a Dreamwidth Account Learn More

Reload page in style: site light

For a long time, there was no decent local LLM that understood and spoke Russian well. I've tested many fine-tunes, and they all behaved significantly dumber when answering in Russian than their respective English originals.

Russian internet has a huge potential training corpus second only to English; but smart French, German, Spanish and Arabic fine-tunes for local LLMs existed for months now, and there was no Russian one.

Until Serge Gotsuliak released his finetune few days ago.

Highly recommend it, works great with llama.cpp. I was missing RAG on my Russian books, will set it up now.

There is a drawback: 4-bit quant that I tested needs GPU[s] with ~48GB VRAM, or Apple silicon machine with 64GB+ RAM to run at decent speed.

Flat | Top-Level Comments Only

From:

vak

Wow, sounds promising. I never tried llama.cpp - I hope it's not so complicated.

From:

izard

It is very minimalistic, well designed and has 0 dependencies.

From:

juan_gandhi

I kind of missed it. You mean, one can install it on one's local machine, if it is M2 with 64GB RAM?

From:

izard

That’s right. Also llama.cpp does not have any dependencies so compiles /installs quickly and easily.

From:

vak

It worked pretty well on Linux: https://vak.dreamwidth.org/1184101.html

On Mac it failed though. When converting the model into gguf:

  File "llama.cpp/convert.py", line 793, in load
    fp = self.zip_file.open(info)
         ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/Cellar/python@3.12/3.12.2_1/Frameworks/Python.framework/Versions/3.12/lib/python3.12/zipfile/__init__.py", line 1643, in open
    raise BadZipFile(f"Overlapped entries: {zinfo.orig_filename!r} (possible zip bomb)")
zipfile.BadZipFile: Overlapped entries: 'pytorch_model-00004-of-00015/data/7' (possible zip bomb)

Edited Date: 2024-02-26 02:58 am (UTC)

From:

izard

I was only using on Mac, and convert.py worked for me. probably some different python minor versions/llama.cpp commit tags.

From:

vak

Right, it failed with python 3.12. So I tried 3.11, and it worked fine.

Flat | Top-Level Comments Only

Profile

izard

July 2025

S	M	T	W	T	F	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Page Summary

Style Credit

Style: Neutral Good for Practicality by timeasmymeasure

Expand Cut Tags

No cut tags

Page generated Jul. 16th, 2025 01:02 am

At last: Russian LLM fine-tune

At last: Russian LLM fine-tune

no subject

no subject

no subject

no subject

no subject

no subject

no subject

Profile

July 2025

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags