Hacker News

amelius · today at 11:58 AM

If you want to make it more human-explainable, ditch the tokenizer entirely and just feed the model raw characters. Then there is nothing left to explain.
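For a concrete sense of what a tokenizer adds that would need explaining, here is a minimal sketch (assuming the tiktoken library and its built-in cl100k_base encoding; the example word is arbitrary):

    import tiktoken

    # Load an off-the-shelf BPE encoding; cl100k_base is one of tiktoken's built-ins.
    enc = tiktoken.get_encoding("cl100k_base")

    word = "unexplainability"
    token_ids = enc.encode(word)

    # Print each subword piece the model actually sees instead of raw characters.
    print([enc.decode([t]) for t in token_ids])

The boundaries of those pieces fall out of byte-pair merge statistics over the training corpus, not out of anything a human would choose, which is exactly the "something to explain" that raw characters avoid.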


Replies

sigmoid10 · today at 3:52 PM

Then you need at least 4x the compute to achieve the same results as the state of the art: a typical BPE token covers roughly four characters, so a character-level model has to process roughly four times as many sequence positions for the same text. That means if I can train my frontier model with a normal tokenizer in 3 months, it will take you a year. When major releases across all competing providers are measured in months, there's simply no incentive to do that just to capture these fringe edge cases.
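A rough sanity check of that 4x figure, as a sketch (again assuming tiktoken with the cl100k_base encoding; the sample text is arbitrary and the exact ratio will vary by language and domain):

    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")

    text = (
        "When major releases across all competing providers are measured "
        "in months, there's simply no incentive to do that."
    )

    n_chars = len(text)                # sequence length a character-level model would see
    n_tokens = len(enc.encode(text))   # sequence length a BPE-tokenized model sees

    # Training compute scales at least linearly with sequence length, so this
    # ratio is a lower bound on the extra cost of going character-level.
    print(f"{n_chars} chars / {n_tokens} tokens = {n_chars / n_tokens:.1f}x longer")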
