zlacker

[parent] [thread] 2 comments
1. armcat+(OP)[view] [source] 2023-11-19 09:04:01
The final codebase, yes. But ML is not like traditional software engineering. There is a 99% failure rate, so you are forgetting 100s of hours that go into: (1) surveying literature to find that one thing that will give you a boost in performance, (2) hundreds of notebooks in trying various experiments, (3) hundreds of tweaks and hacks with everything from data pre-processing, to fine-tuning and alignment, to tearing up flash attention, (4) beta and user testing, (5) making all this run efficiently on the underlying infra hardware - by means of distillation, quantization, and various other means, (6) actually pipelining all this into something that can be served at hyperscale
replies(2): >>pk-pro+z2 >>karmas+n4
2. pk-pro+z2[view] [source] 2023-11-19 09:26:39
>>armcat+(OP)
> you are forgetting 100s of hours

I would say thousands. Even for the hobby projects, - thousands of GPU hours and thousands of research hours a year.

3. karmas+n4[view] [source] 2023-11-19 09:41:52
>>armcat+(OP)
And some luck is needed really.
[go to top]