Hacker News | txhwind's comments

I have preferred synthetic datasets since the first day I heard of distillation. The engineering friction is much lower than with soft logits, and I have not observed or heard of any performance loss (in the speech and language areas).

Could you share some recent articles or papers comparing both methods, especially for the language modelling case? I was not convinced by this claim when reading the original Knowledge Distillation paper. ChatGPT said there are some later works showing: 1. the gain may come from label smoothing; 2. soft logits are more meaningful for students much smaller than the teacher.
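
To make the comparison concrete, here is a minimal sketch of the three training targets being discussed: soft logits from the teacher, a hard label taken from the teacher's output (the synthetic dataset route), and plain label smoothing. All function names, the temperature, and the smoothing value are my own assumptions for illustration, not taken from any paper or library.

```python
import math

def softmax(logits, temperature=1.0):
    # Numerically stable softmax with an optional temperature.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(target_probs, student_logits, temperature=1.0):
    # Cross-entropy between a target distribution and the student's softmax.
    student_probs = softmax(student_logits, temperature)
    return -sum(t * math.log(s)
                for t, s in zip(target_probs, student_probs) if t > 0)

def soft_logit_loss(teacher_logits, student_logits, temperature=2.0):
    # Soft-logit distillation: the target is the teacher's full softened distribution.
    teacher_probs = softmax(teacher_logits, temperature)
    return cross_entropy(teacher_probs, student_logits, temperature)

def synthetic_label_loss(teacher_logits, student_logits):
    # Synthetic-data route (assumed greedy decoding): only the teacher's top choice is kept.
    label = max(range(len(teacher_logits)), key=teacher_logits.__getitem__)
    one_hot = [1.0 if i == label else 0.0 for i in range(len(teacher_logits))]
    return cross_entropy(one_hot, student_logits)

def label_smoothing_loss(label, student_logits, epsilon=0.1):
    # Label smoothing: a hard label mixed with a uniform distribution, no teacher needed.
    n = len(student_logits)
    smoothed = [epsilon / n + (1.0 - epsilon) * (1.0 if i == label else 0.0)
                for i in range(n)]
    return cross_entropy(smoothed, student_logits)
```

The synthetic-label version only needs the teacher's generated output, while the soft-logit version needs the teacher's full distribution over the vocabulary at every position, which is where most of the engineering friction comes from.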

How is the water animation implemented?

Search the source code for initWaterEffect.

I'm curious about the use of rsync in version control. What's the source and destination?

Yeah, this is insane. It shows a complete lack of understanding of how the tool works.

Using rsync on git is like hammering a nail with a hammer, but then using a 10-pound stone to hammer the hammer.


From src/ to src_final_(4)/

Now we may have a more powerful "Prolog": the LLM agent, though it is not always precise or correct.

It reminds me of an old joke:

Radio Yerevan: A listener asks: "Is it true that in Moscow, on Red Square, they are giving away cars?"

Our answer: "Yes, it is true. Except it isn't in Moscow, but in Leningrad. And it isn't on Red Square, but on Palace Square. And they aren't cars, but bicycles. And they aren't giving them away, they are stealing them."


I really hate the modern time schedule. It's a nightmare to have been forced to get up at 6am or 7am every workday since childhood. The only relief is waking up naturally on weekends.


Another "obscurity": I'm not valuable enough to be attacked, compared with the cost of an attack. But what if that cost has been reduced a lot?


Fucking abbreviations. Who knows whether it's DeepSeek, Dark Souls or DualShock? All are possible on HN.


Could be Death Stranding too.


The proof would be friendlier to today's programmers if we treated all "Gödel numbers" as the bytecode of a programming language. It's then trivial that functions like "prove" and "subst" can be implemented on top of abilities like bytecode parsing and expression tree manipulation.
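
As a rough illustration of the "subst" half of that idea, here is a small sketch in which a formula is a nested tuple, its "Gödel number" is just the integer value of its serialized bytes, and substitution is ordinary tree manipulation between a decode and an encode. The tuple format and every name here are assumptions made up for this example, not the encoding used in the actual proof.

```python
import ast

def encode(tree):
    # "Goedel number" of a formula: its serialized bytes read as one big integer.
    data = repr(tree).encode("utf-8")
    return int.from_bytes(data, "big")

def decode(number):
    # Parse the integer back into an expression tree (the "bytecode parsing" step).
    length = (number.bit_length() + 7) // 8
    text = number.to_bytes(length, "big").decode("utf-8")
    return ast.literal_eval(text)

def subst_tree(tree, name, replacement):
    # Replace every ("var", name) node with the replacement tree.
    if tree == ("var", name):
        return replacement
    if isinstance(tree, tuple):
        children = tuple(subst_tree(c, name, replacement) for c in tree[1:])
        return (tree[0],) + children
    return tree  # leaf payload such as a number or a variable name

def subst(number, name, replacement_number):
    # Substitution working purely on numbers: decode, rewrite the tree, encode.
    tree = subst_tree(decode(number), name, decode(replacement_number))
    return encode(tree)

# Hypothetical formula "x + 1" and its number.
formula = ("add", ("var", "x"), ("num", 1))
g = encode(formula)

# Plug the formula's own number in for x, as in the diagonal construction.
diagonal = subst(g, "x", encode(("num", g)))
```

Decoding `diagonal` gives a tree that contains the number of the original formula as a literal, which is the kind of self-reference the proof relies on.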

