HN🔥 167
💬 21

MicroGPTをインタラクティブに体験!最小構成のGPTを動かしながら理解しよう

growingswe
1日前

ディスカッション (8件)

0
growingsweOP🔥 167
1日前

MicroGPTの仕組みをインタラクティブに学べる解説コンテンツが公開されました。最小限の構成でGPTの内部動作を視覚的・体験的に理解できる内容となっています。エンジニアなら一度は触っておきたい、GPTのコア原理を学べる素晴らしいリソースです。

2
windowshopping
約18時間前

The part that eludes me is how you get from this to the capability to debug arbitrary coding problems. How does statistical inference become reasoning?

For a long time, it seemed the answer was it doesn't. But now, using Claude code daily, it seems it does.

3
malnourish
約18時間前

I read through this entire article. There was some value in it, but I found it to be very "draw the rest of the owl". It read like introductions to conceptual elements or even proper segues had been edited out. That said, I appreciated the interactive components.

5
jmkd
約17時間前

It says its tailored for beginners, but I don't know what kind of beginner can parse multiple paragraphs like this:

"How wrong was the prediction? We need a single number that captures "the model thought the correct answer was unlikely." If the model assigns probability 0.9 to the correct next token, the loss is low (0.1). If it assigns probability 0.01, the loss is high (4.6). The formula is

log

(

)
−log(p) where

p is the probability the model assigned to the correct token. This is called cross-entropy loss."

7
love2read
約17時間前

Is it becoming a thing to misspell and add grammatical mistakes on purpose to show that an LLM didn't write the blog post? I noticed several spelling mistakes in Karpathy's blog post that this article is based on and in this article.