• kablammy@sh.itjust.works
    link
    fedilink
    arrow-up
    1
    ·
    9 months ago

    It would have been more obviously gradient descent if they didnt start with 0, so the first gradient wasn’t the same as the second answer. I thought they were just repeating the last correct answer.