_______ __ _______ | | |.---.-..----.| |--..-----..----. | | |.-----..--.--.--..-----. | || _ || __|| < | -__|| _| | || -__|| | | ||__ --| |___|___||___._||____||__|__||_____||__| |__|____||_____||________||_____| on Gopher (inofficial) URI Visit Hacker News on the Web COMMENT PAGE FOR: URI Representation Engineering (2024) mock-possum wrote 16 min ago: That last experiment, where the LLM with its honesty vector increased is tasked with judging whether a user asking an example question has honest intentions, is interesting. It looks like it doesnât quite grasp the ask, and is instead just equivocating about the definition of âhonest.â I wonder what a response with the âthoroughnessâ vector turned up might have answered in that a case - would it have pointed out that itâs impossible to know intention from words, because people can lie, but itâs possible to at least guess - and even then, judging the honesty of intention could be interpreted several different ways? k__ wrote 3 hours 31 min ago: Somehow hidden state reminds me of DNA. DIR <- back to front page