Leon Derczynski ⚒️☁️🏔️🌲 (@LeonDerczynski) / X

25.5K posts

Leon Derczynski ⚒️☁️🏔️🌲

@LeonDerczynski

NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct

Seattle / Copenhagen

bsky.app/profile/leonde…

Joined January 2012

Pinned
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jun 13, 2023
Proud to announce: 💫 garak - an LLM vulnerability scanner💫 🔎 Check if a model is susceptible to common attacks 🦜 Supports HuggingFace, OpenAI, ggml, Cohere, ... 🔧 >70 probes: prompt injection, false claims, toxicity, encoding evasion, ..
GitHub - NVIDIA/garak: the LLM vulnerability scanner
From github.com
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
May 27, 2022
'I don't really trust papers out of "Top Labs" anymore' reddit.com/r/MachineLearn…
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jan 13, 2023
Replying to @Sicclord1
strong "making a call during takeoff will crash the plane" vibes
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jul 7, 2022
machine learning researchers learn to optimise their own best paper rate through collusion and other unregulated mechanisms reddit.com/r/MachineLearn…
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jul 1, 2020
Replying to @lexfridman
finally, a proof for division by zero: ∴ 0 / 9 = 0.00000000
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Sep 6, 2021
Thanks, reviewer 2 𝚁𝚎𝚊𝚜𝚘𝚗𝚜 𝚝𝚘 𝚛𝚎𝚓𝚎𝚌𝚝 -------------------------- 𝙸 𝚌𝚊𝚗𝚗𝚘𝚝 𝚏𝚒𝚗𝚍 𝚊𝚗𝚢 𝚛𝚎𝚊𝚜𝚘𝚗 𝚝𝚘 𝚛𝚎𝚓𝚎𝚌𝚝 𝚝𝚑𝚒𝚜 𝚙𝚊𝚙𝚎𝚛 𝚊𝚝 𝚊𝚕𝚕 (𝚝𝚑𝚎𝚛𝚎 𝚒𝚜 𝚊 𝟷0 𝚠𝚘𝚛𝚍 𝚖𝚒𝚗𝚒𝚖𝚞𝚖 𝚏𝚘𝚛 𝚝𝚑𝚒𝚜 𝚜𝚘 𝙸 𝚓𝚞𝚜𝚝 𝚔𝚎𝚎𝚙 𝚠𝚛𝚒𝚝𝚒𝚗𝚐)
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Feb 3, 2023
using a 3d printer to "write" homework generated by chatgpt. your move, examiners
Video
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Apr 22, 2023
machine learning researchers debating theories of consciousness
GIF
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jan 24, 2024
Replying to @USAinUK
FYI microwaves are British
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Aug 11, 2024
Replying to @jaketropolis
Legal protections on tips going to employees make this unprofitable
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Mar 31, 2023
ChatGPT not best at many language tasks. It's outranked by other systems on many NLP benchmarks in current evaluation. For 77.5% of tasks examined, other systems are better than ChatGPT. opensamizdat.com/posts/chatgpt_…
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Mar 11, 2024
it's 2024 and we're still having to say this lol cos sucks if you have more than say three or four dimensions
Sumit
@_reachsumit
Mar 11, 2024
Is Cosine-Similarity of Embeddings Really About Similarity? Netflix cautions against blindly using cosine similarity as a measure of semantic similarity between learned embeddings, as it can yield arbitrary and meaningless results. 📝arxiv.org/abs/2403.05440
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Jan 18, 2022
Replying to @qntm and @SwiftOnSecurity
i know a head of national sales at google who only knows how to use "recent" docs. that's their filesystem
Leon Derczynski ⚒️☁️🏔️🌲
@LeonDerczynski
Feb 18, 2023
ELIZA designer Joseph Weizenbaum observed: “What I had not realized is that extremely short exposures to a relatively simple computer program could induce powerful delusional thinking in quite normal people.”
Introducing the AI Mirror Test, which very smart people keep failing
From theverge.com