Log inSign up
Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
25.5K posts
Image
user avatar
Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
@LeonDerczynski
NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct
Seattle / Copenhagen
bsky.app/profile/leonde…
Joined January 2012
1,134
Following
6,483
Followers
  • Pinned
    user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jun 13, 2023
    Proud to announce: πŸ’« garak - an LLM vulnerability scannerπŸ’« πŸ”Ž Check if a model is susceptible to common attacks 🦜 Supports HuggingFace, OpenAI, ggml, Cohere, ... πŸ”§ >70 probes: prompt injection, false claims, toxicity, encoding evasion, ..
    Image
    GitHub - NVIDIA/garak: the LLM vulnerability scanner
    From github.com
    63K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    May 27, 2022
    'I don't really trust papers out of "Top Labs" anymore' reddit.com/r/MachineLearn…
    Image
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jan 13, 2023
    Replying to @Sicclord1
    strong "making a call during takeoff will crash the plane" vibes
    37K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jul 7, 2022
    machine learning researchers learn to optimise their own best paper rate through collusion and other unregulated mechanisms reddit.com/r/MachineLearn…
    Image
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jul 1, 2020
    Replying to @lexfridman
    finally, a proof for division by zero: ∴ 0 / 9 = 0.00000000
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Sep 6, 2021
    Thanks, reviewer 2 πšπšŽπšŠπšœπš˜πš—πšœ 𝚝𝚘 πš›πšŽπš“πšŽπšŒπš -------------------------- 𝙸 πšŒπšŠπš—πš—πš˜πš πšπš’πš—πš πšŠπš—πš’ πš›πšŽπšŠπšœπš˜πš— 𝚝𝚘 πš›πšŽπš“πšŽπšŒπš πšπš‘πš’πšœ πš™πšŠπš™πšŽπš› 𝚊𝚝 πšŠπš•πš• (πšπš‘πšŽπš›πšŽ πš’πšœ 𝚊 𝟷0 πš πš˜πš›πš πš–πš’πš—πš’πš–πšžπš– πšπš˜πš› πšπš‘πš’πšœ 𝚜𝚘 𝙸 πš“πšžπšœπš πš”πšŽπšŽπš™ πš πš›πš’πšπš’πš—πš)
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Feb 3, 2023
    using a 3d printer to "write" homework generated by chatgpt. your move, examiners
    Image
    00:00
    91K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Apr 22, 2023
    machine learning researchers debating theories of consciousness
    Image
    GIF
    51K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jan 24, 2024
    Replying to @USAinUK
    FYI microwaves are British
    Image
    116K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Aug 11, 2024
    Replying to @jaketropolis
    Legal protections on tips going to employees make this unprofitable
    23K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Mar 31, 2023
    ChatGPT not best at many language tasks. It's outranked by other systems on many NLP benchmarks in current evaluation. For 77.5% of tasks examined, other systems are better than ChatGPT. opensamizdat.com/posts/chatgpt_…
    Image
    90K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Mar 11, 2024
    it's 2024 and we're still having to say this lol cos sucks if you have more than say three or four dimensions
    user avatar
    Sumit
    @_reachsumit
    Mar 11, 2024
    Is Cosine-Similarity of Embeddings Really About Similarity? Netflix cautions against blindly using cosine similarity as a measure of semantic similarity between learned embeddings, as it can yield arbitrary and meaningless results. πŸ“arxiv.org/abs/2403.05440
    Image
    108K
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Jan 18, 2022
    Replying to @qntm and @SwiftOnSecurity
    i know a head of national sales at google who only knows how to use "recent" docs. that's their filesystem
  • user avatar
    Leon Derczynski βš’οΈβ˜οΈπŸ”οΈπŸŒ²
    @LeonDerczynski
    Feb 18, 2023
    ELIZA designer Joseph Weizenbaum observed: β€œWhat I had not realized is that extremely short exposures to a relatively simple computer program could induce powerful delusional thinking in quite normal people.”
    A photograph of a monkey inspecting its reflection in the mirror.
    Introducing theΒ AIΒ Mirror Test, which very smart people keep failing
    From theverge.com
    51K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsΒ·PrivacyΒ·CookiesΒ·AccessibilityΒ·Ads InfoΒ·Β© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up