Skip to content
View sunyiyou's full-sized avatar

Highlights

  • Pro

Block or report sunyiyou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. rdi-berkeley/agents-last-exam rdi-berkeley/agents-last-exam Public

    Agents' Last Exam

    Python 710 29

  2. deeplearning-wisc/react deeplearning-wisc/react Public

    Code for NeurIPS 2021 paper "ReAct: Out-of-distribution Detection With Rectified Activations"

    Python 57 10

  3. deeplearning-wisc/knn-ood deeplearning-wisc/knn-ood Public

    Code for ICML 2022 paper "Out-of-distribution Detection with Deep Nearest Neighbors"

    Python 202 18

  4. sunblaze-ucb/omega sunblaze-ucb/omega Public

    Python 47 4

  5. sunblaze-ucb/rl-grok-recipe sunblaze-ucb/rl-grok-recipe Public

    Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""

    Python 35 1

  6. rdi-berkeley/awesome-RLVR-boundary rdi-berkeley/awesome-RLVR-boundary Public

    A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).

    89 6