🎯
hiring @ alibaba https://liamding.cc/hiring.html

alphadl/README.md

Hi there

🙋‍♂️ I build agentic AI at Alibaba. I was previously chief scientist at a startup (which raised more than $50M), worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.

🔭 I work on the whole pipeline of LLM R&D and its human-centric applications, including efficient and sufficient training, alignment, evaluation, compression, multilinguality, multimodality, agentic applications, and much more.

💪 I'm keen on bodybuilding (5+ years) and marathon running: I completed my first half marathon (126 min) in Beijing in 2016 and my most recent one (86 min) in Sydney in 2019 😅. I will resume training in 2024 💪🏻.

🥗 I (once 😅) enjoy cooking.

🐈 I like to spend Sundays with my cats (two from 2020-2023, one from 2023).

🔥 Recent open-source projects on agentic AI, together covering data generation, reuse, evaluation, and context efficiency:

  • 🔄 AgentHER: Hindsight relabeling of failed trajectories for training.
  • 🧬 AgentSynth: Synthetic agent data from scratch with execution validation.
  • AdaRubric: Dynamic rubric evaluation for trajectory quality.
  • 🗜️ trajectory_tokenization: ReAct with compressed history for long-horizon context.
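
The compressed-history idea in the last item can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the `Step` type, the `render_history` function, and its parameters are hypothetical, and plain truncation stands in for whatever compression the trajectory_tokenization repository actually uses.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Step:
    """One ReAct step (hypothetical container, not the repository's type)."""
    thought: str
    action: str
    observation: str


def render_history(steps: List[Step], keep_recent: int = 2,
                   max_obs_chars: int = 40) -> str:
    """Render a prompt history: older steps compressed, recent steps in full.

    Assumption: "compression" here is a one-line summary with a truncated
    observation; a real system might use learned or special tokens instead.
    """
    cut = max(len(steps) - keep_recent, 0)  # index of the first full step
    lines = []
    for i, step in enumerate(steps):
        if i < cut:
            # Older step: drop the thought, keep action and a short observation.
            lines.append(f"[{i}] {step.action} -> {step.observation[:max_obs_chars]}")
        else:
            # Recent step: keep the full Thought/Action/Observation triple.
            lines.append(
                f"Thought: {step.thought}\n"
                f"Action: {step.action}\n"
                f"Observation: {step.observation}"
            )
    return "\n".join(lines)
```

Because the function only changes how the history is rendered, it needs no training and could in principle wrap an existing ReAct loop, which is the property the project description emphasizes.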

Pinned

  1. THUNLP-MT/MT-Reading-List Public

    A machine translation reading list maintained by Tsinghua Natural Language Processing Group

    TeX 2.4k 440

  2. lookahead.pytorch Public

    Lookahead optimizer ("Lookahead Optimizer: k steps forward, 1 step back") for PyTorch

    Python 338 64

  3. AgentHER Public

    AgentHER: Hindsight Experience Replay for LLM Agents

    Python 1

  4. AgentSynth Public

    AgentSynth: Industrial-Grade Agent Data Synthesis Pipeline

    Python 1

  5. AdaRubrics Public

    AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

    Python 1

  6. trajectory_tokenization Public

    Trajectory Tokenization for ReAct: compress older steps into tokens, keep recent steps full; no training, drop-in

    Python 4
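
The "k steps forward, 1 step back" rule named in the lookahead.pytorch entry above can be sketched in plain Python. This is a minimal illustration of the published Lookahead update wrapped around vanilla SGD, not the code in that repository; the function name `lookahead_sgd` and its parameters are assumptions made for this example.

```python
from typing import Callable, List


def lookahead_sgd(
    params: List[float],
    grad_fn: Callable[[List[float]], List[float]],
    steps: int,
    k: int = 5,
    alpha: float = 0.5,
    lr: float = 0.1,
) -> List[float]:
    """Lookahead around plain SGD (illustrative sketch only).

    Fast weights take k SGD steps ("k steps forward"); the slow weights then
    move a fraction alpha toward them ("1 step back"), and the fast weights
    restart from the slow weights.
    """
    slow = list(params)  # slow weights, updated once every k steps
    fast = list(params)  # fast weights, updated every step by the inner optimizer
    for step in range(1, steps + 1):
        grads = grad_fn(fast)
        fast = [w - lr * g for w, g in zip(fast, grads)]  # inner SGD step
        if step % k == 0:
            # Interpolate slow weights toward fast weights, then reset fast.
            slow = [s + alpha * (f - s) for s, f in zip(slow, fast)]
            fast = list(slow)
    # Fast steps taken after the last synchronization are discarded.
    return slow


# Usage: minimize f(w) = w^2, whose gradient is 2w.
weights = lookahead_sgd([1.0], lambda p: [2.0 * w for w in p], steps=50)
```

In PyTorch the same logic is usually written as a wrapper around an existing optimizer, so that any inner optimizer (SGD, Adam, and so on) can supply the fast steps.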