alphadl

Follow

🎯

hiring @ alibaba https://liamding.cc/hiring.html

Liam Liang Ding alphadl

🎯

hiring @ alibaba https://liamding.cc/hiring.html

Follow

NLP researcher, developing agentic AI.

233 followers · 216 following

Alibaba
Shanghai(CN) & Sydney(AU)
22:49 (UTC +01:00)
liamding.cc
@liangdingNLP
https://scholar.google.com/citations?user=lFCLvOAAAAAJ
https://huggingface.co/alphadl

Achievements

Achievements

Highlights

Pro

alphadl/README.md

Hi there

🙋‍♂️ I build agentic AI at Alibaba. I was the chief scientist at a startup (raised more than 50M$), previously worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.

🔭 Working on the whole pipeline of LLM R&D and their human-centric applications, including efficient and sufficient training, alignment, evaluations, compression, multilinguality, multimodality, agentic application, and much more.

💪 I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019😅. will resume training in 2024💪🏻).

🥗 I (once😅) enjoy cooking.

🐈 I like to spend Sundays with my cats (two from 2020-2023, one from 2023).

🔥 Recent open-source projects on agentic AI, together covering data generation, reuse, evaluation, and context efficiency:

🔄 AgentHER Hindsight relabeling of failed trajectories for training.
🧬 AgentSynth Synthetic agent data from scratch with execution validation.
📏 AdaRubric Dynamic rubric evaluation for trajectory quality.
🗜️ trajectory_tokenization ReAct with compressed history for long-horizon context.

Pinned Loading

THUNLP-MT/MT-Reading-List THUNLP-MT/MT-Reading-List Public

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2.4k 440
lookahead.pytorch lookahead.pytorch Public

lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

Python 338 64
AgentHER AgentHER Public

AgentHER: Hindsight Experience Replay for LLM Agents

Python 1
AgentSynth AgentSynth Public

AgentSynth: Industrial-Grade Agent Data Synthesis Pipeline

Python 1
AdaRubrics AdaRubrics Public

AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

Python 1
trajectory_tokenization trajectory_tokenization Public

Trajectory Tokenization for ReAct: compress older steps into tokens, keep recent steps full—no training, drop-in

Python 4