diff --git a/_pages/about.md b/_pages/about.md
index 2bd97c9798468..092f6cda0a628 100644
--- a/_pages/about.md
+++ b/_pages/about.md
@@ -23,13 +23,13 @@ I have 4-year experiences in LLMs. My research spans the areas of natural langua
 My research spans the areas of natural language processing and machine learning, specifically focusing on LLMs. I am broadly interested in the following topics:
-* Computational Efficiency Method. (LLM Fine-tuning) Developing theories, tools, and algorithms for the efficient adaptation of large-scale models to downstream tasks in a computation-friendly manner. (e.g., parameter-efficient methods: LoRa, Adapter, etc.; instruction tuning) ([Prompt Transferability](https://aclanthology.org/2022.naacl-main.290/), [IPT](https://arxiv.org/abs/2110.07867), [Parameter-efficient Fine-tuning Survey](https://arxiv.org/abs/2203.06904), [APET](https://openreview.net/forum?id=3CIQIYNGlp)).
+* Computational Efficiency Method. (Fine-tuning) Developing theories, tools, and algorithms for the efficient adaptation of large-scale models to downstream tasks in a computation-friendly manner. (e.g., parameter-efficient methods: LoRa, Adapter, etc.; instruction tuning) ([Prompt Transferability](https://aclanthology.org/2022.naacl-main.290/), [IPT](https://arxiv.org/abs/2110.07867), [Parameter-efficient Fine-tuning Survey](https://arxiv.org/abs/2203.06904), [APET](https://openreview.net/forum?id=3CIQIYNGlp)).
-* Interactive AI Agent. (LLM Planning and Reasoning) Developing autonomous agents (based on LLMs) that can acquire external information through interaction with the external (environment/human) and utilize the acquired knowledge to drive their decision-making and accomplish more complex long-horizon tasks ([AgentVerse](https://arxiv.org/abs/2308.10848), [XAgent](https://blog.x-agent.net/about/), [CokeBERT](https://arxiv.org/abs/2009.13964), [CSS-LM](https://arxiv.org/abs/2102.03752), [Tool Leaning](https://arxiv.org/abs/2304.08354), [ChatDev](https://arxiv.org/abs/2307.07924)). Note that: I explore how to make multiple autonomous agents collaborate more efficiently to accomplish complex tasks and am further instered in their [emergent human-like social behaviors](https://arxiv.org/abs/2308.10848).
+* Interactive AI Agent. (Planning and Reasoning) Developing autonomous agents (based on LLMs) that can acquire external information through interaction with the external (environment/human) and utilize the acquired knowledge to drive their decision-making and accomplish more complex long-horizon tasks ([AgentVerse](https://arxiv.org/abs/2308.10848), [XAgent](https://blog.x-agent.net/about/), [CokeBERT](https://arxiv.org/abs/2009.13964), [CSS-LM](https://arxiv.org/abs/2102.03752), [Tool Leaning](https://arxiv.org/abs/2304.08354), [ChatDev](https://arxiv.org/abs/2307.07924)). Note that: I explore how to make multiple autonomous agents collaborate more efficiently to accomplish complex tasks and am further instered in their [emergent human-like social behaviors](https://arxiv.org/abs/2308.10848).
-* AI Alignment. (LLM Alignment) Designing paradigms to control, steer, teach, and guide AI systems that smarter than human ([scalable oversight](https://openai.com/blog/introducing-superalignment)). At the present stage, I am studying how to create a tool that can explore unclosed behavior in LLMs ([Model Emotion](https://arxiv.org/abs/2302.09582) and propose a metric/pipeline to autonomously evaluate/supervise LLMs ([Chateval](https://arxiv.org/abs/2308.07201)).
+* AI Alignment. (Alignment) Designing paradigms to control, steer, teach, and guide AI systems that smarter than human ([scalable oversight](https://openai.com/blog/introducing-superalignment)). At the present stage, I am studying how to create a tool that can explore unclosed behavior in LLMs ([Model Emotion](https://arxiv.org/abs/2302.09582) and propose a metric/pipeline to autonomously evaluate/supervise LLMs ([Chateval](https://arxiv.org/abs/2308.07201)).