Tech Blog

This is a tech blog that focuses on my research. I'll try to use both Chinese(my native language) and English in this blog.

显示器挑选指南

1 minute read

专业修图显示器选购指南

RL 100

15 minute read

1. Triage

My Gem Instructions

2 minute read

Code Master Gem Instructions

A Tutorial on CQL and Cal-QL: From Offline Conservatism to Online Fine-Tuning

13 minute read

This guide reviews two important algorithms in reinforcement learning, Conservative Q-Learning (CQL) and Calibrated Q-Learning (Cal-QL), explaining the probl...

Port Shifts Solution with udev

3 minute read

Using udev to Create Persistent Device Paths

The Llama 3 Herd of Models

5 minute read

LLaMA3 技术报告的简要分析。

LLaMA 2: Open Foundation and Fine-Tuned Chat Models

14 minute read

Abstract: In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billio...

LLaMA: Open and Efficient Foundation Language Models

11 minute read

LLaMA (Large Language Model Meta AI) is a series of foundational language models developed by Meta AI. The LLaMA models are designed to be efficient and effe...