LLM Research Paper Guide: Navigating the Literature
Understanding LLM research requires reading key papers, understanding methodologies, and staying current with rapid advances. This guide provides a roadmap for navigating the research landscape.
- Key Papers: Foundational and influential LLM research
- Reading Guides: How to read and understand research papers
- Research Methodology: Conducting LLM research
Read the classics first, then explore the frontiers.
The LLM field advances rapidly, with new papers published daily. This guide helps you navigate the literature, understand key contributions, and stay current with the field.
Definition: LLM Research Literature
LLM research literature encompasses papers on language modeling, architecture innovations, training methods, evaluation, alignment, safety, and applications. Understanding this literature is essential for practitioners and researchers.
Key Papers by Category
Foundational Papers
| Paper | Year | Contribution | Impact |
|---|---|---|---|
| Attention Is All You Need | 2017 | Transformer architecture | Foundation for nearly all modern LLMs |
| BERT | 2018 | Bidirectional pre-training | Revolutionized NLP |
| GPT-2 | 2019 | Zero-shot task learning | Showed scale matters |
| GPT-3 | 2020 | In-context learning | Few-shot paradigm |
| T5 | 2019 | Text-to-text framework | Unified NLP tasks |
| PaLM | 2022 | Pathways system | Scalable training |
Architecture Papers
Definition: Architecture Research
Architecture research focuses on designing neural network structures that enable better language understanding and generation, including attention mechanisms, positional encodings, and scaling strategies.
Key papers:
- Transformer (Vaswani et al., 2017): Self-attention mechanism
- GPT (Radford et al., 2018): Decoder-only architecture
- BERT (Devlin et al., 2018): Encoder-only architecture
- T5 (Raffel et al., 2019): Encoder-decoder architecture
- LLaMA (Touvron et al., 2023): Efficient open-source architecture
Training and Alignment
Definition: Alignment Research
Alignment research focuses on training LLMs to follow human intentions and to be helpful, harmless, and honest. This includes RLHF, constitutional AI, and other alignment techniques.
Key papers:
- InstructGPT (Ouyang et al., 2022): RLHF for instruction following
- Constitutional AI (Bai et al., 2022): AI-assisted alignment
- DPO (Rafailov et al., 2023): Direct preference optimization
- Learning to Summarize from Human Feedback (Stiennon et al., 2020): RLHF applied to summarization
- KTO (Ethayarajh et al., 2024): Kahneman-Tversky optimization
Scaling Laws and Emergence
Chinchilla Scaling Law
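$$L(N, D) = E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}}$$
Here,
- $L$ = Test loss
- $N$ = Model parameters
- $D$ = Training tokens
- $E$, $A$, $B$, $\alpha$, $\beta$ = Constants fitted to observed training runs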
Key papers:
- Scaling Laws for Neural LMs (Kaplan et al., 2020): Power law relationships
- Chinchilla (Hoffmann et al., 2022): Optimal scaling
- Emergent Abilities (Wei et al., 2022): Abilities that appear at scale
- Scaling Data-Constrained Language Models (Muennighoff et al., 2023): Data limits
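To make the Chinchilla parametric form concrete, the sketch below evaluates L(N, D) = E + A / N^alpha + B / D^beta in Python. The default constants are the fitted values reported by Hoffmann et al. (2022) and should be treated as approximate; the function name `chinchilla_loss` is only an illustrative choice, not taken from any library.

```python
def chinchilla_loss(n_params: float, n_tokens: float,
                    e: float = 1.69, a: float = 406.4, b: float = 410.7,
                    alpha: float = 0.34, beta: float = 0.28) -> float:
    """Predicted test loss L(N, D) = E + A / N**alpha + B / D**beta.

    Defaults are the constants reported by Hoffmann et al. (2022);
    other model families and datasets would need their own fit.
    """
    return e + a / n_params**alpha + b / n_tokens**beta


# A 70B-parameter model trained on 1.4T tokens (roughly Chinchilla's setup).
loss_70b = chinchilla_loss(70e9, 1.4e12)

# Same model size with half the data: the data term grows, so the
# predicted loss goes up.
loss_70b_half_data = chinchilla_loss(70e9, 0.7e12)

print(f"full data: {loss_70b:.3f}, half data: {loss_70b_half_data:.3f}")
```

The irreducible term E sets a floor: no amount of parameters or tokens pushes the predicted loss below it under this model.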
Efficiency and Optimization
Key papers:
- LoRA (Hu et al., 2022): Low-rank adaptation
- QLoRA (Dettmers et al., 2023): Quantized LoRA
- Flash Attention (Dao et al., 2022): Efficient attention
- GQA (Ainslie et al., 2023): Grouped-query attention
- Speculative Decoding (Leviathan et al., 2023): Fast inference
Safety and Ethics
Key papers:
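- Training Language Models to Follow Instructions with Human Feedback (Ouyang et al., 2022): Aligning model outputs with user intent
- Red Teaming Language Models with Language Models (Perez et al., 2022): Automated discovery of harmful outputs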
- Sleeper Agents (Hubinger et al., 2024): Deceptive alignment
- Constitutional AI (Bai et al., 2022): Principles for AI behavior
Reading Guide
How to Read a Paper
Definition: Paper Reading Strategy
A systematic approach to reading research papers involves understanding the structure, identifying key contributions, and critically evaluating the work.
Reading steps:
- Skim: Read title, abstract, introduction, conclusion (5-10 minutes)
- Understand structure: Identify sections and flow
- Deep read: Read methodology and results carefully
- Critical analysis: Evaluate assumptions, limitations, reproducibility
- Synthesis: Connect to other work and your own research
Paper Structure
| Section | Purpose | What to Look For |
|---|---|---|
| Abstract | Summary | Key contribution, results |
| Introduction | Motivation | Problem, why it matters |
| Related Work | Context | What came before |
| Method | Approach | How they solved it |
| Experiments | Validation | Evidence for claims |
| Discussion | Analysis | Limitations, future work |
| Conclusion | Summary | Key takeaways |
Critical Reading Questions
- What is the main contribution?
- What problem does it solve?
- What are the key assumptions?
- What evidence supports the claims?
- What are the limitations?
- How does it compare to alternatives?
- What are the implications?
- What future work is suggested?
Reading Log Template
## Paper Reading Log
### Paper Information
- Title:
- Authors:
- Year:
- Venue:
- Link:
### Summary
- Problem:
- Approach:
- Key contribution:
- Results:
### Key Insights
- Insight 1:
- Insight 2:
- Insight 3:
### Questions
- Question 1:
- Question 2:
### Connection to My Work
- How does this relate to my research?
- What can I apply?
### Rating
- Importance: 1-5
- Quality: 1-5
- Relevance: 1-5
Research Methodology
Research Process
Definition: LLM Research Process
The LLM research process involves identifying problems, reviewing literature, forming hypotheses, designing experiments, conducting research, and communicating results.
Research phases:
- Problem identification: Find important open problems
- Literature review: Understand existing approaches
- Hypothesis formation: Develop testable hypotheses
- Experimental design: Plan experiments carefully
- Implementation: Build and test systems
- Analysis: Analyze results rigorously
- Communication: Write and present findings
Experimental Design
Experimental Design Principles
Sound experimental design serves three goals:
- Baseline comparisons
- Bias reduction
- Reliability
Key principles:
- Baselines: Compare against strong baselines
- Ablation studies: Understand component contributions
- Statistical significance: Use proper statistical tests
- Reproducibility: Provide code, data, and details
- Multiple runs: Report variance across runs
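The "multiple runs" and "statistical significance" principles above can be sketched with the standard library alone. The example reports mean and standard deviation across seeds and runs a paired sign-flip permutation test; the scores are made-up numbers, and the function names are illustrative assumptions rather than an established API.

```python
import random
import statistics


def summarize_runs(scores: list[float]) -> tuple[float, float]:
    """Mean and sample standard deviation across runs (seeds)."""
    return statistics.mean(scores), statistics.stdev(scores)


def paired_permutation_test(a: list[float], b: list[float],
                            n_resamples: int = 10_000, seed: int = 0) -> float:
    """Two-sided p-value for the mean paired difference between a and b.

    Under the null hypothesis the sign of each paired difference is
    arbitrary, so signs are flipped at random and we count how often the
    resampled mean difference is at least as extreme as the observed one.
    """
    if len(a) != len(b):
        raise ValueError("a and b must be paired (same length)")
    rng = random.Random(seed)
    diffs = [x - y for x, y in zip(a, b)]
    observed = abs(statistics.mean(diffs))
    extreme = 0
    for _ in range(n_resamples):
        flipped = [d if rng.random() < 0.5 else -d for d in diffs]
        if abs(statistics.mean(flipped)) >= observed:
            extreme += 1
    # Add-one smoothing avoids reporting an impossible p = 0.
    return (extreme + 1) / (n_resamples + 1)


# Hypothetical accuracies from five seeds of a baseline and a new method.
baseline = [0.712, 0.705, 0.718, 0.709, 0.714]
proposed = [0.731, 0.725, 0.729, 0.735, 0.727]

mean_b, std_b = summarize_runs(baseline)
mean_p, std_p = summarize_runs(proposed)
p_value = paired_permutation_test(proposed, baseline)
print(f"baseline {mean_b:.4f} +/- {std_b:.4f}")
print(f"proposed {mean_p:.4f} +/- {std_p:.4f}")
print(f"p-value  {p_value:.4f}")
```

With only five paired runs there are just 2^5 = 32 sign patterns, so even a method that wins on every seed cannot reach a two-sided p-value below 2/32. That is one practical reason to plan for more seeds when a significance claim matters.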
Evaluation Methodology
Definition: LLM Evaluation
LLM evaluation systematically measures model performance on specific tasks using appropriate metrics, datasets, and protocols.
Evaluation components:
- Benchmarks: Standardized evaluation datasets
- Metrics: Quantitative measures of performance
- Human evaluation: Subjective quality assessment
- Safety evaluation: Testing for harmful behaviors
- Efficiency evaluation: Computational requirements
Common Evaluation Frameworks
| Framework | Focus | Metrics |
|---|---|---|
| MMLU | Knowledge | Accuracy across subjects |
| HumanEval | Code | Pass@k |
| GSM8K | Math | Accuracy |
| TruthfulQA | Honesty | Truthfulness rate |
| HELM | Holistic | Multiple dimensions |
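Two of the metrics in the table are easy to state in code. The sketch below shows exact-match accuracy and the unbiased pass@k estimator introduced with HumanEval (Chen et al., 2021), where n samples are generated per problem and c of them pass the unit tests. The function names and the input validation are illustrative choices.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate from n samples of which c passed the tests."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the probability that a random
    # size-k subset of the n samples contains no passing sample.
    return 1.0 - comb(n - c, k) / comb(n, k)


def accuracy(predictions: list[str], references: list[str]) -> float:
    """Exact-match accuracy after trimming surrounding whitespace."""
    if len(predictions) != len(references) or not references:
        raise ValueError("need two non-empty lists of equal length")
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return correct / len(references)


# Hypothetical problem: 10 generated solutions, 3 of which pass.
p1 = pass_at_k(n=10, c=3, k=1)
p5 = pass_at_k(n=10, c=3, k=5)
acc = accuracy(["42", "7", "13"], ["42", "8", "13 "])
print(p1, p5, acc)
```

Per-problem pass@k values are then averaged over the benchmark. Real benchmark harnesses also normalize answers before comparing them, which this sketch does not attempt.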
Staying Current
Conferences and Venues
| Venue | Focus | Frequency |
|---|---|---|
| NeurIPS | ML | Annual |
| ICML | ML | Annual |
| ICLR | DL | Annual |
| ACL | NLP | Annual |
| EMNLP | NLP | Annual |
| NAACL | NLP | Annual |
| COLM | Language modeling | Annual |
Pre-print Servers and Discovery Tools
- arXiv: Primary source for ML/NLP papers
- Semantic Scholar: Search and discovery
- Papers With Code: Papers with implementations
- Hugging Face Papers: Curated ML papers
Reading Groups and Communities
Definition: Research Community
Research communities include reading groups, online forums, conferences, and social media where researchers discuss and share work.
Ways to stay current:
- Daily arXiv scanning: Check new papers daily
- Reading groups: Join or start a reading group
- Twitter/X: Follow researchers and discussions
- Newsletters: Subscribe to ML newsletters
- Conferences: Attend talks and workshops
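Daily arXiv scanning can be partly automated. The sketch below builds a query for the public arXiv API and parses the Atom feed it returns. The endpoint and parameter names follow the arXiv API documentation as I understand it, so check them against the current docs before relying on this; the helper names are assumptions made for illustration.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"  # XML namespace used by Atom feeds


def build_arxiv_url(category: str = "cs.CL", max_results: int = 20) -> str:
    """URL for the newest submissions in one arXiv category."""
    params = {
        "search_query": f"cat:{category}",
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return "http://export.arxiv.org/api/query?" + urllib.parse.urlencode(params)


def parse_arxiv_feed(xml_text: str) -> list[dict]:
    """Extract title, publication timestamp, and link from an Atom feed."""
    root = ET.fromstring(xml_text)
    papers = []
    for entry in root.iter(f"{ATOM}entry"):
        papers.append({
            # Titles may be wrapped across lines; collapse the whitespace.
            "title": " ".join(entry.findtext(f"{ATOM}title", "").split()),
            "published": entry.findtext(f"{ATOM}published", ""),
            "link": entry.findtext(f"{ATOM}id", ""),
        })
    return papers


def fetch_new_papers(category: str = "cs.CL", max_results: int = 20) -> list[dict]:
    """Download and parse the feed (performs a network request)."""
    url = build_arxiv_url(category, max_results)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_arxiv_feed(response.read().decode("utf-8"))
```

Calling `fetch_new_papers("cs.CL")` once a day and skimming the titles is usually enough; keep request rates low, since the API is a shared public service.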
Paper Recommendation Systems
Finding Relevant Papers
- Start with citation networks of key papers
- Use Semantic Scholar recommendations
- Follow "similar papers" suggestions
- Check reference lists of recent papers
- Ask researchers in your network
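Checking the reference lists of recent papers can be turned into a simple ranking: works cited by many of the papers you already care about are good candidates to read next. The sketch below assumes you have collected reference lists yourself (for example by hand or from a citation API); the identifiers in the example are hypothetical placeholders.

```python
from collections import Counter


def rank_by_shared_references(reference_lists: dict[str, list[str]],
                              top_n: int = 5) -> list[tuple[str, int]]:
    """Rank cited works by how many of the given papers cite them."""
    counts: Counter = Counter()
    for refs in reference_lists.values():
        # Count each cited work at most once per citing paper.
        counts.update(set(refs))
    # Papers already in the collection are not recommendations.
    for known in reference_lists:
        counts.pop(known, None)
    # Sort by count (descending), then by identifier for a stable order.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]


# Hypothetical reference lists for three recently read papers.
reference_lists = {
    "recent_paper_a": ["transformer", "gpt3", "chinchilla"],
    "recent_paper_b": ["transformer", "chinchilla", "lora"],
    "recent_paper_c": ["transformer", "gpt3"],
}
suggestions = rank_by_shared_references(reference_lists, top_n=3)
print(suggestions)
```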
Writing Research Papers
Paper Structure
Definition: Research Paper Structure
A research paper typically includes: abstract, introduction, related work, method, experiments, results, discussion, conclusion, and references.
Writing Tips
- Clear contribution: State your contribution early
- Motivation: Explain why the problem matters
- Reproducibility: Provide sufficient details
- Honest evaluation: Report both strengths and limitations
- Related work: Properly credit prior work
Common Mistakes
Common research paper mistakes:
- Weak baselines or unfair comparisons
- Missing ablation studies
- Insufficient experimental details
- Overclaiming results
- Ignoring limitations
- Poor writing quality
Practical Implementation
Building a Reading System
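import requests
from datetime import datetime, timedelta


class PaperReader:
    """Small client for the Semantic Scholar Graph API."""

    def __init__(self, api_key: str):
        self.api_key = api_key
        self.base_url = "https://api.semanticscholar.org/graph/v1"

    def _get(self, path: str, params: dict) -> list:
        # Send the API key in the x-api-key header when one is provided.
        headers = {"x-api-key": self.api_key} if self.api_key else {}
        response = requests.get(
            f"{self.base_url}{path}", params=params, headers=headers, timeout=30
        )
        response.raise_for_status()
        # "data" can be absent when a query has no results.
        return response.json().get("data", [])

    def search_papers(self, query: str, limit: int = 10):
        return self._get(
            "/paper/search",
            {
                "query": query,
                "limit": limit,
                "fields": "title,abstract,year,citationCount,url",
            },
        )

    def get_recent_papers(self, topic: str, days: int = 7):
        # Only the publication year is requested, so this filter is
        # year-granular: it keeps papers from the cutoff's year onward,
        # not strictly those from the last `days` days.
        cutoff = datetime.now() - timedelta(days=days)
        papers = self.search_papers(topic, limit=50)
        return [p for p in papers if p.get("year") and p["year"] >= cutoff.year]

    def get_citation_network(self, paper_id: str):
        # Each returned item wraps the citing paper under "citingPaper".
        return self._get(f"/paper/{paper_id}/citations", {"fields": "title,year"})

    def create_reading_list(self, papers: list, priority_fn=None):
        # Sort a copy so the caller's list is left unchanged.
        ranked = sorted(papers, key=priority_fn, reverse=True) if priority_fn else list(papers)
        return ranked[:10]  # Top 10 papers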
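The `priority_fn` argument of `create_reading_list` is left open above. One plausible choice, sketched here as an assumption rather than a recommended standard, is citations per year, which keeps older papers from dominating purely because they have had longer to accumulate citations. The paper dictionaries mirror the `year` and `citationCount` fields requested in `search_papers`; the values are invented.

```python
from datetime import date
from typing import Optional


def citations_per_year(paper: dict, current_year: Optional[int] = None) -> float:
    """Citation count divided by the paper's age in years (at least 1)."""
    if current_year is None:
        current_year = date.today().year
    year = paper.get("year") or current_year
    age = max(current_year - year + 1, 1)
    return (paper.get("citationCount") or 0) / age


# Hypothetical search results.
papers = [
    {"title": "Older paper", "year": 2020, "citationCount": 500},
    {"title": "Newer paper", "year": 2024, "citationCount": 300},
    {"title": "No metadata", "year": None, "citationCount": None},
]

ranked = sorted(
    papers,
    key=lambda p: citations_per_year(p, current_year=2024),
    reverse=True,
)
print([p["title"] for p in ranked])
```

Passing `citations_per_year` as `priority_fn` would rank a reading list the same way, using the current year by default.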
Paper Analysis Template
from dataclasses import dataclass


@dataclass
class PaperAnalysis:
    title: str
    authors: list
    year: int
    venue: str
    # Summary
    problem: str
    approach: str
    contribution: str
    results: str
    # Critical analysis
    strengths: list
    weaknesses: list
    limitations: list
    # Personal notes
    key_insights: list
    questions: list
    connections: list
    # Rating
    importance: int  # 1-5
    quality: int  # 1-5
    relevance: int  # 1-5


def analyze_paper(paper_path: str) -> PaperAnalysis:
    # Read the paper text (assumes a plain-text file, not a PDF)
    with open(paper_path, "r", encoding="utf-8") as f:
        content = f.read()
    # Analyze `content` (using an LLM or manual analysis); the values
    # below are placeholders to be filled in by that analysis.
    analysis = PaperAnalysis(
        title="...",
        authors=[],
        year=2024,
        venue="...",
        problem="...",
        approach="...",
        contribution="...",
        results="...",
        strengths=[],
        weaknesses=[],
        limitations=[],
        key_insights=[],
        questions=[],
        connections=[],
        importance=4,
        quality=4,
        relevance=4,
    )
    return analysis
Research Log
from collections import Counter
from datetime import datetime


class ResearchLog:
    def __init__(self):
        self.papers_read = []
        self.ideas = []
        self.experiments = []
        self.writing = []

    def add_paper(self, paper_analysis: "PaperAnalysis"):
        self.papers_read.append({
            "date": datetime.now(),
            "analysis": paper_analysis
        })

    def add_idea(self, idea: str, source: str):
        self.ideas.append({
            "date": datetime.now(),
            "idea": idea,
            "source": source
        })

    def add_experiment(self, description: str, results: dict):
        self.experiments.append({
            "date": datetime.now(),
            "description": description,
            "results": results
        })

    def get_top_topics(self, n: int = 3):
        # Assumption: PaperAnalysis has no topic field, so the venue of
        # each paper read stands in for its topic here.
        venues = Counter(entry["analysis"].venue for entry in self.papers_read)
        return [venue for venue, _ in venues.most_common(n)]

    def generate_summary(self):
        return {
            "papers_read": len(self.papers_read),
            "ideas_generated": len(self.ideas),
            "experiments_conducted": len(self.experiments),
            "top_topics": self.get_top_topics()
        }
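A research log is only useful if it survives between sessions. The entries above hold `datetime` objects and dataclass instances, which `json.dumps` cannot serialize directly, so the sketch below supplies a `default` hook for both. The `Note` class is a hypothetical stand-in for `PaperAnalysis`, and the helper names are illustrative.

```python
import json
from dataclasses import asdict, dataclass, is_dataclass
from datetime import datetime


def to_jsonable(value):
    """`default` hook for json.dumps: datetimes and dataclass instances."""
    if isinstance(value, datetime):
        return value.isoformat()
    if is_dataclass(value) and not isinstance(value, type):
        return asdict(value)
    raise TypeError(f"Cannot serialize {type(value).__name__}")


def dump_entries(entries: list) -> str:
    """Serialize a list of log entries to a JSON string."""
    return json.dumps(entries, default=to_jsonable, indent=2)


@dataclass
class Note:
    """Hypothetical, shortened stand-in for PaperAnalysis."""
    title: str
    importance: int


entries = [
    {
        "date": datetime(2024, 1, 15, 9, 30),
        "analysis": Note("Attention Is All You Need", 5),
    }
]
text = dump_entries(entries)
restored = json.loads(text)
print(restored[0]["date"], restored[0]["analysis"]["title"])
```

Loading the file back yields plain dictionaries and ISO-format date strings; converting them back into dataclasses and `datetime` objects is left out of this sketch.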
Best Practices
Reading Practice
- Consistency: Read regularly, even if just one paper per week
- Active reading: Take notes and ask questions
- Discussion: Discuss papers with others
- Implementation: Try to reimplement key ideas
- Connection: Connect papers to your own work
Research Practice
- Rigorous evaluation: Use proper baselines and metrics
- Reproducibility: Provide code and detailed methods
- Honesty: Report limitations and negative results
- Collaboration: Work with others when possible
- Communication: Write clearly and present well
Start by reading survey papers to get an overview of a field, then dive into specific papers based on your interests and research needs.
Practice Exercises
- Paper Reading: Read a foundational LLM paper (e.g., "Attention Is All You Need") and write a summary with critical analysis.
- Literature Review: Conduct a literature review on a specific LLM topic. What are the key papers and open problems?
- Experimental Design: Design an experiment to compare two LLM approaches. What metrics, baselines, and statistical tests would you use?
- Research Proposal: Write a brief research proposal for an LLM project. What problem would you solve and how?
Key Takeaways:
- Understanding LLM research requires reading foundational and recent papers
- Use systematic reading strategies to efficiently process papers
- Follow rigorous research methodology for conducting LLM research
- Stay current through conferences, pre-prints, and research communities
- Write clearly and provide sufficient details for reproducibility
What to Learn Next
-> LLM Glossary: Comprehensive glossary of LLM terms and concepts.
-> LLM Tool Ecosystem: Overview of HuggingFace, LangChain, LlamaIndex, and other tools.
-> LLM Best Practices: Best practices for common LLM tasks and applications.
-> LLM Roadmap: Learning roadmap, skill progression, and career paths in LLMs.