
LLM Research Paper Guide — Navigating the Literature

Understanding LLM research requires reading key papers, understanding methodologies, and staying current with rapid advances. This guide provides a roadmap for navigating the research landscape.

Key Papers — Foundational and influential LLM research
Reading Guides — How to read and understand research papers
Research Methodology — Conducting LLM research

Read the classics first, then explore the frontiers.

The LLM field advances rapidly, with new papers published daily. This guide helps you navigate the literature, understand key contributions, and stay current with the field.

Definition: LLM Research Literature

LLM research literature encompasses papers on language modeling, architecture innovations, training methods, evaluation, alignment, safety, and applications. Understanding this literature is essential for practitioners and researchers.

Key Papers by Category

Foundational Papers

Paper	Year	Contribution	Impact
Attention Is All You Need	2017	Transformer architecture	Foundation for all modern LLMs
BERT	2018	Bidirectional pre-training	Revolutionized NLP
GPT-2	2019	Zero-shot task learning	Showed scale matters
GPT-3	2020	In-context learning	Few-shot paradigm
T5	2019	Text-to-text framework	Unified NLP tasks
PaLM	2022	Pathways system	Scalable training

Architecture Papers

Definition: Architecture Research

Architecture research focuses on designing neural network structures that enable better language understanding and generation, including attention mechanisms, positional encodings, and scaling strategies.

Key papers:

Transformer (Vaswani et al., 2017): Self-attention mechanism
GPT (Radford et al., 2018): Decoder-only architecture
BERT (Devlin et al., 2018): Encoder-only architecture
T5 (Raffel et al., 2019): Encoder-decoder architecture
LLaMA (Touvron et al., 2023): Efficient open-source architecture

Training and Alignment

Definition: Alignment Research

Alignment research focuses on training LLMs to follow human intentions, be helpful, harmless, and honest. This includes RLHF, constitutional AI, and other alignment techniques.

Key papers:

InstructGPT (Ouyang et al., 2022): RLHF for instruction following
Constitutional AI (Bai et al., 2022): AI-assisted alignment
DPO (Rafailov et al., 2023): Direct preference optimization
RLHF (Stiennon et al., 2020): Learning from human feedback
KTO (Ethayarajh et al., 2024): Kahneman-Tversky optimization

Scaling Laws and Emergence

Chinchilla Scaling Law

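$$L(N, D) = \left(\frac{N_c}{N}\right)^{\alpha_N} + \left(\frac{D_c}{D}\right)^{\alpha_D} + L_\infty$$

Here,

$L$ = Test loss
$N$ = Model parameters
$D$ = Training tokens
$N_c$, $D_c$ = Fitted scale constants
$\alpha_N$, $\alpha_D$ = Fitted power-law exponents
$L_\infty$ = Irreducible loss that remains as $N$ and $D$ grow without bound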
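
To make the formula concrete, the sketch below evaluates it in Python. The function name and the constants in `EXAMPLE` are assumptions chosen only to show the shape of the curve; they are not fitted values from any paper.

```python
def scaling_loss(n_params, n_tokens, n_c, d_c, alpha_n, alpha_d, l_inf):
    """Evaluate L(N, D) = (N_c/N)^alpha_N + (D_c/D)^alpha_D + L_inf."""
    if n_params <= 0 or n_tokens <= 0:
        raise ValueError("n_params and n_tokens must be positive")
    model_term = (n_c / n_params) ** alpha_n  # shrinks as the model grows
    data_term = (d_c / n_tokens) ** alpha_d   # shrinks as the dataset grows
    return model_term + data_term + l_inf


# Hypothetical constants for illustration only (not fitted values).
EXAMPLE = dict(n_c=1e9, d_c=1e10, alpha_n=0.3, alpha_d=0.3, l_inf=1.7)

small = scaling_loss(1e8, 1e9, **EXAMPLE)
large = scaling_loss(1e10, 1e11, **EXAMPLE)
print(f"small model: {small:.3f}, large model: {large:.3f}")
```

Whatever constants are used, the loss never drops below `l_inf`: scaling the model alone or the data alone leaves the other term as a floor.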

Key papers:

Scaling Laws for Neural LMs (Kaplan et al., 2020): Power law relationships
Chinchilla (Hoffmann et al., 2022): Optimal scaling
Emergent Abilities (Wei et al., 2022): Abilities that appear at scale
Scaling Data-Constrained Language Models (Muennighoff et al., 2023): Data limits

Efficiency and Optimization

Key papers:

LoRA (Hu et al., 2022): Low-rank adaptation
QLoRA (Dettmers et al., 2023): Quantized LoRA
Flash Attention (Dao et al., 2022): Efficient attention
GQA (Ainslie et al., 2023): Grouped-query attention
Speculative Decoding (Leviathan et al., 2023): Fast inference

Safety and Ethics

Key papers:

InstructGPT (Ouyang et al., 2022): Training language models to follow instructions with human feedback
Red Teaming Language Models with Language Models (Perez et al., 2022): Automated discovery of harmful outputs
Sleeper Agents (Hubinger et al., 2024): Deceptive alignment
Constitutional AI (Bai et al., 2022): Principles for AI behavior

Reading Guide

How to Read a Paper

Definition: Paper Reading Strategy

A systematic approach to reading research papers involves understanding the structure, identifying key contributions, and critically evaluating the work.

Reading steps:

Skim: Read title, abstract, introduction, conclusion (5-10 minutes)
Understand structure: Identify sections and flow
Deep read: Read methodology and results carefully
Critical analysis: Evaluate assumptions, limitations, reproducibility
Synthesis: Connect to other work and your own research

Paper Structure

Section	Purpose	What to Look For
Abstract	Summary	Key contribution, results
Introduction	Motivation	Problem, why it matters
Related Work	Context	What came before
Method	Approach	How they solved it
Experiments	Validation	Evidence for claims
Discussion	Analysis	Limitations, future work
Conclusion	Summary	Key takeaways

Critical Reading Questions

What is the main contribution?
What problem does it solve?
What are the key assumptions?
What evidence supports the claims?
What are the limitations?
How does it compare to alternatives?
What are the implications?
What future work is suggested?

Reading Log Template

## Paper Reading Log

### Paper Information
- Title:
- Authors:
- Year:
- Venue:
- Link:

### Summary
- Problem:
- Approach:
- Key contribution:
- Results:

### Key Insights
- Insight 1:
- Insight 2:
- Insight 3:

### Questions
- Question 1:
- Question 2:

### Connection to My Work
- How does this relate to my research?
- What can I apply?

### Rating
- Importance: 1-5
- Quality: 1-5
- Relevance: 1-5
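
If you keep many logs, it can help to generate them from structured data. The following is a minimal sketch, assuming each entry is a plain dict; the `SECTIONS` layout, the key names, and `render_reading_log` are hypothetical, and only a subset of the template's sections is covered to keep the example short.

```python
# (heading, [(label shown in the log, key looked up in the entry dict)])
SECTIONS = [
    ("Paper Information", [("Title", "title"), ("Authors", "authors"),
                           ("Year", "year"), ("Venue", "venue"), ("Link", "link")]),
    ("Summary", [("Problem", "problem"), ("Approach", "approach"),
                 ("Key contribution", "contribution"), ("Results", "results")]),
    ("Rating", [("Importance", "importance"), ("Quality", "quality"),
                ("Relevance", "relevance")]),
]


def render_reading_log(entry: dict) -> str:
    """Render one reading-log entry as Markdown; missing fields stay blank."""
    lines = ["## Paper Reading Log"]
    for heading, fields in SECTIONS:
        lines += ["", f"### {heading}"]
        for label, key in fields:
            # rstrip() drops the trailing space when the value is empty
            lines.append(f"- {label}: {entry.get(key, '')}".rstrip())
    return "\n".join(lines)


print(render_reading_log({"title": "Attention Is All You Need", "year": 2017}))
```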

Research Methodology

Research Process

Definition: LLM Research Process

The LLM research process involves identifying problems, reviewing literature, forming hypotheses, designing experiments, conducting research, and communicating results.

Research phases:

Problem identification: Find important open problems
Literature review: Understand existing approaches
Hypothesis formation: Develop testable hypotheses
Experimental design: Plan experiments carefully
Implementation: Build and test systems
Analysis: Analyze results rigorously
Communication: Write and present findings

Experimental Design

Experimental Design Principles

$$\text{Valid Inference} = \text{Control} + \text{Randomization} + \text{Replication}$$

Here,

$\text{Control}$ = Baseline comparisons
$\text{Randomization}$ = Reduce bias
$\text{Replication}$ = Ensure reliability

Key principles:

Baselines: Compare against strong baselines
Ablation studies: Understand component contributions
Statistical significance: Use proper statistical tests
Reproducibility: Provide code, data, and details
Multiple runs: Report variance across runs
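
The "multiple runs" and "statistical significance" principles can be sketched with the standard library alone. The example below is one possible approach, not a prescribed protocol: it reports mean and standard deviation across seeds and runs a paired sign-flip permutation test. The function names and all scores are hypothetical placeholders, not real results.

```python
import random
import statistics


def summarize_runs(scores):
    """Return (mean, sample standard deviation) across independent runs."""
    return statistics.mean(scores), statistics.stdev(scores)


def paired_permutation_test(a, b, n_resamples=10_000, seed=0):
    """Two-sided paired permutation (sign-flip) test on per-seed differences.

    Returns an approximate p-value for the null hypothesis that the two
    systems perform the same on average.
    """
    if len(a) != len(b):
        raise ValueError("a and b must have one score per shared seed")
    rng = random.Random(seed)
    diffs = [x - y for x, y in zip(a, b)]
    observed = abs(statistics.mean(diffs))
    hits = 0
    for _ in range(n_resamples):
        # Under the null, each paired difference is equally likely to flip sign.
        flipped = [d if rng.random() < 0.5 else -d for d in diffs]
        if abs(statistics.mean(flipped)) >= observed:
            hits += 1
    # Add-one smoothing keeps the estimate away from an impossible p = 0.
    return (hits + 1) / (n_resamples + 1)


# Hypothetical accuracies from five seeds; not real results.
baseline = [0.712, 0.705, 0.718, 0.709, 0.714]
candidate = [0.731, 0.726, 0.735, 0.729, 0.733]

mean_b, std_b = summarize_runs(baseline)
mean_c, std_c = summarize_runs(candidate)
p_value = paired_permutation_test(candidate, baseline)
print(f"baseline  {mean_b:.3f} +/- {std_b:.3f}")
print(f"candidate {mean_c:.3f} +/- {std_c:.3f}")
print(f"p-value   {p_value:.4f}")
```

With only five paired runs, the smallest two-sided p-value a sign-flip test can produce is 2/32 (about 0.06), so even a consistent improvement cannot reach the conventional 0.05 threshold. This is one practical reason to plan for more seeds up front.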

Evaluation Methodology

Definition: LLM Evaluation

LLM evaluation systematically measures model performance on specific tasks using appropriate metrics, datasets, and protocols.

Evaluation components:

Benchmarks: Standardized evaluation datasets
Metrics: Quantitative measures of performance
Human evaluation: Subjective quality assessment
Safety evaluation: Testing for harmful behaviors
Efficiency evaluation: Computational requirements

Common Evaluation Frameworks

Framework	Focus	Metrics
MMLU	Knowledge	Accuracy across subjects
HumanEval	Code	Pass@k
GSM8K	Math	Accuracy
TruthfulQA	Honesty	Truthfulness rate
HELM	Holistic	Multiple dimensions
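
Of the metrics in the table, Pass@k is the least self-explanatory. A common way to compute it is the unbiased estimator introduced with HumanEval (Chen et al., 2021): generate n samples per problem, count the c that pass the unit tests, and estimate the chance that at least one of k drawn samples passes. The helper names below are assumptions for this sketch.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: 1 - C(n - c, k) / C(n, k).

    n: samples generated per problem
    c: samples that pass the unit tests
    k: sampling budget being evaluated (k <= n)
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k samples fail, giving pass@k = 1.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results, k):
    """Average pass@k over problems; results is a list of (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical per-problem (n, c) counts, for illustration only.
print(mean_pass_at_k([(10, 0), (10, 5), (10, 10)], k=1))
```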

Staying Current

Conferences and Venues

Venue	Focus	Frequency
NeurIPS	ML	Annual
ICML	ML	Annual
ICLR	DL	Annual
ACL	NLP	Annual
EMNLP	NLP	Annual
NAACL	NLP	Annual
COLM	Language modeling	Annual

Pre-print Servers and Discovery Tools

arXiv: Primary source for ML/NLP papers
Semantic Scholar: Search and discovery
Papers With Code: Papers with implementations
Hugging Face Papers: Curated ML papers

Reading Groups and Communities

Definition: Research Community

Research communities include reading groups, online forums, conferences, and social media where researchers discuss and share work.

Ways to stay current:

Daily arXiv scanning: Check new papers daily
Reading groups: Join or start a reading group
Twitter/X: Follow researchers and discussions
Newsletters: Subscribe to ML newsletters
Conferences: Attend talks and workshops
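
Daily arXiv scanning is easy to automate. The sketch below builds a query for the newest submissions in one category and parses the Atom feed that arXiv's public API returns. The endpoint and parameter names follow arXiv's API documentation as best understood here; the helper names and the default category are assumptions. Fetching is left to the caller so the parsing logic can be tried offline.

```python
import urllib.parse
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(category="cs.CL", max_results=20):
    """URL for the most recently submitted papers in one arXiv category."""
    params = {
        "search_query": f"cat:{category}",
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urllib.parse.urlencode(params)}"


def parse_feed(atom_xml: str):
    """Extract title, link, and publication timestamp from an Atom feed."""
    root = ET.fromstring(atom_xml)
    papers = []
    for entry in root.findall("atom:entry", ATOM):
        papers.append({
            # Titles may wrap across lines in the feed; collapse the whitespace.
            "title": " ".join(entry.findtext("atom:title", "", ATOM).split()),
            "link": entry.findtext("atom:id", "", ATOM),
            "published": entry.findtext("atom:published", "", ATOM),
        })
    return papers


# To fetch for real (network access required):
#   import urllib.request
#   xml_text = urllib.request.urlopen(build_query_url()).read().decode("utf-8")
#   for paper in parse_feed(xml_text):
#       print(paper["published"], paper["title"])
```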

Paper Recommendation Systems

Finding Relevant Papers

Start with citation networks of key papers
Use Semantic Scholar recommendations
Follow "similar papers" suggestions
Check reference lists of recent papers
Ask researchers in your network
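
One simple way to act on the first and fourth suggestions is to count how many of your seed papers cite each candidate: a paper that appears in several reference lists is likely central to the topic. This is a sketch of that heuristic only; the function name and the placeholder IDs are assumptions, and the reference lists would come from whatever source you use.

```python
from collections import Counter


def rank_by_shared_references(reference_lists, already_read=()):
    """Rank papers by how many seed papers cite them.

    reference_lists: dict mapping a seed paper to the titles/IDs it references
    already_read: papers to exclude from the suggestions
    """
    # Seeds themselves are excluded too, since they are already known.
    skip = set(already_read) | set(reference_lists)
    counts = Counter()
    for refs in reference_lists.values():
        counts.update(set(refs) - skip)  # each seed counts a paper at most once
    # Sort by count (descending), then by name so ties are deterministic.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Placeholder IDs for illustration only.
seeds = {
    "seed-A": ["p1", "p2", "p3"],
    "seed-B": ["p2", "p3", "p4"],
    "seed-C": ["p3", "seed-A"],
}
for paper, n_seeds in rank_by_shared_references(seeds):
    print(f"{paper}: cited by {n_seeds} seed paper(s)")
```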

Writing Research Papers

Paper Structure

Definition: Research Paper Structure

A research paper typically includes: abstract, introduction, related work, method, experiments, results, discussion, conclusion, and references.

Writing Tips

Clear contribution: State your contribution early
Motivation: Explain why the problem matters
Reproducibility: Provide sufficient details
Honest evaluation: Report both strengths and limitations
Related work: Properly credit prior work

Common Mistakes

Common research paper mistakes:

Weak baselines or unfair comparisons
Missing ablation studies
Insufficient experimental details
Overclaiming results
Ignoring limitations
Poor writing quality

Practical Implementation

Building a Reading System

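import requests
from datetime import datetime, timedelta


class PaperReader:
    """Small client for the Semantic Scholar Graph API."""

    def __init__(self, api_key: str):
        self.api_key = api_key
        self.base_url = "https://api.semanticscholar.org/graph/v1"

    def _get(self, path: str, params: dict) -> list:
        # Send the API key (if one was given) in the x-api-key header
        headers = {"x-api-key": self.api_key} if self.api_key else {}
        response = requests.get(
            f"{self.base_url}{path}", params=params, headers=headers, timeout=30
        )
        response.raise_for_status()  # surface rate limits and bad requests
        # "data" may be absent when nothing matches
        return response.json().get("data", [])

    def search_papers(self, query: str, limit: int = 10):
        return self._get(
            "/paper/search",
            {
                "query": query,
                "limit": limit,
                "fields": "title,abstract,year,citationCount,url",
            },
        )

    def get_recent_papers(self, topic: str, days: int = 7):
        # Only the publication year is requested, so this filter is
        # year-granular: it keeps papers from the cutoff date's year onward
        cutoff = datetime.now() - timedelta(days=days)
        papers = self.search_papers(topic, limit=50)
        return [p for p in papers if p.get("year") and p["year"] >= cutoff.year]

    def get_citation_network(self, paper_id: str):
        # Each returned item wraps the citing paper under the "citingPaper" key
        return self._get(f"/paper/{paper_id}/citations", {"fields": "title,year"})

    def create_reading_list(self, papers: list, priority_fn=None):
        # sorted() returns a new list, leaving the caller's list untouched
        if priority_fn:
            papers = sorted(papers, key=priority_fn, reverse=True)
        return papers[:10]  # Top 10 papers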
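
The `priority_fn` argument is left open above. One possible choice, sketched here as an assumption rather than a recommendation, is citations per year, which keeps older papers from dominating purely through accumulated citations. The records are placeholders that reuse the `year` and `citationCount` field names requested in `search_papers`; `citations_per_year` and `top_papers` are hypothetical helpers.

```python
from datetime import date


def citations_per_year(paper, current_year=None):
    """Score a paper by citations per year since publication.

    Papers missing `year` or `citationCount` score 0 so they sort last.
    """
    current_year = current_year or date.today().year
    year = paper.get("year")
    citations = paper.get("citationCount")
    if not year or citations is None:
        return 0.0
    age = max(current_year - year + 1, 1)  # count the publication year itself
    return citations / age


def top_papers(papers, priority_fn, n=10):
    """Return the n highest-priority papers without mutating the input."""
    return sorted(papers, key=priority_fn, reverse=True)[:n]


# Placeholder records for illustration only.
papers = [
    {"title": "old", "year": 2015, "citationCount": 1000},
    {"title": "new", "year": 2024, "citationCount": 300},
    {"title": "unknown"},
]
ranked = top_papers(papers, lambda p: citations_per_year(p, current_year=2024))
print([p["title"] for p in ranked])
```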

Paper Analysis Template

from dataclasses import dataclass


@dataclass
class PaperAnalysis:
    title: str
    authors: list
    year: int
    venue: str
    
    # Summary
    problem: str
    approach: str
    contribution: str
    results: str
    
    # Critical analysis
    strengths: list
    weaknesses: list
    limitations: list
    
    # Personal notes
    key_insights: list
    questions: list
    connections: list
    
    # Rating
    importance: int  # 1-5
    quality: int  # 1-5
    relevance: int  # 1-5

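def analyze_paper(paper_path: str) -> PaperAnalysis:
    # Read the paper; this assumes a plain-text export, not a raw PDF
    with open(paper_path, "r", encoding="utf-8") as f:
        content = f.read()

    # Placeholder analysis: `content` would be passed to an LLM or reviewed
    # manually; the values below are dummies that show the expected shape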
    analysis = PaperAnalysis(
        title="...",
        authors=[],
        year=2024,
        venue="...",
        problem="...",
        approach="...",
        contribution="...",
        results="...",
        strengths=[],
        weaknesses=[],
        limitations=[],
        key_insights=[],
        questions=[],
        connections=[],
        importance=4,
        quality=4,
        relevance=4
    )
    
    return analysis

Research Log

from collections import Counter
from datetime import datetime


class ResearchLog:
    def __init__(self):
        self.papers_read = []
        self.ideas = []
        self.experiments = []
        self.writing = []

    def add_paper(self, paper_analysis: "PaperAnalysis"):
        self.papers_read.append({
            "date": datetime.now(),
            "analysis": paper_analysis
        })

    def add_idea(self, idea: str, source: str):
        self.ideas.append({
            "date": datetime.now(),
            "idea": idea,
            "source": source
        })

    def add_experiment(self, description: str, results: dict):
        self.experiments.append({
            "date": datetime.now(),
            "description": description,
            "results": results
        })

    def get_top_topics(self, n: int = 3):
        # Assumption: PaperAnalysis has no topic field, so each paper's
        # venue stands in for its topic here
        venues = Counter(entry["analysis"].venue for entry in self.papers_read)
        return [venue for venue, _ in venues.most_common(n)]

    def generate_summary(self):
        return {
            "papers_read": len(self.papers_read),
            "ideas_generated": len(self.ideas),
            "experiments_conducted": len(self.experiments),
            "top_topics": self.get_top_topics()
        }
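
A log like this is only useful if it survives between sessions. The entries above hold `datetime` objects and dataclass instances, neither of which `json.dumps` can serialize directly, so the sketch below supplies a `default` hook for both. The helper names `_encode` and `dump_entries` are hypothetical.

```python
import json
from dataclasses import asdict, is_dataclass
from datetime import datetime


def _encode(obj):
    """Fallback encoder for values json.dumps cannot handle natively."""
    if isinstance(obj, datetime):
        return obj.isoformat()
    if is_dataclass(obj) and not isinstance(obj, type):
        return asdict(obj)  # dataclass instance -> plain dict
    raise TypeError(f"cannot serialize {type(obj).__name__}")


def dump_entries(entries) -> str:
    """Serialize a list of log entries (dicts) to a JSON string."""
    return json.dumps(entries, default=_encode, indent=2)


# Placeholder entry shaped like the ones ResearchLog.add_idea stores.
example = [{"date": datetime(2024, 1, 2, 3, 4, 5), "idea": "x", "source": "y"}]
print(dump_entries(example))
```

The same function works for the `papers_read`, `ideas`, and `experiments` lists, since each is a list of dicts.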

Best Practices

Reading Practice

Consistency: Read regularly, even if just one paper per week
Active reading: Take notes and ask questions
Discussion: Discuss papers with others
Implementation: Try to reimplement key ideas
Connection: Connect papers to your own work

Research Practice

Rigorous evaluation: Use proper baselines and metrics
Reproducibility: Provide code and detailed methods
Honesty: Report limitations and negative results
Collaboration: Work with others when possible
Communication: Write clearly and present well

Start by reading survey papers to get an overview of a field, then dive into specific papers based on your interests and research needs.

Practice Exercises

Paper Reading: Read a foundational LLM paper (e.g., "Attention Is All You Need") and write a summary with critical analysis.
Literature Review: Conduct a literature review on a specific LLM topic. What are the key papers and open problems?
Experimental Design: Design an experiment to compare two LLM approaches. What metrics, baselines, and statistical tests would you use?
Research Proposal: Write a brief research proposal for an LLM project. What problem would you solve and how?

Key Takeaways:

Understanding LLM research requires reading foundational and recent papers
Use systematic reading strategies to efficiently process papers
Follow rigorous research methodology for conducting LLM research
Stay current through conferences, pre-prints, and research communities
Write clearly and provide sufficient details for reproducibility

What to Learn Next

-> LLM Glossary Comprehensive glossary of LLM terms and concepts.

-> LLM Tool Ecosystem Overview of HuggingFace, LangChain, LlamaIndex, and other tools.

-> LLM Best Practices Best practices for common LLM tasks and applications.

-> LLM Roadmap Learning roadmap, skill progression, and career paths in LLMs.

