Implementing the Lazy Loading Pattern in Python

Overview

Lazy loading is a design pattern that delays the initialization of an object or the execution of a process until it is strictly necessary. By avoiding "eager" loading—where a program fetches all data upfront—you can significantly improve application startup times and reduce memory overhead. This technique proves vital when dealing with massive datasets, such as CSV files with millions of rows, where immediate interaction is prioritized over complete data availability.

Prerequisites

To follow this guide, you should have a solid understanding of Python fundamentals, including functions and decorators. Familiarity with basic data structures such as lists and dictionaries, along with an understanding of how I/O operations affect performance, will help you grasp the trade-offs involved in lazy evaluation.

Key Libraries & Tools

  • functools: A standard library module providing higher-order functions; specifically, we use cache for memoization.
  • typing: Used for type hinting, particularly the Generator type to define data streams.
  • threading: Enables background execution for preloading data without blocking the main UI thread.
  • csv: Python's built-in module for parsing tabular data.

Code Walkthrough


The Naive Approach

Most developers start with eager loading. The program blocks while reading the entire file into memory before showing a user interface.

import csv

def load_sales_data(path):
    # This blocks the UI for 10+ seconds
    with open(path, 'r') as f:
        return list(csv.DictReader(f))

Integrating functools.cache

To prevent redundant file reads, we apply the functools.cache decorator. This ensures that subsequent calls return the stored result instantly.

import csv
from functools import cache

@cache
def load_sales(path):
    print("Loading data...")
    with open(path, 'r') as f:
        return list(csv.DictReader(f))
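The memoization effect is easy to observe. Here is a minimal sketch that uses a counter instead of file I/O so the behavior is visible (expensive and call_count are names introduced for this demo):

```python
from functools import cache

call_count = 0

@cache
def expensive(x):
    # The body runs once per distinct argument; repeats hit the cache.
    global call_count
    call_count += 1
    return x * 2

expensive(5)
expensive(5)        # cache hit: the body does not run again
expensive(7)        # new argument: the body runs once more
print(call_count)   # 2
```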

Lazy Streaming with Generators

If you only need a subset of data (e.g., the first 10,000 records), loading the whole file is wasteful. Generators allow you to stream rows one by one using the yield keyword.

import csv
from typing import Generator

def load_sales_gen(path) -> Generator[dict, None, None]:
    with open(path, 'r') as f:
        reader = csv.DictReader(f)
        for row in reader:
            yield row
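With the generator in place, itertools.islice pairs naturally with it to pull just the leading rows. A self-contained sketch (the throwaway CSV and its column names are invented here for illustration):

```python
import csv
import os
from itertools import islice
from tempfile import NamedTemporaryFile

# Build a small throwaway CSV so the example is self-contained.
with NamedTemporaryFile("w", suffix=".csv", delete=False, newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["id", "amount"])
    writer.writerows([i, i * 10] for i in range(100))
    sample_path = f.name

def load_sales_gen(path):
    # Each row is parsed only when the consumer asks for it.
    with open(path, newline="") as f:
        yield from csv.DictReader(f)

# Take just the first 3 rows; the other 97 are never parsed.
first_three = list(islice(load_sales_gen(sample_path), 3))
print(first_three[0])  # {'id': '0', 'amount': '0'}
os.remove(sample_path)
```

Note that DictReader returns every value as a string; numeric conversion is left to the consumer.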

Implementing Time-Limited Caching (TTL)

For volatile data like API conversion rates, a permanent cache is dangerous. We implement a custom Time-To-Live (TTL) decorator to refresh data periodically.

import time

def ttl_cache(seconds: int):
    def decorator(func):
        cache_data = {}
        def wrapper(*args):
            now = time.time()
            # Serve from the cache only while the entry is still fresh
            if args in cache_data and (now - cache_data[args]['time'] < seconds):
                return cache_data[args]['result']
            result = func(*args)
            cache_data[args] = {'result': result, 'time': now}
            return result
        return wrapper
    return decorator
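In use, the decorator looks like this. A sketch with a simulated API call (get_rate, the 1.08 value, and the calls counter are placeholders, not a real exchange-rate API):

```python
import time

def ttl_cache(seconds):
    """Cache results, but discard entries older than `seconds`."""
    def decorator(func):
        cache_data = {}
        def wrapper(*args):
            now = time.time()
            entry = cache_data.get(args)
            if entry is not None and now - entry['time'] < seconds:
                return entry['result']
            result = func(*args)
            cache_data[args] = {'result': result, 'time': now}
            return result
        return wrapper
    return decorator

calls = 0

@ttl_cache(seconds=1)
def get_rate(currency):
    # Stand-in for a real network request
    global calls
    calls += 1
    return 1.08

get_rate("EUR")
get_rate("EUR")   # within the TTL window: served from the cache
time.sleep(1.1)
get_rate("EUR")   # TTL expired: the underlying function runs again
print(calls)      # 2
```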

Syntax Notes

Using yield transforms a standard function into a generator object. This object adheres to the iterator protocol, meaning it doesn't compute its values until you iterate over it. Combined with @functools.cache, you create a system that is both efficient on first run and lightning-fast on subsequent calls.
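The deferred-computation behavior described above can be observed directly with next() (the row dicts here are illustrative):

```python
def rows():
    print("opening file")  # deferred: runs only once iteration starts
    yield {"id": 1}
    yield {"id": 2}

gen = rows()          # no output yet: the body has not executed
first = next(gen)     # prints "opening file", then yields the first row
second = next(gen)    # resumes execution after the first yield
print(first, second)
```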

Practical Examples

  • Web Interfaces: Displaying a login screen while assets load in the background.
  • ORMs: Django uses lazy loading to delay database queries until a specific field is accessed.
  • Large Data Science: Pandas and TensorFlow utilize similar principles to manage memory-intensive operations.

Tips & Gotchas

Avoid caching functions that rely on external state unless you use a TTL mechanism. Be cautious with threading; while preloading data in a background thread improves responsiveness, it introduces complexity regarding thread safety. Finally, remember that lazy loading can hide performance bottlenecks; a simple property access might unexpectedly trigger a massive 30-second database query.
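As a sketch of the background-preloading idea (the slow read is simulated with sleep, and the file name and row values are illustrative), with a lock guarding the shared dictionary to stay thread-safe:

```python
import threading
import time

preloaded = {}
lock = threading.Lock()

def preload(path):
    time.sleep(0.1)  # simulate a slow disk read
    with lock:
        preloaded[path] = ["row1", "row2"]  # stand-in for parsed rows

worker = threading.Thread(target=preload, args=("sales.csv",), daemon=True)
worker.start()
# The main thread stays free here, e.g. to render a login screen.
worker.join()  # a real app would check readiness instead of blocking
with lock:
    data = preloaded["sales.csv"]
print(data)
```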
