SQLAlchemy

Products

Feb 2023 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Feb 2023

Mar 2023 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Mar 2023

Sep 2023 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Sep 2023

Oct 2023 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Oct 2023

Jan 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Jan 2024

Apr 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Apr 2024

May 2024 • 2 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 2 videos across 1 sources.

May 2024

Jul 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Jul 2024

Aug 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Aug 2024

Oct 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Oct 2024

Nov 2024 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Nov 2024

Jun 2025 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Jun 2025

Oct 2025 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Oct 2025

Dec 2025 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Dec 2025

Mar 2026 • 1 videos

High activity month for SQLAlchemy. ArjanCodes among the most active voices, with 1 videos across 1 sources.

Mar 2026

TL;DR

ArjanCodes (7 mentions) highlights SQLAlchemy as the Object-Relational Mapper (ORM) for database persistence in videos like "How to Tell If Your Code Is Actually Production-Ready" and mentions its usefulness for understanding database session management.

// ArjanCodes
The Problem with Framework Coupling Many developers start a FastAPI project by writing business logic directly inside their route handlers. It starts simple. You take a database connection from a dependency, run a few SQLAlchemy queries, check some conditions, and return a dictionary. But this approach creates a tangled mess where your core business rules are inseparable from the transport layer and the database implementation. When your domain logic raises `HTTPException`, it becomes untestable without a running web server. When it accepts a database `Session` object, you can't run a unit test without mocking complex SQLAlchemy internals or spinning up a real database. This coupling makes switching frameworks—like moving from SQL to NoSQL or FastAPI to another web framework—nearly impossible without a total rewrite. The solution is the Ports and Adapters architecture, also known as Hexagonal Architecture. Prerequisites To follow this tutorial, you should be comfortable with Python 3.10+ and have a baseline understanding of asynchronous programming. You should also understand basic REST API concepts and Object-Relational Mapping (ORM). Knowledge of type hinting and Pydantic will help you grasp the data modeling sections. Key Libraries & Tools * FastAPI: A modern, fast web framework for building APIs with Python. * SQLAlchemy: The Python SQL toolkit and ORM used here to manage database interactions. * typing.Protocol: A structural typing mechanism used to define "Ports" without forced inheritance. * dataclasses: Built-in Python decorators used to create clean domain models. Code Walkthrough: Isolating the Domain The first step is creating a pure domain layer. This layer must have zero imports from external frameworks. We define our own exceptions to replace HTTP errors. ```python domain/errors.py class DomainError(Exception): pass class OutOfStock(DomainError): def __init__(self, sku: str, requested: int, available: int): super().__init__(f"SKU {sku} is out of stock. Requested {requested}, only {available} left.") ``` Next, we define the "Port." In Python, a Port is best represented by a `Protocol`. It defines *what* the domain needs from the outside world without specifying *how* it's done. ```python domain/ports.py from typing import Protocol class InventoryPort(Protocol): def get_stock(self, sku: str) -> int: ... def reserve(self, sku: str, quantity: int) -> int: ... def exists(self, sku: str) -> bool: ... ``` Now, we write the "Use Case." This is pure logic that only speaks the language of the domain. It takes the `InventoryPort` as an argument, allowing us to swap the real database for a simple mock during testing. ```python domain/use_cases.py def place_order(inventory: InventoryPort, request: OrderRequest) -> OrderPlaced: if not inventory.exists(request.sku): raise UnknownSKU(request.sku) available = inventory.get_stock(request.sku) if available < request.quantity: raise OutOfStock(request.sku, request.quantity, available) remaining = inventory.reserve(request.sku, request.quantity) return OrderPlaced(sku=request.sku, quantity=request.quantity, remaining_stock=remaining) ``` Implementing the Adapter The Adapter is the glue code. The `SQLAlchemyInventoryAdapter` implements the `InventoryPort` using real database calls. The FastAPI route then becomes a thin translation layer: it converts incoming JSON to a domain `OrderRequest`, calls the use case, and maps domain errors back to HTTP status codes. ```python adapters/sql_alchemy.py class SQLAlchemyInventoryAdapter(InventoryPort): def __init__(self, conn: Connection): self.conn = conn def get_stock(self, sku: str) -> int: # Real SQL Alchemy logic here ... ``` Syntax Notes & Best Practices * **Structural Typing**: Using `typing.Protocol` is preferred over `abc.ABC` because it allows for duck-typing. Any class with the right methods automatically satisfies the port. * **Data Classes**: Use `@dataclass(frozen=True)` for domain models. Immutability prevents side effects within your business logic. * **Error Translation**: Always catch domain errors at the edge (the API adapter) and translate them to transport-specific errors. This keeps your domain "clean." Tips & Gotchas Don't let the extra files intimidate you. While Ports and Adapters adds boilerplate, it drastically reduces the "Cognitive Load" of testing. If your use case needs to be atomic, handle the transaction logic within your adapter implementation rather than polluting the domain with `db.commit()` calls. This keeps your architecture flexible for future changes.
Mar 6, 2026
// ArjanCodes
Dec 26, 2025
// ArjanCodes
Oct 3, 2025
// ArjanCodes
Jun 27, 2025
// ArjanCodes
Nov 15, 2024
// ArjanCodes
Python's ecosystem is a vast landscape of hidden gems that can transform your development workflow. Whether you are building complex backend systems or just trying to automate a mundane task, the right library can save you hours of manual labor. This collection focuses on tools that range from foundational industry standards to niche utilities that solve specific, annoying problems. Data Management and Scraping Essentials At the top of the list for any serious backend developer is SQLAlchemy. It acts as a powerful Object-Relational Mapper (ORM), allowing you to define database tables as Python classes. By abstracting SQL complexity, it helps you avoid injection attacks through automatic input sanitization. However, if you are performing highly optimized, complex queries, the abstraction might occasionally get in your way. For those pulling data from the web, Beautiful%20Soup remains the undisputed champion. It parses HTML and XML with ease, though you must always check a site's `robots.txt` file to ensure your scraping activities are permitted. Scientific Computing and Project Scaffolding If your work involves symbolic mathematics, SymPy is a fascinating tool that solves algebraic equations symbolically rather than numerically. It is a favorite for engineers, though its lack of precise type annotations in certain modules can be a minor hurdle for those who rely heavily on IDE autocompletion. On the project management side, Cookiecutter is indispensable for reducing boilerplate. It uses templates to scaffold entire projects instantly. The only caveat is the sheer volume of templates available; you might spend more time picking the "perfect" Fast API template than it would take to write the code yourself if you aren't careful. Visualization and Automation Tools Data cleaning is often the most tedious part of data science. Missingno provides a visual way to understand where your datasets are incomplete, using Matplotlib to generate heatmaps of missing values. For automation, Watchdog is a standout library for monitoring file system events. It can trigger scripts the moment a file is modified or created in a specific directory. This is perfect for building local development tools that auto-reload or for security monitoring on sensitive folders. Functional Patterns and Modern UI Python is increasingly embracing functional programming concepts. The Returns package brings monads and pattern matching to error handling, allowing you to use `Success` and `Failure` objects instead of traditional try-except blocks. Similarly, the Pipe library introduces a cleaner syntax for chaining operations. When it comes to user interfaces, NiceGUI offers a refreshing alternative to old-school frameworks like Tkinter. It lets you build web-based dashboards directly in Python, which is ideal for internal tools that need to look modern without requiring a dedicated frontend team. The Cutting Edge: AI and Beyond The list concludes with a look at the future: Screenshot-to-code. While not a traditional library you'd pip install into every project, this deep-learning tool uses models like GPT-4 or Claude to convert UI screenshots directly into HTML and CSS. It’s a glimpse into a world where the bridge between design and implementation is fully automated. Exploring these tools isn't just about adding more packages to your requirements file; it’s about expanding your problem-solving toolkit. Try integrating one of these into your next sprint and see how much friction it removes from your build process.
Oct 18, 2024
// ArjanCodes
Overview Python 3.13 introduces an experimental feature that allows developers to run code without the Global Interpreter Lock (GIL). Historically, the GIL prevented multiple threads from executing Python bytecode simultaneously to ensure memory safety. Removing this lock enables true parallelism, allowing CPU-bound tasks to utilize multiple processor cores effectively within a single process. This guide explores the technical shift toward a "no-GIL" Python ecosystem. Prerequisites To follow this exploration, you should understand: * **Threading vs. Multiprocessing**: Knowing how Python handles concurrent execution. * **CPython Internals**: Basic familiarity with how the default Python interpreter manages memory. * **Compilation**: Comfort with building Python from source, as early no-GIL builds require custom flags. Key Libraries & Tools * **CPython**: The standard Python implementation currently undergoing these architectural changes. * **threading**: The built-in module for managing concurrent execution threads. * **multiprocessing**: A module used to side-step the GIL by spawning separate memory spaces. * **FastAPI** and **SQLAlchemy**: High-level frameworks that may require updates for thread-safety in a no-GIL environment. Code Walkthrough Testing the impact of the GIL involves comparing standard threaded execution against a no-GIL build. In a standard environment, the following CPU-bound task gains no speed from threading: ```python import threading import time def count_primes(n): # Intensive calculation logic here pass Standard threading hampered by the GIL threads = [threading.Thread(target=count_primes, args=(1000000,)) for _ in range(4)] start = time.perf_counter() for t in threads: t.start() for t in threads: t.join() print(f"Elapsed: {time.perf_counter() - start}") ``` When running this on a Python 3.13 build with the GIL disabled, the execution time drops significantly. The interpreter no longer forces threads to wait for the mutex, allowing the operating system to distribute the `count_primes` workload across four physical CPU cores simultaneously. Syntax Notes Disabling the GIL is currently a build-time configuration. Developers check for the status using `sys._is_gil_enabled()` if available. The implementation relies heavily on new C macros in the `ceval_gil.c` source file, which conditionally compile locking logic based on the `--disable-gil` flag. Practical Examples * **Data Science**: Running heavy NumPy or pandas transformations across threads without the overhead of inter-process communication. * **AI/ML**: Scaling model inference locally by utilizing all available CPU threads within a single memory space. * **Web Servers**: Handling high-concurrency requests in frameworks like FastAPI more efficiently. Tips & Gotchas Removing the GIL is not a free lunch. Single-threaded performance in early no-GIL builds may actually decrease due to the overhead of new thread-safety mechanisms like biased reference counting. Furthermore, many third-party C extensions assume the GIL protects them; running these in a no-GIL environment can lead to race conditions or segmentation faults. Always test your dependency tree before migrating to a free-threaded build.
Aug 2, 2024
// ArjanCodes
Overview Designing a REST API involves more than just selecting a framework or hosting on a cloud provider. A truly great API acts as a seamless interface that developers enjoy using, yet even major tech companies often fail at basic usability. This guide explores the architectural decisions and best practices that transform a functional API into a professional-grade product, focusing on standards, consistency, and the implementation of advanced features like metadata merging. Prerequisites To get the most out of this tutorial, you should have a solid grasp of **Python**, basic **HTTP methods** (GET, POST, etc.), and the fundamentals of **JSON**. Familiarity with **FastAPI** and **SQLAlchemy** will help when we dive into the code walkthrough for custom data handling. Key Libraries & Tools * OpenAPI: The industry-standard specification for describing and documenting RESTful APIs. * FastAPI: A modern, high-performance Python web framework that automatically generates OpenAPI schemas. * SQLAlchemy: A powerful Python SQL Toolkit and ORM used here to manage database models. * Pydantic: Data validation and settings management using Python type annotations. Mastering Standards and Consistency Standards provide a shared vocabulary between the provider and the consumer. Adopting the OpenAPI specification allows you to generate interactive documentation automatically. This transparency reduces the friction of integration. Beyond documentation, adherence to REST naming conventions—using plural nouns like `/customers` instead of singular `/customer`—creates a predictable environment. Consistency is the hallmark of a mature API. If your `/orders` endpoint returns a `payer_id`, your `/invoices` endpoint should not suddenly switch to calling that same entity a `recipient` without an ID. Map out your resource relationships early. Ensure that pagination, error handling, and date formats remain uniform across every single endpoint. Code Walkthrough: Implementing Stripe-Style Metadata One of the most powerful features for third-party integration is the ability to store custom metadata. This allows users to link your resources to IDs in their other systems (like an accounting ID or a CRM link). The Base Model with Custom Data Magic In this implementation, we use SQLAlchemy and Pydantic to create a base class that handles "merge" logic for metadata, similar to the Stripe API. ```python import json from sqlalchemy.orm import declarative_base from pydantic import BaseModel, validator class BaseCustomData: def update_custom_data(self, new_data: dict): # 1. Load existing stringified JSON from the DB current_data = json.loads(self.custom_data or "{}") # 2. Iterate and merge logic for key, value in new_data.items(): if value is None: current_data.pop(key, None) # Unset if value is null else: current_data[key] = value # Merge new key-values # 3. Save back as string self.custom_data = json.dumps(current_data) ``` Explanation of the Logic * **The Merge Operation**: Instead of overwriting the entire `custom_data` field, the method loads the existing JSON, updates specific keys, and preserves others. This prevents accidental data loss during partial updates. * **The Null Deletion Pattern**: By checking if a value is `None`, the API follows the convention where sending `{"my_key": null}` explicitly removes that key from the database. * **Serialization**: We store the data as a string in the database for compatibility but expose it as a dictionary in the API layer for ease of use. Syntax Notes & Best Practices When defining field names, stick to `snake_case`. It is significantly more readable than mashing words together. Furthermore, utilize sensible defaults for arguments. If a user searches for transactions, default the `end_date` to the current time rather than forcing them to provide it. This reduces the cognitive load on the developer using your tool. Tips & Gotchas * **Version Your API**: Always include the version in the URL (e.g., `/v1/`) to prevent breaking changes for existing users. * **Clear Error Bodies**: Don't just return a 400 error. Provide a JSON response body explaining *why* the request failed. * **Navigation**: Ensure resources are interconnected. An order object should include a link or ID for the customer, and vice-versa. Avoid creating "data islands" where resources cannot be reached from related objects.
Jul 19, 2024
// ArjanCodes
Overview Building a robust backend is only half the battle; the real challenge lies in creating a reproducible pipeline to move that code from a local machine to a production server. This tutorial covers the end-to-end process of containerizing a FastAPI application, automating the build via GitHub Actions, and hosting it on a Virtual Private Server. By the end, you will understand how to bridge the gap between development and live distribution. Prerequisites To follow along, you should have a baseline understanding of Python and SQLAlchemy. Familiarity with basic terminal commands and Git is essential. You will also need a GitHub account to manage the automation workflows. Key Libraries & Tools - **FastAPI**: A high-performance web framework for building APIs with Python. - **Docker**: A platform for creating lightweight, portable containers that include all software dependencies. - **GitHub Actions**: A CI/CD tool to automate software workflows directly from your repository. - **Uvicorn**: An ASGI server implementation for Python, used to run the web application. - **Poetry**: A tool for dependency management and packaging in Python. Code Walkthrough: The Dockerfile A Docker file serves as the blueprint for your application environment. In this setup, we prioritize clean builds and real-time logging. ```python Use a stable Python base image FROM python:3.11.0 Prevent Python from buffering stdout/stderr ENV PYTHONUNBUFFERED=1 WORKDIR /app Install dependency manager and packages RUN pip install poetry COPY pyproject.toml poetry.lock ./ RUN poetry config virtualenvs.create false && poetry install --no-dev Copy source and expose the API COPY . . EXPOSE 8080 CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8080"] ``` Setting `PYTHONUNBUFFERED` to `1` ensures you can see logs in real-time. Disabling virtual environments inside the container is a best practice because the container itself acts as an isolated environment, removing the need for an extra layer of abstraction. Syntax Notes - **Port Mapping**: When running the container, we map the external server port to the internal container port (e.g., `-p 80:8080`). This redirects public traffic to our internal web server. - **Environment Variables**: Use `ENV` in Docker or GitHub Secrets for sensitive data like API keys and database credentials. Practical Examples This workflow is perfect for microservices. For instance, a weather API like Skypulse can separate its routing logic from CRUD operations. This modularity allows you to reuse the database logic for a command-line tool or a background worker without redeploying the entire web stack. Tips & Gotchas - **Standard Ports**: While we used `8080` for testing, production APIs should use Port `80` (HTTP) or `443` (HTTPS) to avoid blocking by client firewalls. - **SSH Security**: Always use SSH keys and secrets in GitHub Actions rather than hardcoding passwords in your YAML files. - **Database Costs**: Cloud providers often have hidden networking fees. Using a VPS from providers like Hostinger can offer more predictable monthly billing.
May 31, 2024
// ArjanCodes
Atomic Operations with Unit of Work Managing database consistency manually often leads to fragmented data and difficult debugging. The Unit of Work design pattern solves this by acting as a central collection point for multiple operations. Instead of sending every minor update to the database immediately, you gather them into a single logical transaction. This ensures that either every operation succeeds or the system rolls back to its original state, maintaining a reliable "all-or-nothing" integrity. Prerequisites & Tools To implement this pattern effectively, you should understand Python classes and basic database concepts like transactions and commits. While you can build this from scratch, most developers use SQLAlchemy, a powerful Python ORM. SQLAlchemy implements this pattern through its **Session** object, which tracks object changes and coordinates the final write-out. Code Walkthrough: Tracking State A simple implementation requires tracking which entities are new, modified ("dirty"), or slated for deletion. ```python class UnitOfWork: def __init__(self): self.new = [] self.dirty = [] self.removed = [] def register_new(self, entity): self.new.append(entity) def commit(self): # Logic to insert new, update dirty, and delete removed # If any step fails, trigger a rollback here pass ``` In this structure, calling `commit()` is the only way to persist data. This separation allows you to implement a rollback mechanism in the `except` block of your code, effectively undoing any partial changes that occurred before a failure. Syntax and Best Practices When using SQLAlchemy, utilize **Context Managers** (`with` statements) to handle your sessions. This ensures the session closes properly and provides a clean scope for your Unit of Work. If you need an ID from a newly created record before the final commit, use `session.flush()`. This pushes changes to the database buffer without ending the transaction, allowing the database to assign IDs while still permitting a full rollback if a later step fails. Beyond the Database This pattern isn't just for SQL. Think about **Infrastructure as Code**; if you provision a server but the database setup fails, you want to roll back the server creation so you aren't billed for unused resources. Similarly, file sync utilities like Dropbox use this logic to ensure a corrupt or partial file never replaces a healthy one during a network flicker.
May 10, 2024
// ArjanCodes
Overview of Modern Database Interaction SQLAlchemy transforms how developers interact with databases by acting as a sophisticated bridge between relational tables and Python's object-oriented nature. This tool provides an Object Relational Mapper (ORM) that allows you to manipulate database rows as if they were standard Python objects. By abstracting the complexities of raw SQL, it enables cleaner code, better maintainability, and the ability to switch between database backends like SQLite, MySQL, and PostgreSQL with minimal configuration changes. Prerequisites To follow this guide, you should possess a solid grasp of Python fundamentals, including classes and decorators. Familiarity with basic SQL concepts—such as primary keys, foreign keys, and JOIN operations—is necessary. You will also need a Python environment with the library installed via `pip install sqlalchemy`. Key Libraries & Tools * **SQLAlchemy Core**: The foundation providing the SQL Expression Language and database connectivity. * **SQLAlchemy ORM**: The high-level API that maps Python classes to database tables. * **SessionMaker**: A factory for producing `Session` objects to manage database transactions. * **Mapped & mapped_column**: Type-hinting utilities used in modern SQLAlchemy (2.0+) for defining schema. Code Walkthrough: The Object-Oriented Approach Modern database design favors the ORM approach for its readability. First, define your engine and base class: ```python from sqlalchemy import create_engine from sqlalchemy.orm import DeclarativeBase, Mapped, mapped_column, sessionmaker engine = create_engine("sqlite:///:memory:", echo=True) Session = sessionmaker(bind=engine) class Base(DeclarativeBase): pass class User(Base): __tablename__ = "users" id: Mapped[int] = mapped_column(primary_key=True) username: Mapped[str] = mapped_column(unique=True) email: Mapped[str] ``` In this snippet, `DeclarativeBase` serves as the registry for your schema. The `Mapped` type hints allow IDEs to provide better autocompletion. To interact with the data, use a session context manager: ```python with Session() as session: new_user = User(username="dev_expert", email="[email protected]") session.add(new_user) session.commit() ``` Complex Relationships and Data Logic SQLAlchemy shines when handling related data. You can define one-to-many relationships using the `relationship` function and `back_populates` to ensure bidirectional synchronization. For instance, a `User` class might have a `posts` attribute that automatically fetches all related entries from a `Post` table. By adding custom methods to these classes, you can encapsulate business logic—like password hashing or permission checks—directly within the data model. Tips & Gotchas Always utilize context managers for sessions to prevent hanging connections. If you encounter performance bottlenecks, enable `echo=True` in your engine configuration to audit the generated SQL. One common pitfall is forgetting to call `session.commit()`; without it, your changes exist only in memory and will vanish once the session closes.
Apr 5, 2024
// ArjanCodes
Building a FastAPI backend that works on your local machine is one thing; building one that survives the rigors of production is quite another. Most tutorials show you how to throw everything into a single `main.py` file and call it a day. That approach is a recipe for disaster. As your codebase grows, it becomes an unmaintainable knot of logic that is nearly impossible to test or extend. To move beyond the basics, you must adopt patterns that emphasize modularity, separation of concerns, and robust infrastructure. Organize Your Project with APIRouter When you start a project, it's easy to keep every endpoint in one file. It's convenient, until it's not. Once you have items, users, and automations all fighting for space in the same script, readability vanishes. The solution is the APIRouter mechanism. Think of it as a way to create mini-applications that you later stitch together into the main app. By grouping related endpoints into separate router files, you gain immediate clarity. For instance, an `items_router` can handle everything under the `/items` prefix, while an `automations_router` handles `/automations`. This isn't just about aesthetics; it allows you to apply different middleware or dependencies to specific groups of routes without polluting the rest of your system. In your main file, you simply use `app.include_router()` to register these modules. It keeps the entry point of your application clean and focused on high-level configuration. Decouple Business Logic from Routing A common mistake in FastAPI development is writing heavy database queries or complex logic directly inside the endpoint function. This creates a tight coupling between the transport layer (HTTP) and your business logic. If you ever want to trigger those same actions from a command-line interface or a background task, you're stuck copy-pasting code. Instead, move your operations into dedicated service functions. Your endpoint should be a thin wrapper that handles the request, calls a service function, and returns a response. I recommend using Pydantic models for data validation and SQLAlchemy for database interactions, but keep them at arm's length. Your service functions should ideally accept simple Python objects or IDs and return domain models. This makes unit testing significantly easier because you can test the logic in isolation without spinning up a full uvicorn server. Adopt Infrastructure as Code Early Deployment shouldn't be an afterthought. Many developers wait until the app is "finished" to think about how it will run in the cloud, only to realize their dependencies or environment variables are a mess. I prefer a "scaffolding first" approach. Before writing the core features, set up your Docker container and a basic CI/CD pipeline. Using an Infrastructure as Code (IaC) tool like Pulumi changes the game. It allows you to define your cloud resources using Python instead of static YAML files. This is incredibly powerful for dynamic workflows. For example, if you need to run untrusted automation code, you can use Pulumi to dynamically spin up a serverless sandbox, execute the code, and destroy the environment immediately after. This level of automation ensures your production environment is reproducible and keeps your backend secure from side-effect-heavy tasks. Secure Your Endpoints with Rate Limiting Production apis face threats that local dev environments don't: abuse and brute-force attacks. While OAuth2 and API keys are vital for authentication, they don't stop a legitimate user from accidentally (or intentionally) flooding your server with thousands of requests. FastAPI doesn't include built-in rate limiting, so you need a tool like slowapi. Implementing a rate limiter allows you to set specific thresholds, such as one request per second for sensitive endpoints like item creation, while allowing higher limits for read-only operations. It typically identifies users by their IP address using a `get_remote_address` helper. Integrating this early prevents your database from being overwhelmed and ensures that your service remains available for everyone, even when one client goes rogue. Building for production is about anticipating growth and protecting your resources. By splitting your routes, isolating your logic, automating your infrastructure, and limiting access, you move from a prototype to a professional-grade backend.
Jan 12, 2024
// ArjanCodes
Overview Testing simple functions is straightforward, but API testing that involves a database often leaves developers frustrated. When your application relies on a persistent storage layer, you cannot simply run tests against your production data without risking corruption or inconsistencies. This tutorial explores how to implement a robust testing strategy for FastAPI applications. We focus on decoupling your database logic from your endpoints using dependency injection, allowing you to swap a real database for a lightning-fast, in-memory SQLite instance during test execution. Prerequisites To get the most out of this guide, you should be comfortable with Python basics and the REST architectural style. Familiarity with FastAPI and SQLAlchemy is recommended. You will also need pytest installed in your environment to run the test suite. Key Libraries & Tools - **FastAPI**: A modern web framework for building APIs with Python based on standard type hints. - **SQLAlchemy**: The Python SQL toolkit and Object Relational Mapper (ORM) that provides a full suite of enterprise-level persistence patterns. - **Pydantic**: Data validation and settings management using Python type annotations. - **pytest**: A mature full-featured Python testing tool that helps you write better programs. - **SQLite**: A C-language library that implements a small, fast, self-contained, high-reliability, full-featured, SQL database engine. Decoupling the Database with Dependency Injection The biggest hurdle in testing is often "hard-coded" database sessions within endpoints. If your endpoint creates its own session, you cannot easily point it to a test database. We solve this by using FastAPI's dependency injection system. ```python def get_db(): db = SessionLocal() try: yield db finally: db.close() @app.post("/items/") def create_item(item: ItemCreate, db: Session = Depends(get_db)): db_item = DBItem(**item.dict()) db.add(db_item) db.commit() return db_item ``` By passing `db` as a dependency via `Depends(get_db)`, the endpoint no longer cares where the session comes from. This architectural shift is the "secret sauce" that makes the application testable. Setting Up the Test Environment With dependency injection in place, we can now create an in-memory SQLite database specifically for our tests. This ensures tests are isolated and run quickly without leaving behind file-based artifacts. ```python SQLALCHEMY_DATABASE_URL = "sqlite:///:memory:" engine = create_engine(SQLALCHEMY_DATABASE_URL, connect_args={"check_same_thread": false}, poolclass=StaticPool) TestingSessionLocal = sessionmaker(autocommit=False, autoflush=False, bind=engine) def override_get_db(): db = TestingSessionLocal() try: yield db finally: db.close() app.dependency_overrides[get_db] = override_get_db ``` Using `StaticPool` is critical for in-memory SQLite because it ensures all connections share the same underlying memory space. Without it, one part of your test might write data that another part cannot see. Syntax Notes - **Yield Generators**: The `get_db` function uses `yield`. This allows FastAPI to execute the code before the yield to provide the dependency, then finish the code after the yield (like closing the session) once the response is sent. - **Dependency Overrides**: The `app.dependency_overrides` dictionary is a powerful FastAPI feature that allows you to swap out any dependency during testing without touching the original application code. Practical Examples Testing a POST request involves using the `TestClient` to simulate a real user interaction. We assert not just the status code, but also the structure of the returned JSON to ensure the Pydantic models are working correctly. ```python def test_create_item(): response = client.post("/items/", json={"name": "Test Item", "description": "A test"}) assert response.status_code == 200 data = response.json() assert data["name"] == "Test Item" assert "id" in data ``` Tips & Gotchas - **Setup and Tear Down**: Use pytest fixtures to create and drop tables before and after tests. This ensures every test starts with a clean slate. - **Separation of Concerns**: Don't mix database logic with route logic. Move database operations into a separate `crud.py` or `operations.py` file. This allows you to unit test the database logic independently of the API routes. - **Static Connection Pools**: If you use SQLite in-memory, always set `poolclass=StaticPool` to avoid "table not found" errors during concurrent test execution.
Oct 27, 2023
// ArjanCodes
The Promise of Autonomous Scaffolding Coding assistants have evolved from simple autocomplete tools to complex agents like GPT Engineer. This tool doesn't just suggest a line of code; it attempts to build entire applications from a single prompt. By asking clarifying questions before touching the keyboard, it simulates a discovery phase usually reserved for human developers. It aims to bridge the gap between high-level intent and a structured file system. While some dismiss it as a novelty, the practical reality lies in how it handles the friction of starting from zero. Model Showdown: GPT-4 vs GPT-3.5 The core of the experience depends heavily on the underlying model. Testing both reveals a stark contrast in architectural quality. GPT-4 tends to produce more sophisticated designs, utilizing dictionaries and cleaner abstractions to handle logic. Conversely, GPT-3.5 often falls back on brittle "if-else" chains and heavy code duplication. While 3.5 offers superior speed, the technical debt it generates makes it a risky choice for anything beyond the simplest scripts. Even with the advanced logic of 4.0, the generated code often requires manual intervention to fix outdated dependencies or missing imports. The Boilerplate Sweet Spot Where GPT Engineer truly shines is in generating boilerplate for modern frameworks like FastAPI or Flask. Instead of manually setting up SQLAlchemy models and CRUD endpoints, you can prompt the tool to scaffold the entire structure in minutes. This effectively replaces the "copy-paste from documentation" phase of development. It even assists in discovery; for instance, it might introduce you to libraries like pytube for specific tasks you haven't tackled before. It acts as a highly efficient template engine that understands context. The Reality of Maintenance and Integration The tool's biggest hurdle remains its inability to work within an existing codebase. As it stands, GPT Engineer is largely a greenfield specialist. It lacks the "edit" or "iterate" loop necessary for long-term project maintenance. Developers must still possess the skills to review, refactor, and integrate these AI-generated snippets into larger, complex systems. Without a solid understanding of software design, a user might end up with a functional application that is impossible to maintain. Final Verdict GPT Engineer is not a replacement for a software engineer, but it is a powerful accelerator for the initial stages of development. It excels at turning a concept into a working prototype and handling the tedious setup of API structures. If you need to spin up a microservice or explore a new library, it's an excellent companion. Just keep your refactoring tools sharp; you'll need them once the AI finishes its first draft.
Sep 22, 2023
// ArjanCodes
The Burden of Database Connectivity Software developers face a critical fork in the road when starting a new project: how to communicate with the database. This decision isn't just about syntax; it impacts long-term maintainability, security, and developer velocity. Picking the wrong abstraction layer creates technical debt that is notoriously difficult to migrate later. We must evaluate three distinct philosophies: the direct control of Raw SQL, the structured safety of SQL Query Builders, and the heavy abstraction of Object-Relational Mapping. The Raw Power of Direct Queries Writing Raw SQL remains the gold standard for performance and flexibility. By bypassing abstraction layers, developers access the full breadth of a database's native features. This approach forces a deep understanding of the relational model, which is a vital skill for any engineer. However, the risks are significant. Manual string concatenation often leads to SQL Injection vulnerabilities. Even with parameterized queries using placeholders like "?" or named dictionaries, managing hundreds of individual `.sql` files becomes a logistical nightmare as the application grows. Object-Relational Mapping: The Abstraction Heavyweight An ORM, such as SQLAlchemy for Python, creates a virtual object database on top of the relational one. The primary benefit is the tight integration with IDE features; since tables are defined as classes, you get robust autocomplete and type checking. This prevents many runtime errors by catching invalid field names during the coding phase. The downside is the "leaky abstraction." When an ORM generates inefficient queries or fails to support complex window functions, developers often find themselves fighting the tool rather than using it. Furthermore, frequent schema changes in data-heavy projects require constant, manual updates to the class-based models. The Middle Path: Query Builders SQL Query Builders like PyPika or Knex.js offer a compromise. They provide a programmatic way to construct queries using method chaining (e.g., `.select().from().where()`) without forcing a rigid object-oriented schema. This retains the flexibility of Raw SQL while shielding the application from injection attacks. While they lack the deep type-safety of an ORM, they excel in scenarios where queries must be dynamically generated based on complex user input. The Final Verdict Choosing the right tool depends on your project's volatility. For rapid prototyping or standard CRUD applications, the ORM provides the best developer experience through its type-aware schema. For high-performance systems or data science experiments where schemas shift daily, the flexibility of Raw SQL or a Query Builder is superior. While SQLAlchemy proves surprisingly versatile, never underestimate the power of knowing exactly what query hits your database.
Mar 31, 2023