Database Design 101: A Comprehensive Guide
A comprehensive guide to database design fundamentals, covering core concepts, normalization, indexing strategies, and best practices for modern web development in 2025.
In the rapidly evolving landscape of web development, database design remains a cornerstone skill in 2025. Whether you are building a small personal project or a large-scale enterprise application, understanding the nuances of database architecture is essential for writing clean, efficient, and maintainable code.
This guide takes you from basic concepts to advanced techniques, with real-world examples and code snippets you can apply immediately.
Why Database Design Matters in 2025
The Foundation of Modern Applications
Every web application relies on data. Whether you're building a simple blog or a complex e-commerce platform, how you structure and manage your data determines:
- Performance: Well-designed databases execute queries in milliseconds, not seconds
- Scalability: Proper architecture allows your application to grow without major refactoring
- Maintainability: Clear data relationships make updates and bug fixes easier
- Data Integrity: Proper constraints ensure your data remains accurate and consistent
Real-World Impact
Consider a simple e-commerce site:
Poor Design:
CREATE TABLE orders (
id INT PRIMARY KEY,
customer_name VARCHAR(255),
customer_email VARCHAR(255),
customer_address TEXT,
product_name VARCHAR(255),
product_price DECIMAL(10,2),
quantity INT,
order_date DATETIME
);
Problems:
- Customer information repeated for every order
- Can't have multiple products per order
- Customer address updates require modifying multiple rows
- Difficult to query customer order history efficiently
Good Design:
CREATE TABLE customers (
id INT PRIMARY KEY,
name VARCHAR(255) NOT NULL,
email VARCHAR(255) UNIQUE NOT NULL,
address TEXT,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE products (
id INT PRIMARY KEY,
name VARCHAR(255) NOT NULL,
price DECIMAL(10,2) NOT NULL,
stock INT DEFAULT 0,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE orders (
id INT PRIMARY KEY,
customer_id INT NOT NULL,
order_date TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
total DECIMAL(10,2),
FOREIGN KEY (customer_id) REFERENCES customers(id)
);
CREATE TABLE order_items (
id INT PRIMARY KEY,
order_id INT NOT NULL,
product_id INT NOT NULL,
quantity INT NOT NULL,
unit_price DECIMAL(10,2) NOT NULL,
FOREIGN KEY (order_id) REFERENCES orders(id),
FOREIGN KEY (product_id) REFERENCES products(id)
);
Benefits:
- Customer information stored once
- Each order can have multiple products
- Single query can get customer order history
- Easy to update customer information
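The same decomposition can be seen in miniature with plain objects. This illustrative JavaScript sketch (all data and names invented here) shows why storing each customer once makes updates and history lookups cheap:

```javascript
// Customers stored once; orders reference them by id.
const customers = new Map([[1, { id: 1, name: 'Ada', address: 'Old St 1' }]]);
const orders = [
  { id: 100, customer_id: 1, total: 25.0 },
  { id: 101, customer_id: 1, total: 40.0 },
];

// Updating the address touches exactly one record...
customers.get(1).address = 'New St 9';

// ...and the full order history, with the current address, is one pass
const history = orders
  .filter((o) => o.customer_id === 1)
  .map((o) => ({ ...o, customer: customers.get(1) }));

console.log(history.length); // 2
console.log(history[0].customer.address); // 'New St 9'
```

In SQL, the same shape is a single JOIN between orders and customers.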
"The only way to go fast, is to go well." — Robert C. Martin
Core Concepts and Architecture
1. Relational vs. Non-Relational Databases
Understanding when to use each type is crucial:
Relational Databases (SQL)
Best for:
- Structured data with clear relationships
- Transactions with ACID compliance
- Complex queries with joins
- Financial data, user management, inventory systems
Popular Options: PostgreSQL, MySQL, SQLite, Microsoft SQL Server
Example Schema:
-- E-commerce relational schema
CREATE TABLE categories (
id SERIAL PRIMARY KEY,
name VARCHAR(255) NOT NULL,
description TEXT
);
CREATE TABLE products (
id SERIAL PRIMARY KEY,
name VARCHAR(255) NOT NULL,
description TEXT,
price DECIMAL(10,2) NOT NULL,
category_id INT REFERENCES categories(id),
stock INT DEFAULT 0,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
-- Complex query with joins
SELECT
p.name,
c.name as category,
COUNT(oi.id) as total_sold
FROM products p
LEFT JOIN categories c ON p.category_id = c.id
LEFT JOIN order_items oi ON p.id = oi.product_id
GROUP BY p.id, c.id
ORDER BY total_sold DESC
LIMIT 10;
Non-Relational Databases (NoSQL)
Best for:
- Flexible, unstructured data
- Rapid schema changes
- High write throughput
- Content management, real-time analytics, IoT data
Popular Options: MongoDB, CouchDB, Amazon DynamoDB, Firebase Firestore
Example Schema:
// MongoDB document example
{
  _id: ObjectId("507f1f77bcf86cd799439011"),
  name: "Wireless Headphones",
  description: "Premium noise-cancelling headphones",
  price: 199.99,
  stock: 150,
  tags: ["audio", "wireless", "noise-cancelling"],
  specs: {
    battery_hours: 30, // stored as a number so range queries compare numerically
    connectivity: ["Bluetooth 5.0", "3.5mm jack"],
    weight: "250g"
  },
  reviews: [
    {
      user_id: ObjectId("507f1f77bcf86cd799439012"),
      rating: 5,
      comment: "Excellent sound quality",
      date: ISODate("2025-01-15")
    }
  ],
  created_at: ISODate("2025-01-01T00:00:00Z")
}
// Flexible query (a string like "30 hours" would compare lexicographically,
// which is why battery life is stored as a number above)
db.products.find({
  "specs.battery_hours": { $gte: 20 },
  "reviews.rating": { $gte: 4 }
}).sort({ price: 1 });
2. Normalization: Organizing Your Data
Normalization reduces data redundancy and improves data integrity.
First Normal Form (1NF)
Rule: Each column contains atomic values, and each record is unique.
Bad:
CREATE TABLE bad_products (
id INT PRIMARY KEY,
name VARCHAR(255),
tags VARCHAR(255) -- "audio,wireless,bluetooth"
);
Good:
CREATE TABLE good_products (
id INT PRIMARY KEY,
name VARCHAR(255) NOT NULL
);
CREATE TABLE product_tags (
id INT PRIMARY KEY,
product_id INT REFERENCES good_products(id),
tag VARCHAR(50) NOT NULL,
UNIQUE(product_id, tag)
);
Second Normal Form (2NF)
Rule: All non-key columns depend on the entire primary key.
Bad:
CREATE TABLE bad_order_items (
order_id INT,
product_id INT,
product_name VARCHAR(255), -- Depends only on product_id
quantity INT,
PRIMARY KEY (order_id, product_id)
);
Good:
CREATE TABLE good_order_items (
order_id INT,
product_id INT,
quantity INT,
PRIMARY KEY (order_id, product_id),
FOREIGN KEY (product_id) REFERENCES products(id)
);
-- product_name is now only in the products table
Third Normal Form (3NF)
Rule: No transitive dependencies (non-key columns don't depend on other non-key columns).
Bad:
CREATE TABLE bad_customers (
id INT PRIMARY KEY,
name VARCHAR(255),
city VARCHAR(255), -- Depends on zip_code, not on the customer id
state VARCHAR(255), -- Depends on zip_code, not on the customer id
zip_code VARCHAR(10)
);
Good:
CREATE TABLE good_customers (
id INT PRIMARY KEY,
name VARCHAR(255),
location_id INT REFERENCES locations(id)
);
CREATE TABLE locations (
id INT PRIMARY KEY,
city VARCHAR(255),
state VARCHAR(255),
zip_code VARCHAR(10) UNIQUE
);
3. Indexing: Speed Up Your Queries
Indexes dramatically improve query performance at the cost of write speed and storage.
When to Create Indexes
-- Index on frequently searched columns
CREATE INDEX idx_products_name ON products(name);
-- Index on foreign keys (speeds up joins; PostgreSQL does not create these automatically)
CREATE INDEX idx_order_items_product_id ON order_items(product_id);
-- Composite index for multiple column queries
CREATE INDEX idx_orders_customer_date ON orders(customer_id, order_date);
-- Unique index for email lookups
CREATE UNIQUE INDEX idx_customers_email ON customers(email);
When NOT to Create Indexes
- Small tables: Full table scan is faster
- Frequently updated columns: Index maintenance overhead
- Low selectivity columns: Columns with few unique values (e.g., boolean flags)
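The "low selectivity" point can be made concrete: a rough proxy is the ratio of distinct values to total rows. This sketch (the threshold and names are invented for illustration, not a hard rule) computes that ratio:

```javascript
// Rough selectivity check: distinct values / total rows.
// Columns with very low selectivity (e.g. boolean flags) rarely
// benefit from a plain B-tree index.
function selectivity(values) {
  return new Set(values).size / values.length;
}

// Illustrative rule of thumb; real planners use per-column statistics
function worthIndexing(values, threshold = 0.01) {
  return selectivity(values) > threshold;
}

// A boolean-like column: 2 distinct values over 100k rows
const flags = Array.from({ length: 100000 }, (_, i) => i % 2);
console.log(selectivity(flags)); // 0.00002: poor index candidate

// An id-like column: every value distinct
const ids = Array.from({ length: 100000 }, (_, i) => i);
console.log(selectivity(ids)); // 1: good index candidate
```

Real query planners rely on gathered statistics (in PostgreSQL, the `pg_stats` view) rather than a fixed threshold, but the intuition is the same.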
Analyzing Query Performance
-- PostgreSQL query analysis
EXPLAIN ANALYZE
SELECT p.name, c.name as category
FROM products p
LEFT JOIN categories c ON p.category_id = c.id
WHERE p.price > 100;
-- Example output (assuming an index exists on price):
-- Index Scan using idx_products_price on products p (cost=0.28..8.29 rows=10 width=64)
-- Index Cond: (price > 100.00)
-- Execution Time: 0.123 ms
Practical Implementation
1. Designing a Social Media Database
Let's design a database for a social media platform with posts, comments, likes, and user relationships.
Schema Design
-- Users table
CREATE TABLE users (
id SERIAL PRIMARY KEY,
username VARCHAR(50) UNIQUE NOT NULL,
email VARCHAR(255) UNIQUE NOT NULL,
password_hash VARCHAR(255) NOT NULL,
full_name VARCHAR(255),
bio TEXT,
profile_image_url VARCHAR(500),
followers_count INT DEFAULT 0,
following_count INT DEFAULT 0,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
-- Posts table
CREATE TABLE posts (
id SERIAL PRIMARY KEY,
user_id INT NOT NULL REFERENCES users(id),
content TEXT NOT NULL,
image_url VARCHAR(500),
likes_count INT DEFAULT 0,
comments_count INT DEFAULT 0,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
-- Comments table
CREATE TABLE comments (
id SERIAL PRIMARY KEY,
post_id INT NOT NULL REFERENCES posts(id) ON DELETE CASCADE,
user_id INT NOT NULL REFERENCES users(id),
content TEXT NOT NULL,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
-- Likes table (many-to-many relationship; a like targets a post or a comment)
CREATE TABLE likes (
id SERIAL PRIMARY KEY,
user_id INT NOT NULL REFERENCES users(id) ON DELETE CASCADE,
post_id INT REFERENCES posts(id) ON DELETE CASCADE,
comment_id INT REFERENCES comments(id) ON DELETE CASCADE,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
CHECK (post_id IS NOT NULL OR comment_id IS NOT NULL)
);
-- PostgreSQL does not allow expressions in a UNIQUE constraint, so enforce
-- "one like per user per target" with a unique expression index instead
CREATE UNIQUE INDEX idx_likes_one_per_target
ON likes (user_id, COALESCE(post_id, 0), COALESCE(comment_id, 0));
-- Follows table (self-referencing many-to-many)
CREATE TABLE follows (
id SERIAL PRIMARY KEY,
follower_id INT NOT NULL REFERENCES users(id) ON DELETE CASCADE,
following_id INT NOT NULL REFERENCES users(id) ON DELETE CASCADE,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
UNIQUE(follower_id, following_id),
CHECK (follower_id != following_id) -- Can't follow yourself
);
Performance Optimizations
-- Indexes for common queries
CREATE INDEX idx_posts_user_id ON posts(user_id);
CREATE INDEX idx_posts_created_at ON posts(created_at DESC);
CREATE INDEX idx_comments_post_id ON comments(post_id);
CREATE INDEX idx_likes_post_id ON likes(post_id);
CREATE INDEX idx_likes_user_id ON likes(user_id);
CREATE INDEX idx_follows_follower_id ON follows(follower_id);
CREATE INDEX idx_follows_following_id ON follows(following_id);
-- Full-text search index for post content
CREATE INDEX idx_posts_content_fts ON posts USING gin(to_tsvector('english', content));
-- Trigger to update counts automatically
CREATE OR REPLACE FUNCTION update_post_counts()
RETURNS TRIGGER AS $$
BEGIN
IF TG_OP = 'INSERT' THEN
IF NEW.post_id IS NOT NULL THEN
UPDATE posts SET comments_count = comments_count + 1 WHERE id = NEW.post_id;
END IF;
ELSIF TG_OP = 'DELETE' THEN
IF OLD.post_id IS NOT NULL THEN
UPDATE posts SET comments_count = comments_count - 1 WHERE id = OLD.post_id;
END IF;
END IF;
RETURN NULL;
END;
$$ LANGUAGE plpgsql;
CREATE TRIGGER trigger_update_post_counts
AFTER INSERT OR DELETE ON comments
FOR EACH ROW EXECUTE FUNCTION update_post_counts();
Common Queries
-- Get user's feed (posts from people they follow)
SELECT
p.id,
p.content,
p.image_url,
p.likes_count,
p.comments_count,
p.created_at,
u.username,
u.profile_image_url
FROM posts p
JOIN users u ON p.user_id = u.id
WHERE p.user_id IN (
SELECT following_id FROM follows WHERE follower_id = $1
) OR p.user_id = $1
ORDER BY p.created_at DESC
LIMIT 20;
-- Search posts by content
SELECT
p.*,
ts_rank_cd(to_tsvector('english', content), query) as rank
FROM posts p,
to_tsquery('english', $1) query
WHERE to_tsvector('english', content) @@ query
ORDER BY rank DESC
LIMIT 20;
-- Get post with comments and likes
SELECT
  p.*,
  COALESCE(
    json_agg(
      json_build_object(
        'id', c.id,
        'content', c.content,
        'user', json_build_object('id', u.id, 'username', u.username),
        'created_at', c.created_at
      )
    ) FILTER (WHERE c.id IS NOT NULL), -- avoids a [null] array for posts with no comments
    '[]'
  ) as comments
FROM posts p
LEFT JOIN comments c ON p.id = c.post_id
LEFT JOIN users u ON c.user_id = u.id
WHERE p.id = $1
GROUP BY p.id;
2. Implementing with Node.js and PostgreSQL
// Database connection setup (using node-postgres)
const { Pool } = require('pg');
const pool = new Pool({
user: 'your_username',
host: 'localhost',
database: 'social_media',
password: 'your_password',
port: 5432,
max: 20, // Maximum pool size
idleTimeoutMillis: 30000,
connectionTimeoutMillis: 2000,
});
// Create post
async function createPost(userId, content, imageUrl = null) {
const client = await pool.connect();
try {
await client.query('BEGIN');
const result = await client.query(
`INSERT INTO posts (user_id, content, image_url)
VALUES ($1, $2, $3)
RETURNING *`,
[userId, content, imageUrl]
);
await client.query('COMMIT');
return result.rows[0];
} catch (error) {
await client.query('ROLLBACK');
throw error;
} finally {
client.release();
}
}
// Get user feed with pagination
async function getUserFeed(userId, page = 1, limit = 20) {
const offset = (page - 1) * limit;
const query = `
SELECT
p.*,
u.username,
u.profile_image_url,
CASE WHEN l.user_id = $1 THEN true ELSE false END as liked
FROM posts p
JOIN users u ON p.user_id = u.id
LEFT JOIN likes l ON l.post_id = p.id AND l.user_id = $1
WHERE p.user_id IN (
SELECT following_id FROM follows WHERE follower_id = $1
) OR p.user_id = $1
ORDER BY p.created_at DESC
LIMIT $2 OFFSET $3
`;
const result = await pool.query(query, [userId, limit, offset]);
return result.rows;
}
// Search posts with full-text search
async function searchPosts(query, userId, page = 1, limit = 20) {
const offset = (page - 1) * limit;
const searchQuery = `
SELECT
p.*,
u.username,
u.profile_image_url,
ts_rank_cd(to_tsvector('english', p.content), query) as rank
FROM posts p
JOIN users u ON p.user_id = u.id,
to_tsquery('english', $2) query
WHERE to_tsvector('english', p.content) @@ query
ORDER BY rank DESC
LIMIT $1 OFFSET $3
`;
const result = await pool.query(searchQuery, [limit, query, offset]);
return result.rows;
}
// Get post with all comments
async function getPostWithComments(postId, userId) {
const client = await pool.connect();
try {
// Get post details
const postResult = await client.query(
`SELECT
p.*,
u.username,
u.profile_image_url,
CASE WHEN l.user_id = $2 THEN true ELSE false END as liked
FROM posts p
JOIN users u ON p.user_id = u.id
LEFT JOIN likes l ON l.post_id = p.id AND l.user_id = $2
WHERE p.id = $1`,
[postId, userId]
);
if (postResult.rows.length === 0) {
return null;
}
const post = postResult.rows[0];
// Get comments with pagination
const commentsResult = await client.query(
`SELECT
c.*,
u.username,
u.profile_image_url
FROM comments c
JOIN users u ON c.user_id = u.id
WHERE c.post_id = $1
ORDER BY c.created_at ASC`,
[postId]
);
post.comments = commentsResult.rows;
return post;
} finally {
client.release();
}
}
// Follow user
async function followUser(followerId, followingId) {
  const client = await pool.connect();
  try {
    await client.query('BEGIN');
    // Create follow relationship (no-op if it already exists)
    const result = await client.query(
      `INSERT INTO follows (follower_id, following_id)
       VALUES ($1, $2)
       ON CONFLICT (follower_id, following_id) DO NOTHING`,
      [followerId, followingId]
    );
    // Only update counts when a row was actually inserted,
    // otherwise repeated follow requests would inflate them
    if (result.rowCount > 0) {
      await client.query(
        'UPDATE users SET following_count = following_count + 1 WHERE id = $1',
        [followerId]
      );
      await client.query(
        'UPDATE users SET followers_count = followers_count + 1 WHERE id = $1',
        [followingId]
      );
    }
    await client.query('COMMIT');
    return { success: true };
  } catch (error) {
    await client.query('ROLLBACK');
    throw error;
  } finally {
    client.release();
  }
}
Common Pitfalls and Best Practices
1. Pitfalls to Avoid
Over-Normalization
-- Too many tables can hurt performance
CREATE TABLE phone_number_types (
id INT PRIMARY KEY,
type VARCHAR(50)
);
CREATE TABLE user_phone_numbers (
id INT PRIMARY KEY,
user_id INT REFERENCES users(id),
phone_type_id INT REFERENCES phone_number_types(id),
phone_number VARCHAR(20)
);
-- Simpler approach for simple cases
CREATE TABLE user_phone_numbers (
id INT PRIMARY KEY,
user_id INT REFERENCES users(id),
phone_type VARCHAR(50), -- Just store the type as a string
phone_number VARCHAR(20)
);
N+1 Query Problem
// BAD: N+1 queries
async function getAllPostsWithUsers() {
const posts = await pool.query('SELECT * FROM posts');
for (const post of posts.rows) {
const user = await pool.query('SELECT * FROM users WHERE id = $1', [post.user_id]);
post.user = user.rows[0];
}
return posts.rows;
}
// GOOD: Single query with JOIN
async function getAllPostsWithUsers() {
const result = await pool.query(
`SELECT p.*, u.username, u.profile_image_url
FROM posts p
JOIN users u ON p.user_id = u.id`
);
return result.rows;
}
Missing Constraints
-- BAD: No validation
CREATE TABLE orders (
id INT PRIMARY KEY,
user_id INT,
total DECIMAL(10,2),
status VARCHAR(50)
);
-- GOOD: Proper constraints
CREATE TABLE orders (
id INT PRIMARY KEY,
user_id INT NOT NULL REFERENCES users(id),
total DECIMAL(10,2) NOT NULL CHECK (total >= 0),
status VARCHAR(50) NOT NULL CHECK (status IN ('pending', 'processing', 'shipped', 'delivered', 'cancelled')),
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
2. Best Practices
Use Transactions for Data Integrity
async function transferMoney(fromUserId, toUserId, amount) {
const client = await pool.connect();
try {
await client.query('BEGIN');
// Check balance (FOR UPDATE locks the row until the transaction ends)
const balanceResult = await client.query(
  'SELECT balance FROM accounts WHERE user_id = $1 FOR UPDATE',
  [fromUserId]
);
if (balanceResult.rows.length === 0) {
  throw new Error('Account not found');
}
if (balanceResult.rows[0].balance < amount) {
  throw new Error('Insufficient funds');
}
// Deduct from sender
await client.query(
'UPDATE accounts SET balance = balance - $1 WHERE user_id = $2',
[amount, fromUserId]
);
// Add to receiver
await client.query(
'UPDATE accounts SET balance = balance + $1 WHERE user_id = $2',
[amount, toUserId]
);
// Record transaction
await client.query(
`INSERT INTO transactions (from_user_id, to_user_id, amount)
VALUES ($1, $2, $3)`,
[fromUserId, toUserId, amount]
);
await client.query('COMMIT');
return { success: true };
} catch (error) {
await client.query('ROLLBACK');
throw error;
} finally {
client.release();
}
}
Use Soft Deletes for Audit Trail
CREATE TABLE users (
id INT PRIMARY KEY,
username VARCHAR(50) UNIQUE NOT NULL,
email VARCHAR(255) NOT NULL,
deleted_at TIMESTAMP NULL,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
-- A plain UNIQUE(email) would stop a deleted user's email from ever being
-- reused; a partial unique index enforces uniqueness for live rows only
CREATE UNIQUE INDEX idx_users_email_active ON users(email) WHERE deleted_at IS NULL;
-- Instead of DELETE, use soft delete
UPDATE users SET deleted_at = NOW() WHERE id = $1;
-- Query for non-deleted users
SELECT * FROM users WHERE deleted_at IS NULL;
Implement Proper Backup Strategy
# PostgreSQL backup
pg_dump -U username -d database_name -f backup_$(date +%Y%m%d).sql
# Automated backup script
#!/bin/bash
BACKUP_DIR="/backups"
DB_NAME="your_database"
DB_USER="your_username"
DATE=$(date +%Y%m%d_%H%M%S)
# Create backup
pg_dump -U "$DB_USER" -d "$DB_NAME" -f "$BACKUP_DIR/backup_$DATE.sql"
# Keep last 7 days only
find "$BACKUP_DIR" -name "backup_*.sql" -mtime +7 -delete
Database Scaling Strategies
1. Vertical Scaling
When to use:
- Small to medium applications
- Simple data models
- Budget constraints
Benefits:
- Simple implementation
- No application changes needed
- Single source of truth
2. Horizontal Scaling (Read Replicas)
-- Master database handles writes
-- Replica databases handle reads
-- Application code example
const writePool = new Pool({ host: 'master-db.example.com' });
const readPool = new Pool({ host: 'replica-db.example.com' });
async function createPost(content) {
// Write to master
return writePool.query('INSERT INTO posts (content) VALUES ($1)', [content]);
}
async function getPosts() {
// Read from replica
return readPool.query('SELECT * FROM posts');
}
3. Sharding
When to use:
- Very large datasets (TB+)
- High write throughput
- Geographic distribution
Example:
// Hash-based sharding: route each user to one of several databases
// (shard hosts below are illustrative)
const shardPools = [0, 1, 2, 3].map(
  (n) => new Pool({ host: `shard${n}-db.example.com` })
);
function getShardId(userId, shardCount) {
  return userId % shardCount;
}
function getShardPool(userId) {
  const shardId = getShardId(userId, shardPools.length);
  return shardPools[shardId];
}
async function getUser(userId) {
  const pool = getShardPool(userId);
  return pool.query('SELECT * FROM users WHERE id = $1', [userId]);
}
Security Best Practices
1. Parameterized Queries
// BAD: SQL injection vulnerability
async function getUser(username) {
const query = `SELECT * FROM users WHERE username = '${username}'`;
return pool.query(query); // If username is "admin' OR '1'='1", this returns all users
}
// GOOD: Parameterized query
async function getUser(username) {
return pool.query('SELECT * FROM users WHERE username = $1', [username]);
}
2. Least Privilege Principle
-- Create application user with limited permissions
CREATE USER app_user WITH PASSWORD 'secure_password';
-- Grant only necessary permissions
GRANT SELECT, INSERT, UPDATE ON ALL TABLES IN SCHEMA public TO app_user;
GRANT USAGE, SELECT ON ALL SEQUENCES IN SCHEMA public TO app_user;
3. Encrypt Sensitive Data
-- Use pgcrypto for encryption
CREATE EXTENSION IF NOT EXISTS pgcrypto;
-- Store encrypted credit card numbers
INSERT INTO payments (
user_id,
encrypted_card_number,
card_last_four
) VALUES (
1,
pgp_sym_encrypt('4111111111111111', 'encryption_key'),
'1111'
);
-- Decrypt when needed
SELECT pgp_sym_decrypt(encrypted_card_number::bytea, 'encryption_key')
FROM payments
WHERE user_id = 1;
Monitoring and Performance
1. Slow Query Logging
-- Enable slow query logging
ALTER SYSTEM SET log_min_duration_statement = 1000; -- Log queries > 1 second
SELECT pg_reload_conf();
2. Connection Pooling
const pool = new Pool({
max: 20, // Maximum number of clients in the pool
min: 2, // Minimum number of clients in the pool
idleTimeoutMillis: 30000,
connectionTimeoutMillis: 2000,
});
3. Database Health Checks
async function healthCheck() {
try {
const result = await pool.query('SELECT 1 as health');
return {
status: 'healthy',
database: 'connected',
timestamp: new Date().toISOString()
};
} catch (error) {
return {
status: 'unhealthy',
database: 'disconnected',
error: error.message,
timestamp: new Date().toISOString()
};
}
}
Frequently Asked Questions (FAQ)
Q: Should I use SQL or NoSQL for my project?
A: Start with SQL if:
- Your data is structured and has clear relationships
- You need ACID transactions
- You're building a traditional web application
Choose NoSQL if:
- Your data is unstructured or schema-flexible
- You need rapid schema changes
- You're handling massive write throughput
Q: When should I denormalize my database?
A: Denormalize when:
- Read performance is critical
- You have heavy reporting requirements
- The cost of joins is too high
Always balance this against data integrity and maintenance complexity.
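The trade-off can be sketched in a few lines: a denormalized counter makes reads cheap, but it can drift from the source rows, which is why denormalized fields usually come paired with a reconciliation job. Everything below is illustrative:

```javascript
// Denormalization in miniature: a stored likes_count avoids counting
// on every read, but must be kept in sync with the source rows.
const post = { id: 1, likes_count: 0 };
const likes = []; // source of truth: one record per like

function addLike(userId) {
  likes.push({ post_id: post.id, user_id: userId });
  post.likes_count += 1; // denormalized counter updated in the same step
}

// Reconciliation: recompute the counter from the source of truth
function reconcile() {
  const actual = likes.filter((l) => l.post_id === post.id).length;
  const drifted = post.likes_count !== actual;
  post.likes_count = actual;
  return drifted;
}

addLike(10);
addLike(11);
post.likes_count = 5; // simulate drift (e.g. a missed rollback)
console.log(reconcile()); // true: drift detected and corrected
console.log(post.likes_count); // 2
```

In SQL, the reconcile step is typically an UPDATE from a COUNT(*) subquery run on a schedule.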
Q: How do I choose between PostgreSQL and MySQL?
A: Both are excellent choices. PostgreSQL offers:
- More advanced features (JSON, full-text search, GIS)
- Better standard compliance
- More extensibility
MySQL offers:
- Simpler learning curve
- Better performance for simple queries
- Wider adoption in web hosting
Q: How do I handle database migrations?
A: Use a migration tool such as Flyway, Liquibase, Alembic, Knex.js, or dbmate; all of them support both PostgreSQL and MySQL. For MongoDB, tools such as migrate-mongo fill the same role.
Always version control your migrations and test them before deploying to production.
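Under the hood, most migration tools share the same core loop: each migration has a version, runs once, and applied versions are recorded so re-running is a no-op. A minimal sketch, using an in-memory stand-in for the database (all names here are invented):

```javascript
// Each migration has a version and an up() step
const migrations = [
  { version: 1, up: (db) => { db.tables.push('users'); } },
  { version: 2, up: (db) => { db.tables.push('posts'); } },
];

function migrate(db) {
  // Skip anything already applied, then run the rest in version order
  const pending = migrations
    .filter((m) => !db.applied.has(m.version))
    .sort((a, b) => a.version - b.version);
  for (const m of pending) {
    m.up(db);
    db.applied.add(m.version); // record so the migration never re-runs
  }
  return pending.length;
}

const db = { tables: [], applied: new Set() };
console.log(migrate(db)); // 2: both migrations applied
console.log(migrate(db)); // 0: idempotent on re-run
console.log(db.tables);   // ['users', 'posts']
```

Real tools persist the applied set in a database table, and also handle down migrations, checksums, and locking so two deploys cannot migrate concurrently.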
Conclusion
Mastering database design is more than just learning SQL syntax; it's about understanding data relationships, performance trade-offs, and scalability considerations.
Key takeaways:
- Normalize first, denormalize later: Start with 3NF, then optimize based on actual performance needs
- Index strategically: Create indexes based on query patterns, not on every column
- Use constraints: Validate data at the database level
- Monitor performance: Regularly analyze slow queries and optimize accordingly
- Plan for growth: Design your schema with scalability in mind
Remember: Good database design is the foundation of performant, maintainable applications. Take the time to design it right, and your future self will thank you.
Happy coding!