Aon infotech

Posted on Jun 17

Image Optimization for AI-Generated Content — WebP, LCP, and Layout Shift

#nextjs #performance #ai #webdev

AI-generated images create a specific performance problem most optimization guides don't cover: you don't know the image dimensions, format, or content until generation completes — but you still need to avoid layout shift and slow loading.

Here's how I approached it building free AI image generator no sign up unlimited.

The Core Problem

Standard image optimization assumes you know the image ahead of time — you can pre-calculate dimensions, generate responsive srcsets, and optimize at build time.

Generated images break this assumption. The image doesn't exist until the user requests it. You can't pre-optimize what doesn't exist yet.

Format Selection — Why WebP Wins

Raw inference output is typically PNG. For delivery, that's the wrong choice.

PNG: lossless, larger files, ~800KB-1.2MB typical
WebP: ~25-35% smaller than PNG at equivalent quality
AVIF: smaller still, but inconsistent browser support

The conversion step matters:

// Conceptual flow — actual implementation varies by stack
async function optimizeForDelivery(rawImageBuffer) {
  const optimized = await convertToWebP(rawImageBuffer, {
    quality: 85, // Sweet spot for generated content
  });

  return optimized;
}

Why quality 85, not 100: AI-generated images already have inherent texture noise from the generation process. Higher compression quality settings show diminishing returns — the difference between 85 and 100 is rarely visible but adds significant file size.

The Aspect Ratio Pre-Sizing Problem

Users select aspect ratios (1:1, 16:9, 9:16, 4:3) before generating. This is actually an optimization advantage — you know the shape before the image exists.

// Pre-size the container based on selected ratio
const aspectRatioClasses = {
  '1:1': 'aspect-square',
  '16:9': 'aspect-video', 
  '9:16': 'aspect-[9/16]',
  '4:3': 'aspect-[4/3]',
};

function ImageContainer({ ratio, children }) {
  return (
    <div className={`w-full ${aspectRatioClasses[ratio]} 
      rounded-2xl overflow-hidden bg-neutral-100`}>
      {children}
    </div>
  );
}

This single decision eliminates cumulative layout shift (CLS) — the container exists at the correct dimensions before the image arrives.

LCP — The Generated Image IS the LCP Element

For a generation tool, Largest Contentful Paint is almost always the output image. This makes LCP optimization unusually important compared to typical content sites.

Three changes that mattered most:

1. Priority loading hint

<Image
  src={generatedImageUrl}
  alt={altText}
  width={1024}
  height={1024}
  priority // Tells the framework this is critical
/>

2. Preconnect to image delivery domain

<link rel="preconnect" href="https://your-cdn-domain.com" />

This shaves connection setup time off the critical path — DNS lookup, TLS handshake happen before the image request fires.

3. Skeleton matching final dimensions

{isGenerating && (
  <div className="w-full aspect-square bg-neutral-100 
    dark:bg-neutral-800 animate-pulse rounded-2xl" />
)}

The skeleton occupies the exact space the final image will fill — zero shift when it arrives.

Caching Strategy for Generated Content

Generated images present an interesting caching question: many users generate similar or identical prompts. Should you cache?

Arguments for caching:

Identical prompts (rare but happens) return instantly
Reduces redundant inference costs Arguments against:
Generation should feel "live" — caching undermines the perception of fresh creation
Storage costs accumulate
Privacy consideration — should identical prompts from different users return the same cached image? The approach I landed on: Cache at the CDN layer for delivery speed (repeated requests for the same generated image load fast), but don't cache at the inference layer (every generation request produces a fresh image, even for identical prompts).

// CDN-level caching headers
{
  'Cache-Control': 'public, max-age=31536000, immutable',
  // Each generated image gets a unique URL,
  // so caching is safe — it's never going to change
}

Since each generated image gets a unique identifier, aggressive CDN caching is safe — the URL only exists once a specific image has been generated.

Lazy Loading Considerations

Standard lazy loading wisdom (loading="lazy") doesn't apply to the primary generated image — it's above the fold and needs immediate loading.

It does apply to secondary content — example galleries, previous generations in a session history, related content sections.

{/* Primary generated image — eager load */}
<Image src={currentImage} priority />

{/* Gallery of examples below the fold — lazy load */}
{exampleImages.map(img => (
  <Image key={img.id} src={img.url} loading="lazy" />
))}

Results

Metric	Before optimization	After
Average delivered size	~900KB	~200KB
LCP (median)	4.1s	1.9s
CLS	Present	Eliminated

The CLS elimination came almost entirely from pre-sizing containers based on selected aspect ratio — a decision that's available specifically because users choose dimensions before generation starts.

TL;DR

Convert to WebP at ~85 quality for generated content — diminishing returns above that
Pre-size containers using the aspect ratio selected before generation
Mark the generated image as priority — it's almost always your LCP element
CDN-cache aggressively since each generated image has a unique, immutable URL
Reserve lazy loading for below-the-fold secondary content only For the architecture decisions behind the generation pipeline itself, I covered that in a separate post here.

What's your approach to optimizing dynamically generated content? Comments open.

DEV Community