Part I laid out the theory: declare, compile, execute. Now we turn that blueprint into code. Three iterations, each one building on the last: v1 lays the scaffold, v2 adds dependency-driven execution order (topological sort, pass culling, and automatic barriers), and v3 introduces lifetime analysis so non-overlapping resources can share the same heap. Let’s build it.
Architecture & API Decisions
We start from the API you want to write and build toward it: bare scaffolding first, automatic barriers and memory aliasing last.
classDiagram
direction TB
class FrameGraph{
+ResourceHandle CreateResource(desc)
+ResourceHandle ImportResource(desc, state)
+AddPass(name, setup, execute)
+Read(passIdx, handle)
+Write(passIdx, handle)
+ReadWrite(passIdx, handle)
+CompiledPlan Compile()
+Execute(plan)
-BuildEdges()
-PassIndex[] TopoSort()
-Cull(sortedPasses)
-Lifetime[] ScanLifetimes(sortedPasses)
-BlockIndex[] AliasResources(lifetimes)
-ResourceState StateForUsage(passIdx, handle, isWrite)
-Barrier[][] ComputeBarriers(sortedPasses, mapping)
-EmitBarriers(barriers)
}
class ResourceHandle{
+ResourceIndex index
+bool IsValid()
}
class ResourceDesc{
+uint32 width
+uint32 height
+Format format
}
class Format{
RGBA8
RGBA16F
R8
D32F
}
class RenderPass{
+string name
+function Setup
+function Execute
+ResourceHandle[] reads
+ResourceHandle[] writes
+ResourceHandle[] readWrites
+PassIndex[] dependsOn
+PassIndex[] successors
+uint32 inDegree
+bool alive
}
class ResourceEntry{
+ResourceDesc desc
+ResourceVersion[] versions
+ResourceState currentState
+bool imported
}
class ResourceVersion{
+PassIndex writerPass
+PassIndex[] readerPasses
}
class ResourceState{
Undefined
ColorAttachment
DepthAttachment
ShaderRead
UnorderedAccess
Present
}
class CompiledPlan{
+PassIndex[] sorted
+BlockIndex[] mapping
+Barrier[][] barriers
}
class Barrier{
+ResourceIndex resourceIndex
+ResourceState oldState
+ResourceState newState
+bool isAliasing
+ResourceIndex aliasBefore
}
class Lifetime{
+PassIndex firstUse
+PassIndex lastUse
+bool isTransient
}
class PhysicalBlock{
+uint32 sizeBytes
+PassIndex availAfter
}
FrameGraph *-- RenderPass : owns passes
FrameGraph *-- ResourceEntry : owns resources
FrameGraph ..> CompiledPlan : produces
FrameGraph ..> ResourceHandle : returns
FrameGraph ..> Lifetime : computes per resource
FrameGraph ..> PhysicalBlock : allocates from free-list
RenderPass --> ResourceHandle : reads/writes
ResourceEntry *-- ResourceDesc : describes
ResourceEntry *-- ResourceVersion : tracks per version
ResourceEntry --> ResourceState : current state
ResourceDesc --> Format : pixel format
CompiledPlan *-- Barrier : pre-pass transitions
Barrier --> ResourceState : old/new state
style ResourceHandle stroke:#d97706,stroke-width:2.5px
style ResourceDesc stroke:#d97706,stroke-width:2.5px
style Format stroke:#d97706,stroke-width:2.5px
style RenderPass stroke:#d97706,stroke-width:2.5px
style ResourceEntry stroke:#6366f1,stroke-width:2.5px
style ResourceVersion stroke:#6366f1,stroke-width:2.5px
style ResourceState stroke:#6366f1,stroke-width:2.5px
style Barrier stroke:#6366f1,stroke-width:2.5px
style CompiledPlan stroke:#6366f1,stroke-width:2.5px
style Lifetime stroke:#16a34a,stroke-width:2.5px
style PhysicalBlock stroke:#16a34a,stroke-width:2.5px
Design choices
The three-phase model from Part I forces eight API decisions. Every choice is driven by the same question: what does the graph compiler need, and what’s the cheapest way to give it?
| # | Question | Our pick | Why | Alternative |
|---|---|---|---|---|
| DECLARE: how passes and resources enter the graph | | | | |
| ① | How does setup talk to execute? | Lambda captures | Zero boilerplate: handles live in scope, both lambdas capture them directly. | Type-erased pass data: AddPass<PassData>(setup, exec). Decouples setup/execute across TUs. |
| ② | Where do DAG edges come from? | Explicit fg.Read/Write(pass, h) | Every edge is an explicit call, easy to grep and debug. | Scoped builder: builder.Read/Write(h) auto-binds to the current pass. Prevents mis-wiring at scale. |
| ③ | What is a resource handle? | Plain uint32_t index | One integer, trivially copyable. No templates, no overhead. | Typed wrappers: FRDGTextureRef / FRDGBufferRef. Compile-time safety for 700+ passes (UE5). |
| COMPILE: what the graph analyser decides | | | | |
| ④ | Is compile explicit? | Yes: Compile()→Execute(plan) | Returned plan struct lets you log, validate, and visualise the DAG. Invaluable while learning. | Implicit: Execute() compiles internally. Simpler call site, less ceremony. |
| ⑤ | How does culling find the root? | Last sorted pass | Zero config: Present is naturally last in topo order. Breaks with multiple output roots; add a NeverCull flag when you need them. | Write-to-imported heuristic + NeverCull flags. Supports multiple output roots. |
| ⑥ | Queue model? | Single graphics queue | Keeps barrier logic to plain resource state transitions. No cross-queue barriers. Multi-queue is a compiler feature layered on top; clean upgrade path. | Multi-queue + async compute. 10–30% GPU uplift but needs fences & cross-queue barriers. Part III. |
| ⑦ | Rebuild frequency? | Full rebuild every frame | You need a significantly more complex frame before this becomes visibly heavy. For an MVP, full rebuild is fine. | Cached topology: re-compile only on structural change. Near-zero steady-state cost but complex invalidation logic. |
| EXECUTE: how the compiled plan becomes GPU commands | | | | |
| ⑧ | Recording strategy? | Single command list | Sequential walk: trivial to implement and debug. CPU cost is noise at ~25 passes. Swap to parallel deferred command lists when pass count exceeds ~60. | Parallel command lists: one per pass group, recorded across threads. Scales to 100+ passes (UE5). |
The Target API
With those choices made, here’s where we’re headed: the complete API.
api_demo.cpp
// Frame Graph — API demo (4 passes, imported backbuffer)
#include "frame_graph_v3.h"
int main()
{
FrameGraph fg;
// [1] Declare — describe resources and register passes
// Import the swapchain backbuffer — externally owned, not aliased.
auto backbuffer = fg.ImportResource({1920, 1080, Format::RGBA8}, ResourceState::Present);
// Transient resources — graph-owned, eligible for aliasing.
auto depth = fg.CreateResource({1920, 1080, Format::D32F});
auto gbufA = fg.CreateResource({1920, 1080, Format::RGBA8});
auto gbufN = fg.CreateResource({1920, 1080, Format::RGBA8});
auto hdr = fg.CreateResource({1920, 1080, Format::RGBA16F});
fg.AddPass(
"DepthPrepass",
[&]()
{
fg.Write(0, depth);
},
[&](/*cmd*/) { /* draw scene depth-only */ });
fg.AddPass(
"GBuffer",
[&]()
{
fg.Read(1, depth);
fg.Write(1, gbufA);
fg.Write(1, gbufN);
},
[&](/*cmd*/) { /* draw scene to GBuffer MRTs */ });
fg.AddPass(
"Lighting",
[&]()
{
fg.Read(2, gbufA);
fg.Read(2, gbufN);
fg.Write(2, hdr);
},
[&](/*cmd*/) { /* fullscreen lighting pass */ });
fg.AddPass(
"Present",
[&]()
{
fg.Read(3, hdr);
fg.Write(3, backbuffer);
},
[&](/*cmd*/) { /* copy to backbuffer, present */ });
// [2] Compile — topo-sort, cull, alias memory, compute barriers
auto plan = fg.Compile();
// [3] Execute — replay precomputed barriers + run each pass
fg.Execute(plan);
}
v1: The Scaffold
Three types are all we need to start: a ResourceDesc (width, height, format, no GPU handle yet), a ResourceHandle that’s just an index, and a RenderPass with setup + execute lambdas. The FrameGraph class owns arrays of both and runs passes in declaration order. No dependency tracking, no barriers. Just the foundation that v2 and v3 build on.
v1, Resource types (frame_graph_v1.h)
```diff
@@ frame_graph_v1.h, Format, ResourceDesc, ResourceHandle @@
+enum class Format
+{
+    RGBA8,
+    RGBA16F,
+    R8,
+    D32F
+};
+
+struct ResourceDesc
+{
+    uint32_t width  = 0;
+    uint32_t height = 0;
+    Format format   = Format::RGBA8;
+};
+
+using ResourceIndex = uint32_t; // readable alias for resource array indices
+
+// Lightweight handle: index into FrameGraph's resource array, no GPU memory involved.
+struct ResourceHandle
+{
+    ResourceIndex index = UINT32_MAX;
+    bool IsValid() const { return index != UINT32_MAX; }
+};
```
A pass is two lambdas: setup (runs now, wires the DAG) and execute (stored for later, records GPU commands). v1 doesn’t use setup yet, but the slot is there for v2:
v1, RenderPass struct (frame_graph_v1.h)
```diff
@@ frame_graph_v1.h, RenderPass struct @@
+// A render pass: Setup wires the DAG (declares reads/writes), Execute records GPU commands.
+struct RenderPass
+{
+    std::string name;
+    std::function<void()> Setup;               // build the DAG (v1: unused)
+    std::function<void(/*cmd list*/)> Execute; // record GPU commands
+};
```
The FrameGraph class owns two arrays: passes and resources. AddPass runs setup immediately, so the DAG is built during declaration, not lazily. Execute() in v1 just walks passes in order:
v1, FrameGraph class (frame_graph_v1.h)
```diff
@@ frame_graph_v1.h, FrameGraph class @@
+class FrameGraph
+{
+  public:
+    ResourceHandle CreateResource(const ResourceDesc& desc); // transient, graph owns lifetime
+    ResourceHandle ImportResource(const ResourceDesc& desc); // external, e.g. swapchain backbuffer
+
+    // AddPass: runs setup immediately (wires DAG), stores execute for later.
+    template <typename SetupFn, typename ExecFn>
+    void AddPass(const std::string& name, SetupFn&& setup, ExecFn&& exec)
+    {
+        passes.push_back({name, std::forward<SetupFn>(setup), std::forward<ExecFn>(exec)});
+        passes.back().Setup(); // run setup immediately, DAG is built here
+    }
+
+    void Execute(); // v1: just run in declaration order
+
+  private:
+    std::vector<RenderPass> passes;
+    std::vector<ResourceDesc> resources;
+};
```
CreateResource and ImportResource both push a descriptor into the resources array and return a handle. No GPU memory yet. That happens at execute time. In v1 they’re identical, and v2 will differentiate imported resources:
v1, CreateResource / ImportResource
```diff
@@ frame_graph_v1.cpp, CreateResource / ImportResource @@
+// No GPU memory is allocated yet, that happens at execute time.
+ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
+{
+    resources.push_back(desc);
+    return {static_cast<ResourceIndex>(resources.size() - 1)};
+}
+
+ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc)
+{
+    resources.push_back(desc); // v1: same as create (no aliasing yet)
+    return {static_cast<ResourceIndex>(resources.size() - 1)};
+}
```
Execute() is the simplest possible loop: walk passes in declaration order, call each callback, clear everything for the next frame. No compile step, no reordering, just playback:
v1, Execute()
```diff
@@ frame_graph_v1.cpp, Execute() @@
+// v1 execute: just run passes in the order they were declared.
+void FrameGraph::Execute()
+{
+    printf("\n[1] Executing (declaration order -- no compile step):\n");
+    for (auto& pass : passes)
+    {
+        printf("  >> exec: %s\n", pass.name.c_str());
+        pass.Execute(/* &cmdList */);
+    }
+    passes.clear(); // reset for next frame
+    resources.clear();
+}
```
Full source and runnable example:
#pragma once
// Frame Graph MVP v1 -- Declare & Execute
// No dependency tracking, no barriers, no aliasing.
// Passes execute in declaration order.
//
// Compile: clang++ -std=c++17 -o example_v1 example_v1.cpp
// or: g++ -std=c++17 -o example_v1 example_v1.cpp
#include <cstdint>
#include <functional>
#include <string>
#include <vector>
// == Resource description (virtual until compile) ==============
enum class Format
{
RGBA8,
RGBA16F,
R8,
D32F
};
struct ResourceDesc
{
uint32_t width = 0;
uint32_t height = 0;
Format format = Format::RGBA8;
};
using ResourceIndex = uint32_t;
// Handle = typed index into the graph's resource array.
// No GPU memory behind it yet -- just a number.
struct ResourceHandle
{
ResourceIndex index = UINT32_MAX;
bool IsValid() const { return index != UINT32_MAX; }
};
// == Render pass ===============================================
struct RenderPass
{
std::string name;
std::function<void()> Setup; // build the DAG (v1: unused)
std::function<void(/*cmd list*/)> Execute; // record GPU commands
};
// == Frame graph ===============================================
class FrameGraph
{
public:
// Create a virtual resource -- returns a handle, not GPU memory.
ResourceHandle CreateResource(const ResourceDesc& desc);
// Import an external resource (e.g. swapchain backbuffer).
// Barriers are tracked, but the graph does not own its memory.
ResourceHandle ImportResource(const ResourceDesc& desc);
// Register a pass. Setup runs now; execute is stored for later.
template <typename SetupFn, typename ExecFn> void AddPass(const std::string& name, SetupFn&& setup, ExecFn&& exec)
{
passes.push_back({name, std::forward<SetupFn>(setup), std::forward<ExecFn>(exec)});
passes.back().Setup(); // run setup immediately
}
// Compile + execute. v1 is trivial -- just run in declaration order.
void Execute();
private:
std::vector<RenderPass> passes;
std::vector<ResourceDesc> resources;
};
#include "frame_graph_v1.h"
#include <cstdio>
// == FrameGraph implementation =================================
ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
{
resources.push_back(desc);
return {static_cast<ResourceIndex>(resources.size() - 1)};
}
ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc)
{
resources.push_back(desc); // v1: same as create (no aliasing yet)
return {static_cast<ResourceIndex>(resources.size() - 1)};
}
void FrameGraph::Execute()
{
// v1: no compile step -- no sorting, no culling, no barriers.
// Just run every pass in the order it was added.
printf("\n[1] Executing (declaration order -- no compile step):\n");
for (auto& pass : passes)
{
printf(" >> exec: %s\n", pass.name.c_str());
pass.Execute(/* &cmdList */);
}
// Frame over -- clear everything for next frame.
passes.clear();
resources.clear();
}
// Frame Graph MVP v1 -- Usage Example
// Compile: clang++ -std=c++17 -o example_v1 example_v1.cpp
#include "frame_graph_v1.h"
#include "frame_graph_v1.cpp" // single-TU build (Godbolt)
#include <cstdio>
int main()
{
printf("=== Frame Graph v1: Declare & Execute ===\n");
FrameGraph fg;
// Import the swapchain backbuffer — the graph tracks barriers
// but does not own its memory (it lives outside the frame).
auto backbuffer = fg.ImportResource({1920, 1080, Format::RGBA8});
auto depth = fg.CreateResource({1920, 1080, Format::D32F});
auto gbufA = fg.CreateResource({1920, 1080, Format::RGBA8});
auto gbufN = fg.CreateResource({1920, 1080, Format::RGBA8});
auto hdr = fg.CreateResource({1920, 1080, Format::RGBA16F});
fg.AddPass(
"DepthPrepass",
[&]() { /* setup — v1 doesn't use this */ },
[&](/*cmd*/)
{
printf(" draw scene depth-only\n");
});
fg.AddPass(
"GBuffer",
[&]() { /* setup */ },
[&](/*cmd*/)
{
printf(" draw scene -> GBuffer MRTs\n");
});
fg.AddPass(
"Lighting",
[&]() { /* setup */ },
[&](/*cmd*/)
{
printf(" fullscreen lighting pass\n");
});
fg.AddPass(
"Present",
[&]() { /* setup */ },
[&](/*cmd*/)
{
printf(" copy HDR -> backbuffer\n");
});
fg.Execute();
return 0;
}
Compiles and runs: the execute lambdas are stubs, but the scaffolding is real. Every piece we add in v2 and v3 goes into this same FrameGraph class.
What works: the lambda-based pass declaration pattern. You can already compose passes without manual barrier calls (even though barriers are no-ops here).
What's missing: v1 executes passes in declaration order and creates every resource up front. Correct but wasteful. Version 2 adds the graph.
v2: Dependencies & Barriers
Four steps in strict order, each one's output feeding the next: resource versioning builds the dependency edges, topological sort turns them into an execution order, culling drops passes nothing consumes, and barrier insertion precomputes every state transition.
Resource versioning: the data structure
Every write bumps a version number, and every read attaches to the current version. That’s enough to produce precise dependency edges (theory refresher).
The key data structure: each resource entry tracks its current version (incremented on write) and a writer pass index per version. When a pass calls Read(h), the graph looks up the current version’s writer and adds a dependency edge from that writer to the reading pass.
From v1, the core change is in resource tracking. The ResourceDesc array becomes ResourceEntry, each entry carrying a version list and an imported flag. ResourceVersion tracks which pass wrote each version and which passes read it. This is the data Read/Write use to build edges:
v1 → v2, ResourceVersion & ResourceEntry
```diff
@@ frame_graph_v2.h, PassIndex alias, ResourceVersion, ResourceEntry @@
+using PassIndex = uint32_t; // readable alias for pass array indices
+
+struct ResourceVersion
+{
+    PassIndex writerPass = UINT32_MAX;   // which pass wrote this version; each Read() links to it → RAW edge
+    std::vector<PassIndex> readerPasses; // which passes read this version; each Write() links to them → WAR edges
+    bool HasWriter() const { return writerPass != UINT32_MAX; }
+};
+
+// Replaces raw ResourceDesc, now tracks version history per resource.
+struct ResourceEntry
+{
+    ResourceDesc desc;
+    std::vector<ResourceVersion> versions; // version 0, 1, 2...
+    bool imported = false;                 // imported = externally owned (e.g. swapchain)
+};
```
RenderPass gains reads, writes, readWrites (UAV), and dependsOn vectors. The FrameGraph class adds three new methods (Read(), Write(), ReadWrite()) and the internal storage switches from a flat ResourceDesc array to ResourceEntry:
v1 → v2, RenderPass & FrameGraph API changes
```diff
@@ frame_graph_v2.h, RenderPass gets reads/writes/dependsOn @@
 struct RenderPass
 {
     std::string name;
     std::function<void()> Setup;
     std::function<void(/*cmd list*/)> Execute;
+    std::vector<ResourceHandle> reads;
+    std::vector<ResourceHandle> writes;
+    std::vector<ResourceHandle> readWrites; // UAV (explicit)
+    std::vector<PassIndex> dependsOn;
 };
@@ frame_graph_v2.h, FrameGraph adds Read(), Write(), ReadWrite() @@
+    void Read(PassIndex passIdx, ResourceHandle h);
+    void Write(PassIndex passIdx, ResourceHandle h);
+    void ReadWrite(PassIndex passIdx, ResourceHandle h); // UAV access
@@ frame_graph_v2.h, ResourceDesc[] becomes ResourceEntry[] @@
-    std::vector<ResourceDesc> resources;
+    std::vector<ResourceEntry> entries; // now with versioning
```
CreateResource / ImportResource now build ResourceEntry objects (with an empty initial version). The real work is in the three access methods:
v1 → v2, CreateResource / ImportResource updated
```diff
@@ frame_graph_v2.cpp, CreateResource / ImportResource now use ResourceEntry @@
 ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
 {
-    resources.push_back(desc);
-    return {static_cast<ResourceIndex>(resources.size() - 1)};
+    entries.push_back({desc, {{}}});
+    return {static_cast<ResourceIndex>(entries.size() - 1)};
 }
 
 ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc)
 {
-    resources.push_back(desc);
-    return {static_cast<ResourceIndex>(resources.size() - 1)};
+    entries.push_back({desc, {{}}, /*imported=*/true});
+    return {static_cast<ResourceIndex>(entries.size() - 1)};
 }
```
Read() looks up the current version’s writer and adds a RAW (read-after-write) dependency edge: “I need the result of whoever last wrote this.” It also registers itself as a reader so that a future Write() knows who to protect:
v1 → v2, Read()
```diff
@@ frame_graph_v2.cpp, Read() @@
+// Read: look up who last wrote this resource → add a dependency edge from that writer to this pass.
+void FrameGraph::Read(PassIndex passIdx, ResourceHandle h)
+{
+    auto& ver = entries[h.index].versions.back(); // current version
+    if (ver.HasWriter())
+    {
+        passes[passIdx].dependsOn.push_back(ver.writerPass); // RAW edge
+    }
+    ver.readerPasses.push_back(passIdx); // track who reads this version
+    passes[passIdx].reads.push_back(h);  // record for barrier insertion
+}
```
Write() handles the other direction: it adds a WAW (write-after-write) edge from the current version’s writer (if any) and WAR (write-after-read) edges from every reader of the current version (ensuring they all finish before the overwrite), then bumps the version so future reads see the new data:
v1 → v2, Write()
```diff
@@ frame_graph_v2.cpp, Write() @@
+// Write: add WAW edge from prev writer + WAR edges from readers, then bump the version.
+void FrameGraph::Write(PassIndex passIdx, ResourceHandle h)
+{
+    auto& ver = entries[h.index].versions.back(); // current version (pre-bump)
+    if (ver.HasWriter())
+        passes[passIdx].dependsOn.push_back(ver.writerPass); // WAW edge: prev writer must finish
+    for (PassIndex reader : ver.readerPasses)
+        passes[passIdx].dependsOn.push_back(reader); // WAR edge: reader must finish first
+    entries[h.index].versions.push_back({});               // bump version
+    entries[h.index].versions.back().writerPass = passIdx; // this pass owns the new version
+    passes[passIdx].writes.push_back(h); // record for barrier insertion
+}
```
ReadWrite() (UAV) combines both patterns: RAW edge from the previous writer, WAR edges from current readers, version bump, and it pushes the handle into all three lists (reads, writes, readWrites) so the barrier system can identify it as an unordered-access resource:
v1 → v2, ReadWrite() (UAV)
```diff
@@ frame_graph_v2.cpp, ReadWrite() (UAV) @@
+// ReadWrite (UAV): depend on previous writer + WAR edges from readers, then bump version.
+void FrameGraph::ReadWrite(PassIndex passIdx, ResourceHandle h)
+{
+    auto& ver = entries[h.index].versions.back();
+    if (ver.HasWriter())
+    {
+        passes[passIdx].dependsOn.push_back(ver.writerPass); // RAW edge
+    }
+    for (PassIndex reader : ver.readerPasses)
+        passes[passIdx].dependsOn.push_back(reader); // WAR edge
+    entries[h.index].versions.push_back({}); // bump version (it's a write)
+    entries[h.index].versions.back().writerPass = passIdx;
+    passes[passIdx].reads.push_back(h);  // appears in both lists (for barriers + lifetimes)
+    passes[passIdx].writes.push_back(h);
+    passes[passIdx].readWrites.push_back(h); // marks this handle as UAV for StateForUsage
+}
```
Every Write() adds a WAW edge from the previous writer (if any) plus WAR edges from every reader of the current version (so they finish before the overwrite), then bumps the version. Every Read() finds the current version’s writer and records a RAW edge. Together they capture all three data hazards: read-after-write, write-after-read, and write-after-write. Those edges feed the next three steps.
Topological sort (Kahn’s algorithm)
With edges in place, we need an execution order that respects every dependency. Kahn’s algorithm (theory refresher) gives us one in O(V+E). BuildEdges() deduplicates the raw dependsOn entries and builds the adjacency list. TopoSort() does the zero-in-degree queue drain:
v2, Edge deduplication (BuildEdges)
```diff
@@ frame_graph_v2.h, RenderPass gets successors + inDegree (for Kahn's) @@
 struct RenderPass
 {
     ...
+    std::vector<PassIndex> successors; // passes that depend on this one
+    uint32_t inDegree = 0;             // incoming edge count (Kahn's)
 };
@@ frame_graph_v2.cpp, BuildEdges() @@
+// Deduplicate raw dependsOn edges and build forward adjacency list (successors) for Kahn's algorithm.
+void FrameGraph::BuildEdges()
+{
+    for (PassIndex i = 0; i < passes.size(); i++)
+    {
+        std::unordered_set<PassIndex> seen;
+        for (PassIndex dep : passes[i].dependsOn)
+        {
+            if (seen.insert(dep).second) // first time seeing this edge?
+            {
+                passes[dep].successors.push_back(i); // forward link: dep → i
+                passes[i].inDegree++;                // i has one more incoming edge
+            }
+        }
+    }
+}
```
With the adjacency list built, TopoSort() implements Kahn’s zero-in-degree queue drain: any pass whose dependencies are all satisfied gets dequeued next:
v2, Kahn's topological sort
```diff
@@ frame_graph_v2.cpp, TopoSort() @@
+// Kahn's algorithm: dequeue zero-in-degree passes → valid execution order respecting all dependencies.
+std::vector<PassIndex> FrameGraph::TopoSort()
+{
+    std::queue<PassIndex> q;
+    std::vector<uint32_t> inDeg(passes.size());
+    for (PassIndex i = 0; i < passes.size(); i++)
+    {
+        inDeg[i] = passes[i].inDegree;
+        if (inDeg[i] == 0)
+            q.push(i); // no dependencies → ready immediately
+    }
+    std::vector<PassIndex> order;
+    while (!q.empty())
+    {
+        PassIndex cur = q.front();
+        q.pop();
+        order.push_back(cur);
+        for (PassIndex succ : passes[cur].successors)
+        {
+            if (--inDeg[succ] == 0) // all of succ's dependencies done?
+                q.push(succ);       // succ is now ready
+        }
+    }
+    // If we didn't visit every pass, the graph has a cycle, invalid.
+    assert(order.size() == passes.size() && "Cycle detected!");
+    return order;
+}
```
Pass culling
A sorted graph still runs passes nobody reads from. Culling is dead-code elimination for GPU work (theory refresher), using a single backward walk that marks the final pass alive, then propagates through dependsOn edges:
v2, Pass culling
```diff
@@ frame_graph_v2.h, RenderPass gets alive flag @@
 struct RenderPass
 {
     ...
+    bool alive = false; // survives the cull?
 };
@@ frame_graph_v2.cpp, Cull() @@
+// Dead-code elimination: walk backward from the final output pass, marking dependencies alive.
+void FrameGraph::Cull(const std::vector<PassIndex>& sorted)
+{
+    if (sorted.empty())
+        return;
+    passes[sorted.back()].alive = true; // last pass = the final output (e.g. Present)
+    for (int i = static_cast<int>(sorted.size()) - 1; i >= 0; i--)
+    {
+        if (!passes[sorted[i]].alive)
+            continue; // skip dead passes
+        for (PassIndex dep : passes[sorted[i]].dependsOn)
+            passes[dep].alive = true; // my dependency is needed → keep it alive
+    }
+}
```
Barrier insertion
GPUs need explicit state transitions between resource usages: color attachment to shader read, undefined to depth, etc. The graph already knows every resource’s read/write history (theory refresher), so the compiler can figure out every transition before execution starts.
The idea: walk the sorted pass list, compare each resource’s tracked state to what the pass needs, and record a barrier when they differ. This is where we introduce the compile / execute split: Compile() precomputes every transition into a CompiledPlan, and Execute() replays them. No state tracking at execution time, no decisions, just playback.
First, the new types. ResourceState is an enum with six values (Undefined, ColorAttachment, DepthAttachment, ShaderRead, UnorderedAccess, Present). Barrier pairs a resource index with old/new state. ResourceEntry gains a currentState field, and ImportResource takes an initial state so the swapchain can start as Present:
v2, ResourceState, Barrier & ResourceEntry changes
```diff
@@ frame_graph_v2.h, ResourceState enum + Barrier struct @@
+enum class ResourceState
+{
+    Undefined,
+    ColorAttachment,
+    DepthAttachment,
+    ShaderRead,
+    UnorderedAccess,
+    Present
+};
+
+// A single resource-state transition.
+struct Barrier
+{
+    ResourceIndex resourceIndex; // which resource to transition
+    ResourceState oldState;      // state before this pass
+    ResourceState newState;      // state this pass requires
+};
@@ frame_graph_v2.h, ResourceEntry gets currentState @@
 struct ResourceEntry
 {
     ...
+    ResourceState currentState = ResourceState::Undefined;
 };
@@ frame_graph_v2.h, ImportResource() accepts initial state @@
-    ResourceHandle ImportResource(const ResourceDesc& desc);
+    ResourceHandle ImportResource(
+        const ResourceDesc& desc,
+        ResourceState initialState = ResourceState::Undefined);
```
With those types in place, we introduce the compile / execute split. CompiledPlan holds the topological order and a 2D barrier array (barriers[orderIdx] = transitions before that pass). Compile() returns a plan. Execute() replays it. CreateResource / ImportResource gain a fourth field for initial state:
v2, CompiledPlan, Compile/Execute split & updated constructors
```diff
@@ frame_graph_v2.h, CompiledPlan + Compile/Execute split @@
+    struct CompiledPlan
+    {
+        std::vector<PassIndex> sorted;              // topological execution order
+        std::vector<std::vector<Barrier>> barriers; // barriers[orderIdx] → pre-pass transitions
+    };
+
+    CompiledPlan Compile();
+    void Execute(const CompiledPlan& plan);
     void Execute(); // convenience: compile + execute in one call
@@ frame_graph_v2.cpp, CreateResource / ImportResource updated for ResourceState @@
 ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
 {
-    entries.push_back({desc, {{}}});
+    entries.push_back({desc, {{}}, ResourceState::Undefined, false});
     return {static_cast<ResourceIndex>(entries.size() - 1)};
 }
-ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc)
+ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc, ResourceState initialState)
 {
-    entries.push_back({desc, {{}}, /*imported=*/true});
+    entries.push_back({desc, {{}}, initialState, true});
     return {static_cast<ResourceIndex>(entries.size() - 1)};
 }
```
With the type system in place, ComputeBarriers() walks the sorted pass list. For each surviving pass it first infers the required state for every resource the pass touches. IsUAV checks the readWrites list. StateForUsage maps usage to one of the six states (ShaderRead for reads, ColorAttachment or DepthAttachment for writes based on format, UnorderedAccess for UAVs):
v2, ComputeBarriers(), state inference helpers
```diff
@@ frame_graph_v2.cpp, ComputeBarriers() state inference @@
+// Walk sorted passes, compare tracked state to each resource's needed state, record transitions.
+std::vector<std::vector<Barrier>> FrameGraph::ComputeBarriers(const std::vector<PassIndex>& sorted)
+{
+    std::vector<std::vector<Barrier>> result(sorted.size());
+    for (PassIndex orderIdx = 0; orderIdx < sorted.size(); orderIdx++)
+    {
+        PassIndex passIdx = sorted[orderIdx];
+        if (!passes[passIdx].alive)
+            continue;
+
+        auto IsUAV = [&](ResourceHandle h)
+        {
+            for (auto& rw : passes[passIdx].readWrites)
+                if (rw.index == h.index)
+                    return true;
+            return false;
+        };
+        auto StateForUsage = [&](ResourceHandle h, bool isWrite)
+        {
+            if (IsUAV(h))
+                return ResourceState::UnorderedAccess;
+            if (isWrite)
+                return (entries[h.index].desc.format == Format::D32F) ? ResourceState::DepthAttachment
+                                                                      : ResourceState::ColorAttachment;
+            return ResourceState::ShaderRead;
+        };
```
With the required state known, recordTransition compares it to the resource’s tracked currentState. When they differ it records a Barrier (resource index + old/new state) and updates the tracker. Two loops fire it (once for reads, once for writes), producing a 2D vector where barriers[orderIdx] holds every transition that must fire before that pass runs:
v2, ComputeBarriers(), record transitions
```diff
@@ frame_graph_v2.cpp, ComputeBarriers() transition recording @@
+        auto recordTransition = [&](ResourceHandle h, bool isWrite)
+        {
+            ResourceState needed = StateForUsage(h, isWrite);
+            if (entries[h.index].currentState != needed)
+            {
+                result[orderIdx].push_back({h.index, entries[h.index].currentState, needed});
+                entries[h.index].currentState = needed;
+            }
+        };
+        for (auto& h : passes[passIdx].reads)
+            recordTransition(h, false);
+        for (auto& h : passes[passIdx].writes)
+            recordTransition(h, true);
+    }
+    return result;
+}
```
EmitBarriers() replays those transitions on the GPU. In production this maps to vkCmdPipelineBarrier (Vulkan) or ID3D12GraphicsCommandList::ResourceBarrier (D3D12). For our MVP it’s a one-liner stub:
v2, EmitBarriers()
```diff
@@ frame_graph_v2.cpp, EmitBarriers() (replay) @@
+// Replay precomputed transitions, in production this calls the GPU API.
+void FrameGraph::EmitBarriers(const std::vector<Barrier>& barriers)
+{
+    for (auto& b : barriers)
+    {
+        // vkCmdPipelineBarrier / ResourceBarrier
+    }
+}
```
Compile() chains all four stages (edges, sort, cull, barriers) into a CompiledPlan. Execute() walks the plan in order: emit precomputed barriers, call the pass’s execute lambda, repeat. A convenience overload does both in one call:
v2, Compile() & Execute()
```diff
@@ frame_graph_v2.cpp, Compile() + Execute() @@
+// Full compile pipeline: sort → cull → precompute barriers. Returns a self-contained plan.
+FrameGraph::CompiledPlan FrameGraph::Compile()
+{
+    BuildEdges();
+    auto sorted = TopoSort();
+    Cull(sorted);
+    auto barriers = ComputeBarriers(sorted);
+    return {std::move(sorted), std::move(barriers)};
+}
+
+// Pure playback: emit precomputed barriers, call execute lambdas. No analysis.
+void FrameGraph::Execute(const CompiledPlan& plan)
+{
+    for (PassIndex orderIdx = 0; orderIdx < plan.sorted.size(); orderIdx++)
+    {
+        PassIndex passIdx = plan.sorted[orderIdx];
+        if (!passes[passIdx].alive)
+            continue;
+        EmitBarriers(plan.barriers[orderIdx]);
+        passes[passIdx].Execute(/* &cmdList */);
+    }
+    passes.clear();
+    entries.clear();
+}
+
+void FrameGraph::Execute()
+{
+    Execute(Compile());
+}
```
All four pieces (versioning, sorting, culling, barriers) compose into Compile(). Each step feeds the next: versioning creates edges, edges feed the sort, the sort enables culling, and the surviving sorted passes get precomputed barriers. Execute() is pure playback.
Full v2 source#
The full listing includes printf diagnostics (topo-sort order, culling results, barrier transitions) that are omitted from the diffs above to keep the focus on structure. They are invaluable for debugging, so read through them in the source.
#pragma once
// Frame Graph MVP v2 -- Dependencies & Barriers
// Adds: resource versioning, DAG with adjacency list, Kahn's topo-sort,
// pass culling, and automatic barrier insertion.
//
// Compile: clang++ -std=c++17 -o example_v2 example_v2.cpp
// or: g++ -std=c++17 -o example_v2 example_v2.cpp
#include <cstdint>
#include <functional>
#include <string>
#include <vector>
// == Resource description (virtual until compile) ==============
enum class Format
{
RGBA8,
RGBA16F,
R8,
D32F
};
struct ResourceDesc
{
uint32_t width = 0;
uint32_t height = 0;
Format format = Format::RGBA8;
};
using ResourceIndex = uint32_t;
using PassIndex = uint32_t;
struct ResourceHandle
{
ResourceIndex index = UINT32_MAX;
bool IsValid() const { return index != UINT32_MAX; }
};
// == Resource state tracking ====================================
enum class ResourceState
{
Undefined,
ColorAttachment,
DepthAttachment,
ShaderRead,
UnorderedAccess,
Present
};
inline const char* StateName(ResourceState s)
{
switch (s)
{
case ResourceState::Undefined:
return "Undefined";
case ResourceState::ColorAttachment:
return "ColorAttachment";
case ResourceState::DepthAttachment:
return "DepthAttachment";
case ResourceState::ShaderRead:
return "ShaderRead";
case ResourceState::UnorderedAccess:
return "UnorderedAccess";
case ResourceState::Present:
return "Present";
default:
return "?";
}
}
struct ResourceVersion
{
PassIndex writerPass = UINT32_MAX; // which pass wrote this version
std::vector<PassIndex> readerPasses; // which passes read it
bool HasWriter() const { return writerPass != UINT32_MAX; }
};
// A single resource-state transition.
struct Barrier
{
ResourceIndex resourceIndex;
ResourceState oldState;
ResourceState newState;
};
// Extend ResourceDesc with tracking:
struct ResourceEntry
{
ResourceDesc desc;
std::vector<ResourceVersion> versions; // version 0, 1, 2...
ResourceState currentState = ResourceState::Undefined;
bool imported = false; // imported resources are not owned by the graph
};
// == Updated render pass =======================================
struct RenderPass
{
std::string name;
std::function<void()> Setup;
std::function<void(/*cmd list*/)> Execute;
std::vector<ResourceHandle> reads;
std::vector<ResourceHandle> writes;
std::vector<ResourceHandle> readWrites; // UAV (explicit)
std::vector<PassIndex> dependsOn; // passes this pass depends on
std::vector<PassIndex> successors; // passes that depend on this pass
uint32_t inDegree = 0; // for Kahn's
bool alive = false; // for culling
};
// == Updated FrameGraph ========================================
class FrameGraph
{
public:
ResourceHandle CreateResource(const ResourceDesc& desc);
// Import an external resource (e.g. swapchain backbuffer).
// The graph tracks barriers but does not own or alias its memory.
ResourceHandle ImportResource(const ResourceDesc& desc, ResourceState initialState = ResourceState::Undefined);
// Declare a read -- links this pass to the resource's current version.
void Read(PassIndex passIdx, ResourceHandle h);
// Declare a write -- creates a new version of the resource.
void Write(PassIndex passIdx, ResourceHandle h);
// Declare a UAV access -- read + write on the same version.
void ReadWrite(PassIndex passIdx, ResourceHandle h);
template <typename SetupFn, typename ExecFn> void AddPass(const std::string& name, SetupFn&& setup, ExecFn&& exec)
{
passes.push_back({name, std::forward<SetupFn>(setup), std::forward<ExecFn>(exec)});
passes.back().Setup();
}
// == v2: compile — precompute sort, cull, barriers =========
struct CompiledPlan
{
std::vector<PassIndex> sorted; // topological execution order
std::vector<std::vector<Barrier>> barriers; // barriers[orderIdx] → pre-pass transitions
};
CompiledPlan Compile();
void Execute(const CompiledPlan& plan);
// convenience: compile + execute in one call
void Execute();
private:
std::vector<RenderPass> passes;
std::vector<ResourceEntry> entries;
void BuildEdges();
std::vector<PassIndex> TopoSort();
void Cull(const std::vector<PassIndex>& sorted);
std::vector<std::vector<Barrier>> ComputeBarriers(const std::vector<PassIndex>& sorted);
void EmitBarriers(const std::vector<Barrier>& barriers);
};
#include "frame_graph_v2.h"
#include <algorithm>
#include <cassert>
#include <cstdio>
#include <queue>
#include <unordered_set>
// == FrameGraph implementation =================================
ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
{
entries.push_back({desc, {{}}, ResourceState::Undefined, false});
return {static_cast<ResourceIndex>(entries.size() - 1)};
}
ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc, ResourceState initialState)
{
entries.push_back({desc, {{}}, initialState, true});
return {static_cast<ResourceIndex>(entries.size() - 1)};
}
void FrameGraph::Read(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back();
if (ver.HasWriter())
{
passes[passIdx].dependsOn.push_back(ver.writerPass);
}
ver.readerPasses.push_back(passIdx);
passes[passIdx].reads.push_back(h);
}
void FrameGraph::Write(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back(); // current version (pre-bump)
if (ver.HasWriter())
passes[passIdx].dependsOn.push_back(ver.writerPass); // WAW edge: prev writer must finish
for (PassIndex reader : ver.readerPasses)
passes[passIdx].dependsOn.push_back(reader); // WAR edge: reader must finish first
entries[h.index].versions.push_back({});
entries[h.index].versions.back().writerPass = passIdx;
passes[passIdx].writes.push_back(h);
}
void FrameGraph::ReadWrite(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back();
if (ver.HasWriter())
{
passes[passIdx].dependsOn.push_back(ver.writerPass); // RAW edge
}
for (PassIndex reader : ver.readerPasses)
passes[passIdx].dependsOn.push_back(reader); // WAR edge
entries[h.index].versions.push_back({});
entries[h.index].versions.back().writerPass = passIdx;
passes[passIdx].reads.push_back(h);
passes[passIdx].writes.push_back(h);
passes[passIdx].readWrites.push_back(h);
}
// == v2: compile — precompute sort, cull, barriers ============
FrameGraph::CompiledPlan FrameGraph::Compile()
{
printf("\n[1] Building dependency edges...\n");
BuildEdges();
printf("[2] Topological sort...\n");
auto sorted = TopoSort();
printf("[3] Culling dead passes...\n");
Cull(sorted);
printf("[4] Computing barriers...\n");
auto barriers = ComputeBarriers(sorted);
return {std::move(sorted), std::move(barriers)};
}
// == Execute (replay compiled plan) ===========================
void FrameGraph::Execute(const CompiledPlan& plan)
{
printf("[5] Executing (replaying precomputed barriers):\n");
for (PassIndex orderIdx = 0; orderIdx < plan.sorted.size(); orderIdx++)
{
PassIndex passIdx = plan.sorted[orderIdx];
if (!passes[passIdx].alive)
{
printf(" -- skip: %s (CULLED)\n", passes[passIdx].name.c_str());
continue;
}
EmitBarriers(plan.barriers[orderIdx]);
passes[passIdx].Execute(/* &cmdList */);
}
passes.clear();
entries.clear();
}
// convenience: compile + execute in one call
void FrameGraph::Execute()
{
Execute(Compile());
}
// == Build dependency edges ====================================
void FrameGraph::BuildEdges()
{
for (PassIndex i = 0; i < passes.size(); i++)
{
// Deduplicate dependency edges and build successor list.
std::unordered_set<PassIndex> seen;
for (PassIndex dep : passes[i].dependsOn)
{
if (seen.insert(dep).second)
{
passes[dep].successors.push_back(i);
passes[i].inDegree++;
}
}
}
}
// == Kahn's topological sort -- O(V + E) ========================
std::vector<PassIndex> FrameGraph::TopoSort()
{
std::queue<PassIndex> q;
std::vector<uint32_t> inDeg(passes.size());
for (PassIndex i = 0; i < passes.size(); i++)
{
inDeg[i] = passes[i].inDegree;
if (inDeg[i] == 0)
q.push(i);
}
std::vector<PassIndex> order;
while (!q.empty())
{
PassIndex cur = q.front();
q.pop();
order.push_back(cur);
// Walk the adjacency list -- O(E) total across all nodes.
for (PassIndex succ : passes[cur].successors)
{
if (--inDeg[succ] == 0)
q.push(succ);
}
}
assert(order.size() == passes.size() && "Cycle detected!");
printf(" Topological order: ");
for (PassIndex i = 0; i < order.size(); i++)
{
printf("%s%s", passes[order[i]].name.c_str(), i + 1 < order.size() ? " -> " : "\n");
}
return order;
}
// == Cull dead passes (backward walk from output) ==============
void FrameGraph::Cull(const std::vector<PassIndex>& sorted)
{
// Mark the last pass (present) as alive, then walk backward.
if (sorted.empty())
return;
passes[sorted.back()].alive = true;
for (int i = static_cast<int>(sorted.size()) - 1; i >= 0; i--)
{
if (!passes[sorted[i]].alive)
continue;
for (PassIndex dep : passes[sorted[i]].dependsOn)
passes[dep].alive = true;
}
printf(" Culling result: ");
for (PassIndex i = 0; i < passes.size(); i++)
{
printf("%s=%s%s", passes[i].name.c_str(), passes[i].alive ? "ALIVE" : "DEAD", i + 1 < passes.size() ? ", " : "\n");
}
}
// == Precompute barriers (state transitions) ==================
std::vector<std::vector<Barrier>> FrameGraph::ComputeBarriers(const std::vector<PassIndex>& sorted)
{
std::vector<std::vector<Barrier>> result(sorted.size());
for (PassIndex orderIdx = 0; orderIdx < sorted.size(); orderIdx++)
{
PassIndex passIdx = sorted[orderIdx];
if (!passes[passIdx].alive)
continue;
auto IsUAV = [&](ResourceHandle h)
{
for (auto& rw : passes[passIdx].readWrites)
if (rw.index == h.index)
return true;
return false;
};
auto StateForUsage = [&](ResourceHandle h, bool isWrite)
{
if (IsUAV(h))
return ResourceState::UnorderedAccess;
if (isWrite)
return (entries[h.index].desc.format == Format::D32F) ? ResourceState::DepthAttachment : ResourceState::ColorAttachment;
return ResourceState::ShaderRead;
};
auto recordTransition = [&](ResourceHandle h, bool isWrite)
{
ResourceState needed = StateForUsage(h, isWrite);
if (entries[h.index].currentState != needed)
{
result[orderIdx].push_back({h.index, entries[h.index].currentState, needed});
entries[h.index].currentState = needed;
}
};
for (auto& h : passes[passIdx].reads)
recordTransition(h, false);
for (auto& h : passes[passIdx].writes)
recordTransition(h, true);
}
uint32_t total = 0;
for (auto& v : result)
total += static_cast<uint32_t>(v.size());
printf(" Barriers computed: %u transition(s) across %u passes\n", total, static_cast<uint32_t>(sorted.size()));
return result;
}
// == Emit barriers (replay precomputed transitions) ===========
void FrameGraph::EmitBarriers(const std::vector<Barrier>& barriers)
{
for (auto& b : barriers)
{
printf(" barrier: resource[%u] %s -> %s\n", b.resourceIndex, StateName(b.oldState), StateName(b.newState));
}
}
// Frame Graph MVP v2 -- Usage Example
// Compile: clang++ -std=c++17 -o example_v2 example_v2.cpp
#include "frame_graph_v2.h"
#include "frame_graph_v2.cpp" // single-TU build (Godbolt)
#include <cstdio>
int main()
{
printf("=== Frame Graph v2: Dependencies & Barriers ===\n");
FrameGraph fg;
// Import the swapchain backbuffer — externally owned.
auto backbuffer = fg.ImportResource({1920, 1080, Format::RGBA8}, ResourceState::Present);
auto depth = fg.CreateResource({1920, 1080, Format::D32F});
auto gbufA = fg.CreateResource({1920, 1080, Format::RGBA8});
auto hdr = fg.CreateResource({1920, 1080, Format::RGBA16F});
auto debug = fg.CreateResource({1920, 1080, Format::RGBA8});
fg.AddPass(
"DepthPrepass",
[&]()
{
fg.Write(0, depth);
},
[&](/*cmd*/)
{
printf(" >> exec: DepthPrepass\n");
});
fg.AddPass(
"GBuffer",
[&]()
{
fg.Read(1, depth);
fg.Write(1, gbufA);
},
[&](/*cmd*/)
{
printf(" >> exec: GBuffer\n");
});
fg.AddPass(
"Lighting",
[&]()
{
fg.Read(2, gbufA);
fg.Write(2, hdr);
},
[&](/*cmd*/)
{
printf(" >> exec: Lighting\n");
});
// Compute pass — explicit UAV access on hdr.
fg.AddPass(
"SSR",
[&]()
{
fg.ReadWrite(3, hdr);
},
[&](/*cmd*/)
{
printf(" >> exec: SSR (compute, UAV)\n");
});
// Present — writes to the imported backbuffer.
fg.AddPass(
"Present",
[&]()
{
fg.Read(4, hdr);
fg.Write(4, backbuffer);
},
[&](/*cmd*/)
{
printf(" >> exec: Present\n");
});
// Dead pass — nothing reads debug, so the graph will cull it.
fg.AddPass(
"DebugOverlay",
[&]()
{
fg.Write(5, debug);
},
[&](/*cmd*/)
{
printf(" >> exec: DebugOverlay\n");
});
fg.Execute();
return 0;
}
That’s three of the four intro promises delivered (automatic ordering, barrier insertion, and dead-pass culling), plus a clean compile/execute split that v3 will extend. The only piece missing: resources still live for the entire frame. Version 3 fixes that with lifetime analysis and memory aliasing.
MVP v3: Lifetimes & Aliasing#
v2 gives us ordering, culling, and barriers, but every transient resource still owns its own VRAM for the entire frame, even when it’s only alive for two passes out of twelve. A 1080p G-Buffer alone eats ~32 MB and the depth target another ~8 MB, yet both are dead after lighting. Meanwhile the post-process chain needs similarly-sized targets that could reuse that exact same memory, because the lifetimes never overlap (theory refresher). That’s what v3 automates.
The implementation adds two data structures (Lifetime, tracking first/last sorted-pass index per resource, and PhysicalBlock, a reusable heap slot) and extends Barrier with aliasing context. First, the lifetime scan. It walks the sorted pass list and records when each transient resource is first touched and last touched:
v3, New data structures: lifetime tracking & aliasing barriers
@@ frame_graph_v3.h, PhysicalBlock, Lifetime structs @@
+// A physical memory slot, multiple virtual resources can reuse it if their lifetimes don't overlap.
+struct PhysicalBlock
+{
+    uint32_t sizeBytes = 0;   // block size (aligned)
+    PassIndex availAfter = 0; // free after this sorted pass
+};
+
+// Per-resource lifetime in sorted-pass indices, drives aliasing decisions.
+struct Lifetime
+{
+    PassIndex firstUse = UINT32_MAX; // first sorted pass that touches this resource
+    PassIndex lastUse = 0;           // last sorted pass that touches this resource
+    bool isTransient = true;         // false for imported resources (externally owned)
+};
+
@@ frame_graph_v3.h, Barrier extended with aliasing fields @@
 // Base Barrier already defined in v2, v3 adds aliasing context.
 struct Barrier
 {
     ResourceIndex resourceIndex;
     ResourceState oldState;
     ResourceState newState;
+    bool isAliasing = false;                // aliasing barrier (block changes occupant)
+    ResourceIndex aliasBefore = UINT32_MAX; // resource being evicted
 };
To alias at the heap level, we need to know sizes. AllocSize() computes an aligned allocation size per resource: width × height × BytesPerPixel, rounded up to 64 KB (the same placement alignment real GPUs enforce). The diff adds AllocSize(), AlignUp(), and BytesPerPixel():
v3, Allocation helpers (AllocSize, alignment)
@@ frame_graph_v3.h, Allocation helpers @@
+// Minimum placement alignment for aliased heap resources (real APIs enforce similar, e.g. 64 KB).
+static constexpr uint32_t kPlacementAlignment = 65536; // 64 KB
+
+inline uint32_t AlignUp(uint32_t value, uint32_t alignment)
+{
+    return (value + alignment - 1) & ~(alignment - 1);
+}
+
+inline uint32_t BytesPerPixel(Format fmt)
+{
+    switch (fmt)
+    {
+    case Format::R8:
+        return 1;
+    case Format::RGBA8:
+        return 4;
+    case Format::D32F:
+        return 4;
+    case Format::RGBA16F:
+        return 8;
+    default:
+        return 4;
+    }
+}
+
+// Aligned allocation size, real drivers add row padding/tiling; we approximate with a round-up.
+inline uint32_t AllocSize(const ResourceDesc& desc)
+{
+    uint32_t raw = desc.width * desc.height * BytesPerPixel(desc.format);
+    return AlignUp(raw, kPlacementAlignment);
+}
ScanLifetimes() walks the sorted pass list and records each transient resource’s first and last use (as sorted-pass indices). Imported resources are excluded. The resulting intervals drive the aliasing allocator, and non-overlapping intervals can share one physical block:
v3, ScanLifetimes()
@@ frame_graph_v3.cpp, ScanLifetimes() @@
+// Record each resource's first/last use in sorted order, non-overlapping intervals can share memory.
+std::vector<Lifetime> FrameGraph::ScanLifetimes(const std::vector<PassIndex>& sorted)
+{
+    std::vector<Lifetime> life(entries.size());
+
+    // Imported resources (e.g. swapchain) are externally owned, exclude from aliasing.
+    for (ResourceIndex i = 0; i < entries.size(); i++)
+    {
+        if (entries[i].imported)
+            life[i].isTransient = false;
+    }
+
+    // Update first/last use for every resource each surviving pass touches.
+    for (PassIndex order = 0; order < sorted.size(); order++)
+    {
+        PassIndex passIdx = sorted[order];
+        if (!passes[passIdx].alive)
+            continue;
+
+        for (auto& h : passes[passIdx].reads)
+        {
+            life[h.index].firstUse = std::min(life[h.index].firstUse, order);
+            life[h.index].lastUse = std::max(life[h.index].lastUse, order);
+        }
+        for (auto& h : passes[passIdx].writes)
+        {
+            life[h.index].firstUse = std::min(life[h.index].firstUse, order);
+            life[h.index].lastUse = std::max(life[h.index].lastUse, order);
+        }
+    }
+    return life;
+}
This requires placed resources at the API level: GPU memory allocated from a heap, with resources bound to offsets within it. In D3D12, that means ID3D12Heap + CreatePlacedResource. In Vulkan, VkDeviceMemory + vkBindImageMemory at different offsets. Without placed resources (i.e. CreateCommittedResource or Vulkan dedicated allocations), each resource gets its own memory and aliasing is impossible, which is exactly why the graph’s allocator needs to work with heaps.
With lifetimes in hand, the greedy free-list allocator is straightforward. Sort resources by firstUse, walk them in order, and for each one either reuse an existing physical block whose previous occupant has finished, or allocate a new one. CompiledPlan gains a mapping vector (virtual resource to physical block), and ComputeBarriers() gains a Phase 1 that emits aliasing barriers whenever a block changes occupant:
v3, Header changes (BlockIndex, CompiledPlan, new methods)
@@ frame_graph_v3.h, BlockIndex typedef @@
+using BlockIndex = uint32_t; // index into the physical-block free list
@@ frame_graph_v3.h, CompiledPlan extended with mapping @@
 struct CompiledPlan
 {
     std::vector<PassIndex> sorted;
+    std::vector<BlockIndex> mapping; // mapping[ResourceIndex] → physical block
     std::vector<std::vector<Barrier>> barriers;
 };
@@ frame_graph_v3.h, new private methods @@
+    std::vector<Lifetime> ScanLifetimes(const std::vector<PassIndex>& sorted);
+    std::vector<BlockIndex> AliasResources(const std::vector<Lifetime>& lifetimes);
+    ResourceState StateForUsage(PassIndex passIdx, ResourceHandle h, bool isWrite) const;
-    std::vector<std::vector<Barrier>> ComputeBarriers(const std::vector<PassIndex>& sorted);
+    std::vector<std::vector<Barrier>> ComputeBarriers(const std::vector<PassIndex>& sorted,
+                                                      const std::vector<BlockIndex>& mapping);
The greedy free-list allocator implements the aliasing strategy. First it builds a firstUse-sorted index so resources are processed in the order they appear in the frame:
v3, AliasResources() setup & sorting
@@ frame_graph_v3.cpp, AliasResources() setup @@
+// Greedy first-fit: sort by firstUse, reuse any free block that fits, else allocate a new one.
+std::vector<BlockIndex> FrameGraph::AliasResources(const std::vector<Lifetime>& lifetimes)
+{
+    std::vector<PhysicalBlock> freeList;
+    std::vector<BlockIndex> mapping(entries.size(), UINT32_MAX);
+
+    // Process resources in the order they're first used.
+    std::vector<ResourceIndex> indices(entries.size());
+    std::iota(indices.begin(), indices.end(), 0);
+    std::sort(
+        indices.begin(),
+        indices.end(),
+        [&](ResourceIndex a, ResourceIndex b)
+        {
+            return lifetimes[a].firstUse < lifetimes[b].firstUse;
+        });
For each transient resource, scan existing physical blocks for one that is both free (previous occupant’s lastUse < this resource’s firstUse) and large enough. If found, reuse it. Otherwise, allocate a new block. The result is a mapping vector where mapping[ResourceIndex] gives the physical block index:
v3, AliasResources() allocation loop
@@ frame_graph_v3.cpp, AliasResources() allocation loop @@
+    for (ResourceIndex resIdx : indices)
+    {
+        if (!lifetimes[resIdx].isTransient)
+            continue; // skip imported resources
+        if (lifetimes[resIdx].firstUse == UINT32_MAX)
+            continue; // never used
+
+        uint32_t needed = AllocSize(entries[resIdx].desc);
+        bool reused = false;
+
+        // Scan existing blocks, can we reuse one that's now free?
+        for (BlockIndex b = 0; b < freeList.size(); b++)
+        {
+            if (freeList[b].availAfter < lifetimes[resIdx].firstUse
+                && freeList[b].sizeBytes >= needed)
+            {
+                mapping[resIdx] = b;                                // reuse this block
+                freeList[b].availAfter = lifetimes[resIdx].lastUse; // extend occupancy
+                reused = true;
+                break;
+            }
+        }
+
+        if (!reused)
+        { // no fit found → allocate a new physical block
+            mapping[resIdx] = static_cast<BlockIndex>(freeList.size());
+            freeList.push_back({needed, lifetimes[resIdx].lastUse});
+        }
+    }
+    return mapping;
+}
Compile() chains all stages: build edges → topo-sort → cull → scan lifetimes → alias → compute barriers. The diff shows the updated Compile() body and the new StateForUsage() method (extracted from v2’s inline lambda so both phases can reuse it). StateForUsage maps a resource handle + read/write flag to a ResourceState: UAV handles get UnorderedAccess, depth-format writes get DepthAttachment, other writes get ColorAttachment, reads get ShaderRead:
v3, Updated Compile() & state inference
@@ frame_graph_v3.cpp, Compile() extended with lifetime analysis + aliasing @@
 FrameGraph::CompiledPlan FrameGraph::Compile()
 {
     BuildEdges();
     auto sorted = TopoSort();
     Cull(sorted);
+    auto lifetimes = ScanLifetimes(sorted);   // when is each resource alive?
+    auto mapping = AliasResources(lifetimes); // share memory where lifetimes don't overlap
-    auto barriers = ComputeBarriers(sorted);
+    auto barriers = ComputeBarriers(sorted, mapping); // extended: also emits aliasing transitions
-    return { std::move(sorted), std::move(barriers) };
+    return { std::move(sorted), std::move(mapping), std::move(barriers) };
 }
@@ frame_graph_v3.cpp, StateForUsage() extracted as class method @@
+// Infer the ResourceState a pass needs for a given resource handle.
+ResourceState FrameGraph::StateForUsage(PassIndex passIdx, ResourceHandle h, bool isWrite) const
+{
+    for (auto& rw : passes[passIdx].readWrites)
+        if (rw.index == h.index)
+            return ResourceState::UnorderedAccess;
+    if (isWrite)
+        return (entries[h.index].desc.format == Format::D32F)
+            ? ResourceState::DepthAttachment : ResourceState::ColorAttachment;
+    return ResourceState::ShaderRead;
+}
ComputeBarriers() now runs two phases per pass. The function signature gains a mapping parameter (from AliasResources), and a blockOwner vector tracks which virtual resource currently occupies each physical block. First, the setup and handle deduplication (UAVs appear in both reads and writes, so we collect unique handles once):
v3, ComputeBarriers() setup & handle dedup
@@ frame_graph_v3.cpp, ComputeBarriers() rewritten with aliasing @@
+// v3 ComputeBarriers: two phases per pass instead of v2's one.
+// Phase 1, aliasing: did this physical block change occupant? If so, emit an aliasing barrier.
+// Phase 2, state: did this resource's state change? If so, emit a state-transition barrier.
 std::vector<std::vector<Barrier>> FrameGraph::ComputeBarriers(const std::vector<PassIndex>& sorted,
                                                               const std::vector<BlockIndex>& mapping)
 {
     std::vector<std::vector<Barrier>> result(sorted.size());
+    // blockOwner[block] = which virtual resource currently occupies it.
+    std::vector<ResourceIndex> blockOwner;
+    {
+        BlockIndex maxBlock = 0;
+        for (auto m : mapping)
+            if (m != UINT32_MAX)
+                maxBlock = std::max(maxBlock, m + 1);
+        blockOwner.assign(maxBlock, UINT32_MAX);
+    }
     for (PassIndex orderIdx = 0; orderIdx < sorted.size(); orderIdx++)
     {
         PassIndex passIdx = sorted[orderIdx];
         if (!passes[passIdx].alive)
             continue;
+        // ── Collect unique handles (ReadWrite puts h in both reads & writes) ──
+        std::vector<std::pair<ResourceHandle, bool>> unique; // {handle, isWrite}
+        std::unordered_set<ResourceIndex> seen;
+        for (auto& h : passes[passIdx].reads)
+            if (seen.insert(h.index).second)
+                unique.push_back({h, false});
+        for (auto& h : passes[passIdx].writes)
+        {
+            if (seen.insert(h.index).second)
+            {
+                unique.push_back({h, true});
+            }
+            else
+            {
+                // already in reads, upgrade to write (UAV)
+                for (auto& [uh, w] : unique)
+                    if (uh.index == h.index)
+                    {
+                        w = true;
+                        break;
+                    }
+            }
+        }
Phase 1 (aliasing): for each unique handle, check whether its physical block was previously occupied by a different resource, and if so, emit an aliasing barrier so the GPU flushes caches. Phase 2 (state transitions) works exactly as in v2: compare the tracked state to the needed state and emit a transition when they differ:
v3, ComputeBarriers() Phase 1 & 2
@@ frame_graph_v3.cpp, Phase 1 (aliasing) + Phase 2 (state transitions) @@
+        // ── Phase 1: aliasing barriers ──────────────────────────
+        for (auto& [h, _] : unique)
+        {
+            BlockIndex block = mapping[h.index];
+            if (block == UINT32_MAX)
+                continue; // imported, no aliasing
+            if (blockOwner[block] != UINT32_MAX && blockOwner[block] != h.index)
+            {
+                result[orderIdx].push_back(
+                    {h.index, ResourceState::Undefined, ResourceState::Undefined, true, blockOwner[block]});
+            }
+            blockOwner[block] = h.index; // update current occupant
+        }
+
+        // ── Phase 2: state-transition barriers (same as v2) ────
+        for (auto& [h, isWrite] : unique)
+        {
+            ResourceState needed = StateForUsage(passIdx, h, isWrite);
+            if (entries[h.index].currentState != needed)
+            {
+                result[orderIdx].push_back(
+                    {h.index, entries[h.index].currentState, needed});
+                entries[h.index].currentState = needed;
+            }
+        }
     }
     return result;
 }
EmitBarriers() grows a branch to dispatch the two barrier types: aliasing transitions go to the heap-level API, state transitions go to the per-resource API:
v3, EmitBarriers() with aliasing dispatch
@@ frame_graph_v3.cpp, EmitBarriers() extended with aliasing handling @@
 void FrameGraph::EmitBarriers(const std::vector<Barrier>& barriers)
 {
     for (auto& b : barriers)
     {
-        // (v2: only state transitions)
+        if (b.isAliasing)
+        {
+            // D3D12: D3D12_RESOURCE_BARRIER_TYPE_ALIASING / Vulkan: appropriate synchronization between the old and new occupant.
+        }
+        else
+        {
+            // D3D12: ResourceBarrier(StateBefore→StateAfter) / Vulkan: vkCmdPipelineBarrier(oldLayout→newLayout).
+        }
     }
 }
Execute() stays unchanged: still pure playback of the compiled plan. All the new logic lives in Compile(), which now runs six stages instead of four: build edges → topo-sort → cull → scan lifetimes → alias → compute barriers. The new stages add ~170 lines on top of v2.
The aliasing barrier and alignment rules are the same ones covered in Part I. The important implementation point here is that v3 now emits the aliasing barrier when a block changes occupant, and AllocSize() gives the free-list allocator a conservative aligned size to work with. The size model is still intentionally simplified. A production allocator would query the driver for the real requirements, but the aliasing algorithm stays the same.
That wraps v3. Starting from v2’s compile/execute split, we added lifetime analysis, a greedy free-list allocator, and aliasing-aware barrier insertion: the same architecture Frostbite described at GDC 2017, and the same approach UE5 uses today for every transient FRDGTexture created through FRDGBuilder::CreateTexture. The graph now owns the full lifecycle: declare virtual resources → analyze dependencies → pack physical memory → precompute every barrier → execute as pure playback.
Full v3 source#
#pragma once
// Frame Graph MVP v3 -- Lifetimes & Aliasing
// Adds: lifetime analysis, greedy free-list memory aliasing.
// Builds on v2 (dependencies, topo-sort, culling, barriers).
//
// Compile: clang++ -std=c++17 -o example_v3 example_v3.cpp
// or: g++ -std=c++17 -o example_v3 example_v3.cpp
#include <cstdint>
#include <functional>
#include <string>
#include <vector>
// == Resource description (virtual until compile) ==============
enum class Format
{
RGBA8,
RGBA16F,
R8,
D32F
};
struct ResourceDesc
{
uint32_t width = 0;
uint32_t height = 0;
Format format = Format::RGBA8;
};
using ResourceIndex = uint32_t;
using PassIndex = uint32_t;
using BlockIndex = uint32_t; // index into the physical-block free list
struct ResourceHandle
{
ResourceIndex index = UINT32_MAX;
bool IsValid() const { return index != UINT32_MAX; }
};
// == Resource state tracking ===================================
enum class ResourceState
{
Undefined,
ColorAttachment,
DepthAttachment,
ShaderRead,
UnorderedAccess,
Present
};
inline const char* StateName(ResourceState s)
{
switch (s)
{
case ResourceState::Undefined:
return "Undefined";
case ResourceState::ColorAttachment:
return "ColorAttachment";
case ResourceState::DepthAttachment:
return "DepthAttachment";
case ResourceState::ShaderRead:
return "ShaderRead";
case ResourceState::UnorderedAccess:
return "UnorderedAccess";
case ResourceState::Present:
return "Present";
default:
return "?";
}
}
// Precomputed barrier — stored during Compile(), replayed during Execute().
struct Barrier
{
ResourceIndex resourceIndex;
ResourceState oldState;
ResourceState newState;
bool isAliasing = false; // aliasing barrier (block changes occupant)
ResourceIndex aliasBefore = UINT32_MAX; // resource being evicted
};
struct ResourceVersion
{
PassIndex writerPass = UINT32_MAX;
std::vector<PassIndex> readerPasses;
bool HasWriter() const { return writerPass != UINT32_MAX; }
};
struct ResourceEntry
{
ResourceDesc desc;
std::vector<ResourceVersion> versions;
ResourceState currentState = ResourceState::Undefined;
bool imported = false; // imported resources are not owned by the graph
};
// == Physical memory block ======================================
struct PhysicalBlock
{
uint32_t sizeBytes = 0;
PassIndex availAfter = 0; // sorted pass after which this block is free
};
// == Allocation helpers ========================================
// Minimum placement alignment for aliased heap resources.
// Real APIs enforce similar constraints (e.g. 64 KB on most GPUs).
static constexpr uint32_t kPlacementAlignment = 65536; // 64 KB
inline uint32_t AlignUp(uint32_t value, uint32_t alignment)
{
return (value + alignment - 1) & ~(alignment - 1);
}
inline uint32_t BytesPerPixel(Format fmt)
{
switch (fmt)
{
case Format::R8:
return 1;
case Format::RGBA8:
return 4;
case Format::D32F:
return 4;
case Format::RGBA16F:
return 8;
default:
return 4;
}
}
// Compute aligned allocation size for a resource.
// Real drivers add row padding and tiling overhead; we approximate
// with a simple alignment round-up to demonstrate the principle.
inline uint32_t AllocSize(const ResourceDesc& desc)
{
uint32_t raw = desc.width * desc.height * BytesPerPixel(desc.format);
return AlignUp(raw, kPlacementAlignment);
}
// == Lifetime info per resource =================================
struct Lifetime
{
PassIndex firstUse = UINT32_MAX;
PassIndex lastUse = 0;
bool isTransient = true;
};
// == Render pass ===============================================
struct RenderPass
{
std::string name;
std::function<void()> Setup;
std::function<void(/*cmd list*/)> Execute;
std::vector<ResourceHandle> reads;
std::vector<ResourceHandle> writes;
std::vector<ResourceHandle> readWrites; // UAV (explicit)
std::vector<PassIndex> dependsOn;
std::vector<PassIndex> successors;
uint32_t inDegree = 0;
bool alive = false;
};
// == Frame graph (v3: full MVP) ================================
class FrameGraph
{
public:
ResourceHandle CreateResource(const ResourceDesc& desc);
ResourceHandle ImportResource(const ResourceDesc& desc, ResourceState initialState = ResourceState::Undefined);
void Read(PassIndex passIdx, ResourceHandle h);
void Write(PassIndex passIdx, ResourceHandle h);
void ReadWrite(PassIndex passIdx, ResourceHandle h); // UAV access
template <typename SetupFn, typename ExecFn> void AddPass(const std::string& name, SetupFn&& setup, ExecFn&& exec)
{
passes.push_back({name, std::forward<SetupFn>(setup), std::forward<ExecFn>(exec)});
passes.back().Setup();
}
// == compile -- builds the execution plan (order, aliasing map, barriers) ==
struct CompiledPlan
{
std::vector<PassIndex> sorted;
std::vector<BlockIndex> mapping; // mapping[ResourceIndex] → physical block
std::vector<std::vector<Barrier>> barriers;
};
CompiledPlan Compile();
void Execute(const CompiledPlan& plan);
// convenience: compile + execute in one call
void Execute();
private:
std::vector<RenderPass> passes;
std::vector<ResourceEntry> entries;
void BuildEdges();
std::vector<PassIndex> TopoSort();
void Cull(const std::vector<PassIndex>& sorted);
std::vector<Lifetime> ScanLifetimes(const std::vector<PassIndex>& sorted);
std::vector<BlockIndex> AliasResources(const std::vector<Lifetime>& lifetimes);
ResourceState StateForUsage(PassIndex passIdx, ResourceHandle h, bool isWrite) const;
std::vector<std::vector<Barrier>> ComputeBarriers(const std::vector<PassIndex>& sorted, const std::vector<BlockIndex>& mapping);
void EmitBarriers(const std::vector<Barrier>& barriers);
};
// ============ frame_graph_v3.cpp ============
#include "frame_graph_v3.h"
#include <algorithm>
#include <cassert>
#include <cstdio>
#include <numeric>
#include <queue>
#include <unordered_set>
// == FrameGraph implementation =================================
ResourceHandle FrameGraph::CreateResource(const ResourceDesc& desc)
{
entries.push_back({desc, {{}}, ResourceState::Undefined, false});
return {static_cast<ResourceIndex>(entries.size() - 1)};
}
ResourceHandle FrameGraph::ImportResource(const ResourceDesc& desc, ResourceState initialState)
{
entries.push_back({desc, {{}}, initialState, true});
return {static_cast<ResourceIndex>(entries.size() - 1)};
}
void FrameGraph::Read(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back();
if (ver.HasWriter())
{
passes[passIdx].dependsOn.push_back(ver.writerPass);
}
ver.readerPasses.push_back(passIdx);
passes[passIdx].reads.push_back(h);
}
void FrameGraph::Write(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back(); // current version (pre-bump)
if (ver.HasWriter())
passes[passIdx].dependsOn.push_back(ver.writerPass); // WAW edge: prev writer must finish
for (PassIndex reader : ver.readerPasses)
passes[passIdx].dependsOn.push_back(reader); // WAR edge: reader must finish first
entries[h.index].versions.push_back({});
entries[h.index].versions.back().writerPass = passIdx;
passes[passIdx].writes.push_back(h);
}
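// Example of the versioning, assuming passes A, B, C touch handle h in
// order Write(A, h), Read(B, h), Write(C, h):
//   v1 { writer=A, readers=[B] }  ->  v2 { writer=C }
//   edges added: A->B (RAW), A->C (WAW), B->C (WAR)
// so C can never overwrite h before B has read it.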
void FrameGraph::ReadWrite(PassIndex passIdx, ResourceHandle h)
{
auto& ver = entries[h.index].versions.back();
if (ver.HasWriter())
{
passes[passIdx].dependsOn.push_back(ver.writerPass); // RAW edge
}
for (PassIndex reader : ver.readerPasses)
passes[passIdx].dependsOn.push_back(reader); // WAR edge
entries[h.index].versions.push_back({});
entries[h.index].versions.back().writerPass = passIdx;
passes[passIdx].reads.push_back(h);
passes[passIdx].writes.push_back(h);
passes[passIdx].readWrites.push_back(h);
}
// == v3: compile -- builds the execution plan (order, aliasing map, barriers) ==
FrameGraph::CompiledPlan FrameGraph::Compile()
{
printf("\n[1] Building dependency edges...\n");
BuildEdges();
printf("[2] Topological sort...\n");
auto sorted = TopoSort();
printf("[3] Culling dead passes...\n");
Cull(sorted);
printf("[4] Scanning resource lifetimes...\n");
auto lifetimes = ScanLifetimes(sorted);
printf("[5] Aliasing resources (greedy free-list)...\n");
auto mapping = AliasResources(lifetimes);
printf("[6] Computing barriers...\n");
auto barriers = ComputeBarriers(sorted, mapping);
// The compiled plan is fully determined — execution order, memory
// mapping, and every barrier transition. Execute is pure playback.
return {std::move(sorted), std::move(mapping), std::move(barriers)};
}
// == v3: execute -- runs the compiled plan =====================
void FrameGraph::EmitBarriers(const std::vector<Barrier>& barriers)
{
for (auto& b : barriers)
{
if (b.isAliasing)
{
printf(" aliasing barrier: physical block shared by resource[%u] -> resource[%u]\n", b.aliasBefore, b.resourceIndex);
}
else
{
printf(" barrier: resource[%u] %s -> %s\n", b.resourceIndex, StateName(b.oldState), StateName(b.newState));
}
// e.g. vkCmdPipelineBarrier / ID3D12GraphicsCommandList::ResourceBarrier
}
}
void FrameGraph::Execute(const CompiledPlan& plan)
{
printf("[7] Executing (replaying precomputed barriers):\n");
for (PassIndex orderIdx = 0; orderIdx < plan.sorted.size(); orderIdx++)
{
PassIndex passIdx = plan.sorted[orderIdx];
if (!passes[passIdx].alive)
{
printf(" -- skip: %s (CULLED)\n", passes[passIdx].name.c_str());
continue;
}
EmitBarriers(plan.barriers[orderIdx]);
passes[passIdx].Execute(/* &cmdList */);
}
    // Single-use by design: clear all graph state so the next frame
    // re-declares its passes and resources from scratch.
    passes.clear();
    entries.clear();
}
// convenience: compile + execute in one call
void FrameGraph::Execute()
{
Execute(Compile());
}
// == Build dependency edges ====================================
void FrameGraph::BuildEdges()
{
for (PassIndex i = 0; i < passes.size(); i++)
{
std::unordered_set<PassIndex> seen;
for (PassIndex dep : passes[i].dependsOn)
{
if (seen.insert(dep).second)
{
passes[dep].successors.push_back(i);
passes[i].inDegree++;
}
}
}
}
// == Kahn's topological sort -- O(V + E) ========================
std::vector<PassIndex> FrameGraph::TopoSort()
{
std::queue<PassIndex> q;
std::vector<uint32_t> inDeg(passes.size());
for (PassIndex i = 0; i < passes.size(); i++)
{
inDeg[i] = passes[i].inDegree;
if (inDeg[i] == 0)
q.push(i);
}
std::vector<PassIndex> order;
while (!q.empty())
{
PassIndex cur = q.front();
q.pop();
order.push_back(cur);
for (PassIndex succ : passes[cur].successors)
{
if (--inDeg[succ] == 0)
q.push(succ);
}
}
assert(order.size() == passes.size() && "Cycle detected!");
printf(" Topological order: ");
for (PassIndex i = 0; i < order.size(); i++)
{
printf("%s%s", passes[order[i]].name.c_str(), i + 1 < order.size() ? " -> " : "\n");
}
return order;
}
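// Example: with edges A->B, A->C, B->D, C->D, pass A starts at inDegree 0
// and is visited first; B and C follow in queue (FIFO) order; D comes last.
// Any order respecting the edges is valid -- FIFO just makes ties deterministic.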
// == Cull dead passes ==========================================
void FrameGraph::Cull(const std::vector<PassIndex>& sorted)
{
if (sorted.empty())
return;
passes[sorted.back()].alive = true;
for (int i = static_cast<int>(sorted.size()) - 1; i >= 0; i--)
{
if (!passes[sorted[i]].alive)
continue;
for (PassIndex dep : passes[sorted[i]].dependsOn)
passes[dep].alive = true;
}
printf(" Culling result: ");
for (PassIndex i = 0; i < passes.size(); i++)
{
printf("%s=%s%s", passes[i].name.c_str(), passes[i].alive ? "ALIVE" : "DEAD", i + 1 < passes.size() ? ", " : "\n");
}
}
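// The walk seeds from the LAST pass in topological order and marks its
// transitive dependencies ALIVE. That assumes the final pass produces the
// frame's output (true here: Present). A production graph would instead
// seed from explicit roots, e.g. every pass that writes an imported resource.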
// == State inference ===========================================
ResourceState FrameGraph::StateForUsage(PassIndex passIdx, ResourceHandle h, bool isWrite) const
{
// If the caller registered this handle via ReadWrite(), it's a UAV.
for (auto& rw : passes[passIdx].readWrites)
if (rw.index == h.index)
return ResourceState::UnorderedAccess;
if (isWrite)
return (entries[h.index].desc.format == Format::D32F) ? ResourceState::DepthAttachment : ResourceState::ColorAttachment;
return ResourceState::ShaderRead;
}
// == Compute barriers ==========================================
std::vector<std::vector<Barrier>> FrameGraph::ComputeBarriers(const std::vector<PassIndex>& sorted, const std::vector<BlockIndex>& mapping)
{
std::vector<std::vector<Barrier>> result(sorted.size());
// blockOwner[block] = which virtual resource currently occupies it.
std::vector<ResourceIndex> blockOwner;
{
BlockIndex maxBlock = 0;
for (auto m : mapping)
if (m != UINT32_MAX)
maxBlock = std::max(maxBlock, m + 1);
blockOwner.assign(maxBlock, UINT32_MAX);
}
for (PassIndex orderIdx = 0; orderIdx < sorted.size(); orderIdx++)
{
PassIndex passIdx = sorted[orderIdx];
if (!passes[passIdx].alive)
continue;
// --- Collect unique handles (ReadWrite puts h in both reads & writes) ---
std::vector<std::pair<ResourceHandle, bool>> unique; // {handle, isWrite}
std::unordered_set<ResourceIndex> seen;
for (auto& h : passes[passIdx].reads)
if (seen.insert(h.index).second)
unique.push_back({h, false});
for (auto& h : passes[passIdx].writes)
{
if (seen.insert(h.index).second)
{
unique.push_back({h, true});
}
else
{
// already in reads — upgrade to write (UAV)
for (auto& [uh, w] : unique)
if (uh.index == h.index)
{
w = true;
break;
}
}
}
// --- Phase 1: aliasing barriers (block changes occupant) ---
for (auto& [h, _] : unique)
{
BlockIndex block = mapping[h.index];
if (block == UINT32_MAX)
continue;
if (blockOwner[block] != UINT32_MAX && blockOwner[block] != h.index)
{
result[orderIdx].push_back({h.index, ResourceState::Undefined, ResourceState::Undefined, true, blockOwner[block]});
}
blockOwner[block] = h.index;
}
// --- Phase 2: state-transition barriers ---
for (auto& [h, isWrite] : unique)
{
ResourceState needed = StateForUsage(passIdx, h, isWrite);
if (entries[h.index].currentState != needed)
{
result[orderIdx].push_back({h.index, entries[h.index].currentState, needed});
entries[h.index].currentState = needed;
}
}
}
uint32_t total = 0;
for (auto& v : result)
total += static_cast<uint32_t>(v.size());
printf(" Barriers computed: %u transition(s) across %u passes\n", total, static_cast<uint32_t>(sorted.size()));
return result;
}
// == Scan lifetimes ============================================
std::vector<Lifetime> FrameGraph::ScanLifetimes(const std::vector<PassIndex>& sorted)
{
std::vector<Lifetime> life(entries.size());
// Imported resources are not transient -- skip them during aliasing.
for (ResourceIndex i = 0; i < entries.size(); i++)
{
if (entries[i].imported)
life[i].isTransient = false;
}
for (PassIndex order = 0; order < sorted.size(); order++)
{
PassIndex passIdx = sorted[order];
if (!passes[passIdx].alive)
continue;
for (auto& h : passes[passIdx].reads)
{
life[h.index].firstUse = std::min(life[h.index].firstUse, order);
life[h.index].lastUse = std::max(life[h.index].lastUse, order);
}
for (auto& h : passes[passIdx].writes)
{
life[h.index].firstUse = std::min(life[h.index].firstUse, order);
life[h.index].lastUse = std::max(life[h.index].lastUse, order);
}
}
printf(" Lifetimes (in sorted pass order):\n");
for (ResourceIndex i = 0; i < life.size(); i++)
{
if (life[i].firstUse == UINT32_MAX)
{
printf(" resource[%u] unused (dead)\n", i);
}
else
{
printf(" resource[%u] alive [pass %u .. pass %u]\n", i, life[i].firstUse, life[i].lastUse);
}
}
return life;
}
// == Greedy free-list aliasing ==================================
std::vector<BlockIndex> FrameGraph::AliasResources(const std::vector<Lifetime>& lifetimes)
{
std::vector<PhysicalBlock> freeList;
std::vector<BlockIndex> mapping(entries.size(), UINT32_MAX);
uint32_t totalWithout = 0;
std::vector<ResourceIndex> indices(entries.size());
std::iota(indices.begin(), indices.end(), 0);
std::sort(
indices.begin(),
indices.end(),
[&](ResourceIndex a, ResourceIndex b)
{
return lifetimes[a].firstUse < lifetimes[b].firstUse;
});
printf(" Aliasing:\n");
for (ResourceIndex resIdx : indices)
{
if (!lifetimes[resIdx].isTransient)
continue;
if (lifetimes[resIdx].firstUse == UINT32_MAX)
continue;
uint32_t needed = AllocSize(entries[resIdx].desc);
totalWithout += needed;
bool reused = false;
for (BlockIndex b = 0; b < freeList.size(); b++)
{
if (freeList[b].availAfter < lifetimes[resIdx].firstUse && freeList[b].sizeBytes >= needed)
{
mapping[resIdx] = b;
freeList[b].availAfter = lifetimes[resIdx].lastUse;
reused = true;
printf(
" resource[%u] -> reuse physical block %u "
"(%.1f MB, lifetime [%u..%u])\n",
resIdx,
b,
needed / (1024.0f * 1024.0f),
lifetimes[resIdx].firstUse,
lifetimes[resIdx].lastUse);
break;
}
}
if (!reused)
{
mapping[resIdx] = static_cast<BlockIndex>(freeList.size());
printf(
" resource[%u] -> NEW physical block %u "
"(%.1f MB, lifetime [%u..%u])\n",
resIdx,
static_cast<BlockIndex>(freeList.size()),
needed / (1024.0f * 1024.0f),
lifetimes[resIdx].firstUse,
lifetimes[resIdx].lastUse);
freeList.push_back({needed, lifetimes[resIdx].lastUse});
}
}
uint32_t totalWith = 0;
for (auto& blk : freeList)
totalWith += blk.sizeBytes;
printf(
" Memory: %u physical blocks for %u virtual resources\n",
static_cast<uint32_t>(freeList.size()),
static_cast<uint32_t>(entries.size()));
printf(" Without aliasing: %.1f MB\n", totalWithout / (1024.0f * 1024.0f));
printf(
" With aliasing: %.1f MB (saved %.1f MB, %.0f%%)\n",
totalWith / (1024.0f * 1024.0f),
(totalWithout - totalWith) / (1024.0f * 1024.0f),
totalWithout > 0 ? 100.0f * (totalWithout - totalWith) / totalWithout : 0.0f);
return mapping;
}
// Frame Graph MVP v3 -- Usage Example
// Compile: clang++ -std=c++17 -o example_v3 example_v3.cpp
#include "frame_graph_v3.h"
#include "frame_graph_v3.cpp" // single-TU build (Godbolt)
#include <cstdio>
int main()
{
printf("=== Frame Graph v3: Lifetimes & Memory Aliasing ===\n");
FrameGraph fg;
// Import the swapchain backbuffer — externally owned.
// The graph tracks barriers but won't alias it.
auto backbuffer = fg.ImportResource({1920, 1080, Format::RGBA8}, ResourceState::Present);
auto depth = fg.CreateResource({1920, 1080, Format::D32F});
auto gbufA = fg.CreateResource({1920, 1080, Format::RGBA8});
auto gbufN = fg.CreateResource({1920, 1080, Format::RGBA8});
auto hdr = fg.CreateResource({1920, 1080, Format::RGBA16F});
auto bloom = fg.CreateResource({960, 540, Format::RGBA16F});
auto ldr = fg.CreateResource({1920, 1080, Format::RGBA8});
auto debug = fg.CreateResource({1920, 1080, Format::RGBA8});
fg.AddPass(
"DepthPrepass",
[&]()
{
fg.Write(0, depth);
},
[&](/*cmd*/)
{
printf(" >> exec: DepthPrepass\n");
});
fg.AddPass(
"GBuffer",
[&]()
{
fg.Read(1, depth);
fg.Write(1, gbufA);
fg.Write(1, gbufN);
},
[&](/*cmd*/)
{
printf(" >> exec: GBuffer\n");
});
fg.AddPass(
"Lighting",
[&]()
{
fg.Read(2, gbufA);
fg.Read(2, gbufN);
fg.Write(2, hdr);
},
[&](/*cmd*/)
{
printf(" >> exec: Lighting\n");
});
// Compute pass — explicit UAV access on hdr.
fg.AddPass(
"SSR",
[&]()
{
fg.ReadWrite(3, hdr);
},
[&](/*cmd*/)
{
printf(" >> exec: SSR (compute, UAV)\n");
});
fg.AddPass(
"Bloom",
[&]()
{
fg.Read(4, hdr);
fg.Write(4, bloom);
},
[&](/*cmd*/)
{
printf(" >> exec: Bloom\n");
});
// Tonemap writes a NEW resource (ldr), so hdr + bloom die here.
// Their physical blocks become available for reuse.
fg.AddPass(
"Tonemap",
[&]()
{
fg.Read(5, hdr);
fg.Read(5, bloom);
fg.Write(5, ldr);
},
[&](/*cmd*/)
{
printf(" >> exec: Tonemap\n");
});
fg.AddPass(
"Present",
[&]()
{
fg.Read(6, ldr);
fg.Write(6, backbuffer);
},
[&](/*cmd*/)
{
printf(" >> exec: Present\n");
});
// Dead pass — nothing reads debug, so the graph will cull it.
fg.AddPass(
"DebugOverlay",
[&]()
{
fg.Write(7, debug);
},
[&](/*cmd*/)
{
printf(" >> exec: DebugOverlay\n");
});
auto plan = fg.Compile(); // topo-sort, cull, alias, compute barriers
fg.Execute(plan); // replay barriers + run
return 0;
}
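// Hand-traced expectations for this graph (a sketch, assuming the FIFO
// tie-breaking in TopoSort; the program's own printout is authoritative):
//   [2] order: DepthPrepass -> DebugOverlay -> GBuffer -> Lighting -> SSR
//              -> Bloom -> Tonemap -> Present
//   [3] DebugOverlay=DEAD (nothing reads 'debug'); everything else ALIVE
//   [5] bloom reuses depth's block (depth dies after GBuffer), and ldr
//       reuses gbufA's block (the G-Buffer dies after Lighting) -- roughly
//       a 20% VRAM saving on this graph.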
What the MVP delivers#
Here’s the full per-frame lifecycle, the same three-phase architecture from Part I, now backed by real code:
Declare: each AddPass call runs its setup lambda immediately.
- declare reads & writes
- request virtual resources
- version tracking builds edges

Compile: produces the CompiledPlan (execution order, memory mapping, and every barrier). No GPU work yet.
- sort: topological order (Kahn's)
- cull: kill dead passes
- scan lifetimes: first/last use
- alias: free-list reuse
- compute barriers: detect state transitions

Execute: pure playback of the plan.
- submit precomputed barriers
- call each execute lambda
- resources already aliased & bound
~350 lines of C++17. No external dependencies, no allocator library, no render backend, just the algorithmic core that production engines build on. Every concept from Part I (virtual resources, dependency edges, topological ordering, dead-pass culling, barrier inference, lifetime analysis, and VRAM aliasing) now exists as running, compilable code.
What’s next#
The MVP runs everything on a single queue, issues barriers as immediate full-pipeline stalls, and uses the simplest possible allocation strategy. Production engines push past these limits; two upgrades matter most:
- Async compute: the graph already encodes which passes are independent. Mapping those onto a second hardware queue lets the GPU overlap compute work (SSAO, light culling, particle simulation) with graphics work on the same frame, recovering otherwise-idle ALU cycles.
- Split barriers: instead of stalling the full pipeline at the point of use, split barriers separate the “start transitioning” signal from the “must be done” signal. The driver gets a window to schedule the transition in parallel with unrelated work, often eliminating the stall entirely.
Both features plug directly into the CompiledPlan architecture: async compute adds a queue assignment per pass, and split barriers replace single Barrier entries with begin/end pairs. The graph’s structure doesn’t change, only the execution strategy does.
Part III: Beyond MVP walks through both upgrades with code diffs against the v3 base we just built.
