Treemap

Data Lake Storage Architecture

Dark-themed treemap visualizing data lake storage distribution across bronze, silver, and gold layers with different data formats.

Output
Data Lake Storage Architecture
Python
import matplotlib.pyplot as plt
import squarify
import numpy as np

# Data lake layers (in TB)
labels = ['Bronze Raw', 'Bronze Logs', 'Silver Cleaned', 'Silver Enriched',
          'Gold Analytics', 'Gold ML Features', 'Archive Cold', 'Staging']
sizes = [85, 120, 45, 65, 28, 35, 180, 22]
total = sum(sizes)

# Color scheme: bronze/silver/gold themed with CLAUDE.md
colors = ['#C82909', '#F54927', '#27D3F5', '#276CF5', 
          '#F5D327', '#F5B027', '#5314E6', '#6CF527']

fig, ax = plt.subplots(figsize=(12, 8), facecolor='#0a0a0f')
ax.set_facecolor('#0a0a0f')

pct = [s/total*100 for s in sizes]
squarify.plot(sizes=sizes, 
              label=[f'{l}\n{s}TB ({p:.1f}%)' for l, s, p in zip(labels, sizes, pct)],
              color=colors, alpha=0.85, ax=ax,
              text_kwargs={'fontsize': 10, 'color': 'white', 'fontweight': 'bold'})

ax.axis('off')
ax.set_title(f'Data Lake Architecture - {total/1000:.2f} PB Total', 
             fontsize=18, color='#f8fafc', fontweight='bold', pad=20)

plt.tight_layout()
plt.show()
Library

Matplotlib

Category

Part-to-Whole

Did this help you?

Support PyLucid to keep it free & growing

Support