Premium AI Training Datasets
for AEC Compliance & Automation

Curated conversational datasets covering IBC, NFPA, ASHRAE, ADA, NEC, and more. Train your AI models on real-world compliance scenarios.

⬇️ 103 downloads

What is Lumen-Models?

Lumen-Models is a premium collection of conversational datasets designed to train AI models in Architecture, Engineering, and Construction (AEC). Each conversation is a professional dialogue between a BIM Auditor and an expert GPT, solving real compliance problems. The dataset covers 20+ international codes including IBC, NFPA, ASHRAE, ADA, and NEC. Perfect for LLM fine-tuning, RAG systems, and automated compliance checking.

Why Lumen-Models?

📋 Comprehensive Compliance Coverage

50+ conversation blocks covering 20+ international codes including IBC, NFPA, ASHRAE, ADA, and NEC.

⚙️ Ready for LLM Fine-Tuning

Structured JSON format optimized for Qwen, Llama, and other open-source models. Perfect for RAG and automated compliance checking.

🧠 Real-World Scenarios

Each conversation is a professional dialogue between a BIM Auditor and an expert GPT, solving actual AEC compliance problems.

What's Inside the Dataset?

Codes & Standards Covered

IBCNFPA 13NFPA 101 NECADAASHRAE 90.1 ASHRAE 62.1AISC 358ASCE 7 ASTMIEBCIPC AIAOSHALEED NBCOBC

Dataset Preview

Each conversation includes: Human (BIM Auditor) → GPT Expert → Follow-up → Documentation.

{
  "conversation": [
    { "role": "human", "content": "Does IBC require fire dampers in 2-hour rated shafts?" },
    { "role": "gpt", "content": "Yes, IBC Section 717 requires fire dampers..." },
    { "role": "human", "content": "What about NFPA 90A?" },
    { "role": "gpt", "content": "NFPA 90A parallels IBC but adds..." }
  ],
  "metadata": { "code": "IBC", "discipline": "Fire Protection" }
}

Full dataset contains 32+ blocks with detailed compliance dialogues.

Simple, Transparent Pricing

sample

Free Sample

$0
  • 20 conversation blocks
  • Preview quality & format
  • Basic compliance topics
  • ❌ No ACC module
  • ❌ No commercial license
Download Free Sample
popular

Full Dataset

$29 one-time
  • 32+ conversation blocks
  • 20+ codes & disciplines
  • ACC module included
  • 1 year updates
  • ✅ Commercial license
  • JSON format, ready to use
Buy Now – $29
⭐ premium

Premium Dataset

$49 one-time
  • 100+ conversation blocks (vs. 32 in Full)
  • 40+ technical topics (vs. 20 in Full)
  • Extended code coverage: IBC, NFPA, ASHRAE, ADA, NEC, AISC, ASTM, OSHA, LEED
  • ACC module included
  • Lifetime updates (no recurring fees)
  • ✅ Commercial license included
  • JSON format, ready for LLM fine-tuning & RAG
  • Professional-grade dataset for enterprises and research
Buy Now – $49

💳 Secure payments via Gumroad. After payment, you'll receive an email with your download link.

Get Your Dataset

Free sample: Download a preview of 20 conversation blocks.

📄 lumen_models_sample.json · 20 blocks · 45 KB
⬇ Download Free Sample

Full dataset: After payment, you'll receive an email with your download link from Gumroad.

📦 lumen_models_full.json · 32+ blocks · 2.4 MB
⚡ Automatic delivery · Commercial license included

Need a Custom Dataset?

Contact us for custom datasets, larger volumes, special requests, or Hugging Face integration.

🤗 We support Hugging Face datasets & Spaces
🤗

Hugging Face Integration

Our datasets are formatted for seamless integration with Hugging Face Datasets and Spaces. Ask us about custom fine-tuning for your models.