This guide explains AI models in NodeTool and how to choose between local and cloud execution.
What Are AI Models?
An AI model is a trained program for a specific task:
| Model Type | Function | Example |
|---|---|---|
| Language Model (LLM) | Text generation | Stories, Q&A, summarization |
| Image Model | Image generation/editing | Artwork, photos, variations |
| Speech Model | Speech↔text conversion | Transcription, text-to-speech |
| Vision Model | Image understanding | Photo descriptions, OCR |
Models come pre-trained. Select one for your task.
Local vs. Cloud
NodeTool runs AI models locally or through cloud APIs.
Local Models
Pros:
- 🔒 Private – Data stays local
- 💰 Free – No usage costs
- 📶 Offline – Works without internet
Cons:
- 💾 Requires space – 4-15 GB per model
- ⚡ Needs hardware – Faster with GPU
- ⏳ Initial download – One-time setup
Cloud Models
Pros:
- 🚀 Fast – No downloads
- 💻 Any hardware – Works on older machines
- 🆕 Latest models – Access newest capabilities
Cons:
- 💵 Usage costs – Pay per task
- 🌐 Requires internet
- 📤 Data sent externally
Mixed Approach (Recommended)
Combine local and cloud:
- Speech recognition – local for privacy
- Image generation – cloud for quality
- Document processing – local for confidential files
State-of-the-Art Cloud Models for Creative Workflows
NodeTool provides access to the latest high-quality generative AI models through cloud providers. These models represent the cutting edge in video, image, and audio generation:
Top Video Generation Models
| Model | Provider | Capabilities | Key Features |
|---|---|---|---|
| OpenAI Sora 2 Pro | OpenAI | T2V/I2V up to 15s | Realistic motion, refined physics, synchronized native audio, 1080p output |
| Google Veo 3.1 | T2V/I2V with references | Upgraded realistic motion, extended clip length, multi-image references, native 1080p with synced audio | |
| xAI Grok Imagine | xAI | T2V/I2V/T2I | Multimodal text/image to short video with coherent motion and synchronized audio; also text-to-image |
| Alibaba Wan 2.6 | Alibaba | Multi-shot T2V/I2V | Affordable 1080p with stable characters and native audio; reference-guided generation |
| MiniMax Hailuo 2.3 | MiniMax | High-fidelity T2V/I2V | Expressive characters, complex motion and lighting effects |
| Kling 2.6 | Kling | T2V/I2V with audio | Text/image to synchronized video with speech, ambient sound, and effects; strong audio-visual coherence |
Top Image Generation Models
| Model | Provider | Capabilities | Key Features |
|---|---|---|---|
| Black Forest Labs FLUX.2 | BFL | T2I with control | Photoreal images, multi-reference consistency, accurate text rendering, flexible control |
| Google Nano Banana Pro | High-res T2I | Sharper 2K output, intelligent 4K scaling, improved text rendering, enhanced character consistency |
Using These Models
Access these models through NodeTool’s generic nodes:
- For Video: Use
nodetool.video.TextToVideoornodetool.video.ImageToVideo - For Images: Use
nodetool.image.TextToImage - Select Provider: Click the model dropdown in the node properties
- Configure API: Add provider API keys in
Settings → Providers
Direct NodeTool API Key Support:
- OpenAI Sora 2 Pro:
OPENAI_API_KEY - Google Veo 3.1:
GEMINI_API_KEY - MiniMax Hailuo 2.3:
MINIMAX_API_KEY
Access via kie.ai (recommended for other models):
- xAI Grok Imagine, Alibaba Wan 2.6, Kling 2.6, Black Forest Labs FLUX.2, Google Nano Banana Pro
- Configure using
KIE_API_KEYinSettings → Providers
Cost Considerations: Cloud models typically charge per generation. Check each provider’s pricing before extensive use. Local models are free after download but require capable hardware.
Alternative Access: Many of these models are available through kie.ai, an AI provider aggregator that often offers competitive or lower pricing compared to upstream providers. For models without direct NodeTool API key support (xAI, Alibaba, Kling), kie.ai is the recommended access method.
Getting Started
Option 1: Start with Local Models (Recommended)
- Open Models → Model Manager in NodeTool
- Install these starter models:
- GPT-OSS (~4 GB) – Text generation and chat
- Flux (~12 GB) – High-quality image generation
- Wait for downloads to complete
- Run templates – they’ll work offline!
Option 2: Start with Cloud Providers
- Get an API key from a provider:
- In NodeTool, go to Settings → Providers
- Paste your API key
- Select the provider when using AI nodes
Understanding Model Names
Model names can look confusing, but they follow patterns:
| Name Part | Meaning | Example |
|---|---|---|
| Base name | The model family | GPT, Llama, Flux |
| Number | Version/size | GPT-4, Llama-3 |
| Size indicator | Capability level | mini, small, large |
| Quantization | Compression level | Q4, Q8 (lower = smaller file) |
Example: llama-3-8b-instruct-Q4 means:
- Llama version 3
- 8 billion parameters (medium size)
- Instruction-tuned (follows directions well)
- Q4 quantization (compressed to save space)
Don’t memorize this – NodeTool’s Model Manager shows compatible models for each task.
Detailed Guides
General
- Models Manager – Download and manage AI models
- Getting Started – First workflow
Local AI
- Supported Models – List of local models (llama.cpp, MLX, Whisper, Flux)
Cloud AI
- Providers Guide – Set up OpenAI, Anthropic, Google
- HuggingFace Integration – Access 500,000+ models
Advanced
- Proxy & Self-Hosted – Secure deployments
- Deployment Guide – Cloud infrastructure
Quick Reference: Common Tasks
“I want to generate text”
- Local: Install GPT-OSS or Llama model
- Cloud: Use OpenAI GPT-4 or Anthropic Claude
“I want to create images”
- Local: Install Flux or Stable Diffusion
- Cloud: Use OpenAI DALL-E or Fal.ai
“I want to transcribe audio”
- Local: Install Whisper (recommended for privacy)
- Cloud: Use OpenAI Whisper API
“I want to understand images”
- Local: Install a Vision model (Llava, Qwen-VL)
- Cloud: Use GPT-4 Vision or Claude with images
FAQ
Q: Do I need a powerful computer?
A: For local models, a GPU helps but isn’t required. Cloud providers work on any computer.
Q: How much do cloud models cost?
A: Typically $0.001-0.03 per task. Most providers offer free credits.
Q: Can I switch models later?
A: Yes. Use the Model button on any AI node to change models without rebuilding the workflow.
Q: Local or cloud?
A: Depends on your needs. Try both.