Hi NVIDIA team,
I’m a developer building a personal AI assistant (Hermes Agent) that integrates NVIDIA NIM API as a free inference provider. I’m currently using models like DeepSeek V4 Pro/Flash and Llama 4 Maverick for various tasks including code generation, data analysis, and natural language processing.
Current situation:
-
Account email: [somnia1130@icloud.com]
-
API Key: [the key ending in …Uw0J]
-
Current rate limit: 40 RPM
-
Models used: deepseek-ai/deepseek-v4-pro, deepseek-ai/deepseek-v4-flash, meta/llama-4-maverick-17b-128e-instruct
Why I need 200 RPM:
-
My AI agent runs automated tasks (cron jobs) that require parallel API calls for food nutrition database verification, daily health tracking summaries, and scheduled reminders
-
During development and testing, I frequently hit the 40 RPM limit when running batch operations
-
The agent also performs real-time tasks where response speed matters, and rate limiting causes delays
Use case: Personal productivity tool — automated meal tracking, health data analysis, and AI-powered daily briefings. All data stays local, no commercial use.
I would greatly appreciate it if you could increase my rate limit to 200 RPM. This would allow me to fully utilize the NVIDIA NIM platform for my personal AI development projects.
Thank you for providing such an amazing free API service!
Best regards,
Joe