Account Email: 2323855978@qq.com
Last 4 digits: 9NUv
Current Limit: 40 RPM
Requested Limit: 200 RPM
Use Case:
I am developing a personal research prototype that evaluates the reasoning capabilities of large language models (specifically DeepSeek-V4-Pro) on multi-turn dialogue tasks and complex agentic workflows. This is a non-commercial, academic exploration project intended for my own learning and open-source contribution.
Why 40 RPM is insufficient:
My test suite consists of 500 predefined query–response pairs, each requiring 3–5 iterative follow‑up turns to assess contextual understanding.
· 500 queries × 4 turns on average = 2,000 API calls per full test run.
· At 40 RPM, a single run takes at least 50 minutes (2,000 ÷ 40).
· To debug and iterate, I typically need 3–4 runs per day, which means roughly 2.5 to 3.3 hours of waiting and severely slows down my development cycle.
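The arithmetic above can be checked with a short script. This is a minimal sketch: the function name `run_minutes` is illustrative, and it assumes an average of 4 turns per query and that the rate limit is the only bottleneck (model latency and retries are ignored).

```python
def run_minutes(queries: int, turns_per_query: int, rpm: int) -> float:
    """Minimum duration of one full test run, in minutes, at a given RPM limit."""
    total_calls = queries * turns_per_query
    return total_calls / rpm


QUERIES = 500          # size of the test suite
AVG_TURNS = 4          # assumed average of the 3-5 follow-up turns
RUNS_PER_DAY = (3, 4)  # typical number of debugging runs per day

for rpm in (40, 200):
    per_run = run_minutes(QUERIES, AVG_TURNS, rpm)
    daily = [per_run * n / 60 for n in RUNS_PER_DAY]
    print(f"{rpm} RPM: {per_run:.0f} min per run, "
          f"{daily[0]:.1f}-{daily[1]:.1f} h per day")
```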
Why 200 RPM is reasonable:
· 200 RPM translates to ~3.3 requests per second, which I believe is still a modest load for the free-tier shared infrastructure.
· It would compress my test run from 50 minutes to about 10 minutes, making rapid iteration feasible without overloading the system.
· I have implemented client-side retry logic and will strictly limit my concurrency to avoid flooding the endpoint.
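To illustrate the kind of client-side safeguards described above, here is a minimal single-threaded sketch of request pacing combined with retry and exponential backoff. It is not my exact implementation: `Pacer`, `call_with_retry`, and `RateLimitedError` are hypothetical names, and `send` stands for any callable that issues one API request and raises `RateLimitedError` when the server answers with a rate-limit response.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitedError(Exception):
    """Hypothetical error raised by the client wrapper on a rate-limit response."""


class Pacer:
    """Spaces successive calls at least 60/rpm seconds apart."""

    def __init__(self, rpm: int,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        self.interval = 60.0 / rpm
        self.clock = clock
        self.sleep = sleep
        self.next_slot = clock()

    def wait(self) -> None:
        # Sleep until the next free slot, then reserve the slot after it.
        delay = self.next_slot - self.clock()
        if delay > 0:
            self.sleep(delay)
        self.next_slot = max(self.next_slot, self.clock()) + self.interval


def call_with_retry(send: Callable[[], T], pacer: Pacer,
                    max_attempts: int = 5, base_delay: float = 1.0) -> T:
    """Issue one paced request, backing off exponentially on rate-limit errors."""
    for attempt in range(max_attempts):
        pacer.wait()
        try:
            return send()
        except RateLimitedError:
            if attempt == max_attempts - 1:
                raise
            # Exponential backoff plus jitter so retries do not arrive in bursts.
            pacer.sleep(base_delay * 2 ** attempt + random.uniform(0, base_delay))
    raise RuntimeError("max_attempts must be at least 1")
```

With `Pacer(200)`, requests are spaced 0.3 seconds apart, so the client stays at or below the requested limit even when the test suite itself could run faster.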
Commitment & Compliance:
· I will not use this for any production or commercial service.
· I will abide by all NVIDIA usage policies and will immediately stop any traffic that causes abnormal spikes.
· If needed, I am happy to share my project repository or usage logs for review.
I understand that this is a discretionary request, and I truly appreciate your consideration. Thank you for supporting the developer community.