A new BitGN Sandbox test mode is now available!
Here, we simulate the operation of a personal agent that has full access to a specific user’s Obsidian Vault. This is not a competitive project at the moment but rather a technical experiment with infrastructure. There are only seven tasks, and some aspects, such as prompt injection attempts, are already present 🙂
You can use a Python-based personal agent for SGR as an example, connect to the platform, and run eval. Alternatively, you can utilize the SDK to implement a similar mechanism in your own programming language.
All necessary links are available at: https://api.bitgn.com
Can you try creating an agent that is resistant to hidden prompts or instructions?
Your colleague @llm_under_hood 🤗
In the future, we plan to add leaderboards, user profiles, and other familiar features.
Created with n8n:
https://cutt.ly/n8n
Created with syllaby:
https://cutt.ly/syllaby
