We need to have some standardised tests for what this LLM can and cannot do, and make the relevant choices. This issue should be used as a discussion for what websites are best for testing this package.
- Auth websites
- Long/short HTML websites
- any others?