Hi, Im trying to use this model https://huggingface.co/dphn/Dolphin-Mistral-24B-Venice-Edition-FP8, so my env variables look like this :
When I try to send a curl request Im getting this :
INFO 09-07 03:11:46 [init.py:244] Automatically detected platform cuda.
Traceback (most recent call last):
File "/src/handler.py", line 13, in
from utils import format_chat_prompt, create_error_response
File "/src/utils.py", line 3, in
from .models import ChatMessage, ErrorResponse
ImportError: attempted relative import with no known parent package
Any ideas how to fix it @pandyamarut ? Thanks you
edit : its not working even with no fp8 quantization with the model dphn/Dolphin-Mistral-24B-Venice-Edition
Hi, Im trying to use this model https://huggingface.co/dphn/Dolphin-Mistral-24B-Venice-Edition-FP8, so my env variables look like this :
When I try to send a curl request Im getting this :
INFO 09-07 03:11:46 [init.py:244] Automatically detected platform cuda.
Traceback (most recent call last):
File "/src/handler.py", line 13, in
from utils import format_chat_prompt, create_error_response
File "/src/utils.py", line 3, in
from .models import ChatMessage, ErrorResponse
ImportError: attempted relative import with no known parent package
Any ideas how to fix it @pandyamarut ? Thanks you
edit : its not working even with no fp8 quantization with the model dphn/Dolphin-Mistral-24B-Venice-Edition