python3 torchchat.py export llama3.1 --output-dso-path exportedModels/llama3.1.so Using device=cuda Setting max_seq_length to 300 for DSO export. Loading model ...