
OpenAI stopped releasing information about their models after GPT-3, which was 175B, but the leaks and rumours that GPT-4 is an 8x220-billion-parameter model are most certainly correct. GPT-4o is likely a distilled 220B model. Other commercial offerings are going to be in the same ballpark. Comparing these to Llama 3 8B is like comparing a bicycle or a car to a train or cruise ship when you need to transport a few dozen passengers at best. There are local models in the 70B-240B range that are more than capable of competing with commercial offerings if you're willing to look at anything that isn't bleeding-edge state of the art.


Any pointers on where we can check the best local models per amount of VRAM available? I only have consumer-level cards available, but I would think something that just fits into a 24 GB card should noticeably outperform something scaled for an 8 GB card, yes?


LM Studio tells you which models fit in your available RAM, with or without quantization
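
For a rough sense of what "fits" means: weight memory is roughly parameter count times bits per weight divided by 8, plus some headroom for the KV cache and runtime buffers. A minimal back-of-envelope sketch (the 20% overhead factor is an assumption of mine, not LM Studio's actual calculation):

    # Rough estimate of whether a quantized model's weights fit in a given VRAM budget.
    # The 1.2x overhead for KV cache / activations is an assumed fudge factor.
    def fits_in_vram(params_billion: float, bits_per_weight: float, vram_gb: float,
                     overhead: float = 1.2) -> bool:
        weight_gb = params_billion * bits_per_weight / 8  # GB of weights alone
        return weight_gb * overhead <= vram_gb

    for params, bits in [(8, 4), (8, 8), (70, 4), (70, 8)]:
        for vram in (8, 24):
            print(f"{params}B @ {bits}-bit in {vram} GB: {fits_in_vram(params, bits, vram)}")

By that arithmetic an 8B model at 4-bit fits comfortably in 8 GB, while a 70B model needs roughly 40+ GB even at 4-bit, which is why the 24 GB consumer cards top out around the 30B class without offloading.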



