I can imagine a couple scenarios in which a high-quality, large model would be m...

		Wuzado 3 days ago \| parent \| context \| favorite \| on: Show HN: Llama 3.1 70B on a single RTX 3090 via NV... I can imagine a couple scenarios in which a high-quality, large model would be much preferred over lower latency models, primarily when you need the quality.

		help