Learning of an unsupervised Fine-Tuning
My goal with this experiment was to add new knowledge to a model with LoRA, and make it in an automated way. For the topic I’ve chosen a niche area of vintage computers so I’ve chosen to teach a model…
My goal with this experiment was to add new knowledge to a model with LoRA, and make it in an automated way. For the topic I’ve chosen a niche area of vintage computers so I’ve chosen to teach a model…
Since I have the 2nd 3090 I’m working on niche Fine-tuned model. As I’m generating the data or do training the machine is running all the time. I’m sharing 2 set of monitors between that and other computers, and it’s…
Last summer I decided to build an AI Workstation for myself with dual goals:1) Enable me to play with not just models but also learn about the various hardware configurations that are usually hidden from you if you’re using the…
In conventional application architecture, client-side components interact with backend services to deliver functionality and value to end-users. Application developers must possess a comprehensive understanding of the application programming interface (API) to facilitate this interaction. Moreover, the application must be adaptable…
I’ve experimented with the Orca2 model in the last few weeks, and the generation quality for the parameter size has truly impressed me. I believe that the Orca2 7B and 13B quantized models offer the optimal combination of quality and…
LM Studio provides a remarkable tool that empowers users to efficiently run quantized Large Language Models (LLMs) locally. I have identified three standout features: Apple Metal demonstrated a remarkable 2.8x improvement in tokens per second using the LLaMa CPP-based approach…