Today on GitHub, there are new projects focusing on large language model optimization and Rails application tooling.
- KVSplit: Enables larger context windows and larger LLMs on Apple Silicon by quantizing the keys and values of the KV cache at different precisions, which reduces memory usage and may also improve inference speed.
- active_storage_dashboard: A dependency-free Rails engine that adds a dashboard for monitoring and inspecting Active Storage data in your application.
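
For a rough sense of why the KV cache quantization in the KVSplit entry saves memory, here is a back-of-the-envelope sketch. It is not KVSplit's code: the function `kv_cache_bytes`, the model dimensions, and the bit widths are illustrative assumptions, and the estimate ignores the per-block scale metadata that real quantization formats store.

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   seq_len: int, key_bits: int, value_bits: int) -> int:
    """Estimate KV cache size in bytes (hypothetical helper, not a KVSplit API).

    Each token stores one key vector and one value vector per layer,
    each with n_kv_heads * head_dim elements. Quantization overhead
    such as per-block scales is ignored.
    """
    elements_per_tensor = n_layers * seq_len * n_kv_heads * head_dim
    total_bits = elements_per_tensor * (key_bits + value_bits)
    return total_bits // 8


# Assumed dimensions for illustration only (not taken from any specific model).
dims = dict(n_layers=32, n_kv_heads=8, head_dim=128, seq_len=8192)

fp16 = kv_cache_bytes(**dims, key_bits=16, value_bits=16)
mixed = kv_cache_bytes(**dims, key_bits=8, value_bits=4)  # keys kept at higher precision

print(f"FP16 cache:  {fp16 / 2**20:.0f} MiB")
print(f"Mixed cache: {mixed / 2**20:.0f} MiB ({mixed / fp16:.1%} of FP16)")
```

Under these assumptions, memory scales linearly with the sum of key and value bit widths, so giving the two tensors different precisions lets you trade accuracy and memory for each one separately.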