- Fixed a bug where the app's memory usage kept increasing after switching models, i.e. the memory acquired by the previous model was not released when a different model was selected
- Aligned default inference parameters with those of the `llama` executable
UI Improvements:
- Chat message actions such as share, copy, and edit are now available in a dialog that appears when a message is long-pressed
- The query text in the search box is now preserved when a model is opened while browsing HuggingFace