## Description
- Quantized the CLIP text encoder
- Moved preprocessing and postprocessing of face detection inside the
model
- Optimised the ONNX models more wherever possible
- Created a place in infra for ML version control of sorts
## Tests
Have tested the changes on mobile, but not on desktop. Please carefully
review the changes on desktop, especially regarding the face detection
post-processing, more specifically the image (re-)size correction.