Xuan-Son Nguyen
|
f5c6ae1827
|
mtmd, server: add "placeholder bitmap" for counting tokens , add */input_tokens API (#23913)
* mtmd: add "placeholder bitmap" for counting tokens w/o preprocessing
* fast path skip preproc for placeholder
* fix build
* correct the api
* add server endpoint + tests
* add object name
* update docs
* add proxy handling
* fix build
* fix audio input path
* use is_placeholder in process_mtmd_prompt()
* nits
* nits (2)
* docs: clarify chat/completions/input_tokens is not official
* fix merge problem
|
2026-06-06 11:06:51 +02:00 |
|
Xuan-Son Nguyen
|
1e64534570
|
mtmd: add clip_graph::build_mm() (#20751)
* clip: add build_mm()
* apply to all models
* add TODO for bias overload
|
2026-03-19 13:11:39 +01:00 |
|
Tarek Dakhran
|
c945aaaef2
|
mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876)
|
2026-01-16 11:23:08 +01:00 |
|
Xuan-Son Nguyen
|
8ea958d4d9
|
model : add ASR support for LFM2-Audio-1.5B (conformer) (#18106)
* ASR with LFM2-Audio-1.5B
* Set rope_theta
* Fix comment
* Remove rope_theta setting
* Address PR feedback
* rename functions to conformer
* remove some redundant ggml_cont
* fix missing tensor
* add prefix "a." for conv tensors
* remove redundant reshape
* clean up
* add test model
---------
Co-authored-by: Tarek Dakhran <tarek@liquid.ai>
|
2025-12-19 00:18:01 +01:00 |
|