Commit Graph

  • fa8a8e05af
    fix(router): fix openapi and add jsonschema validation (#1578) OlivierDehaene 2024-02-21 11:05:32 +0100
  • f8b54b78d7 remove unused files OlivierDehaene 2024-02-21 10:27:35 +0100
  • befca7a9a3 fix snapshot OlivierDehaene 2024-02-21 10:19:12 +0100
  • 5c189c92ac Move gcs functions file in util folder. dstnluong-google 2024-02-20 23:25:57 +0000
  • 064c110123 feat: Support loading models from GCS dstnluong-google 2024-02-20 23:16:30 +0000
  • bd3d0621c5 fix: adjust fmt issue drbh 2024-02-20 14:49:48 -0500
  • 2d79e7744a fix: prefer name completion over fill in middle drbh 2024-02-20 14:42:31 -0500
  • 10621a30be fix: improve import readability drbh 2024-02-20 13:21:35 -0500
  • 3d71a44f52 fix: add back imports after rebase drbh 2024-02-20 13:15:20 -0500
  • 6a0f737cb6 feat: support fill in the middle templates drbh 2024-02-20 14:52:57 +0000
  • 0fb864ef44 feat: rebase latest changes drbh 2024-02-19 17:40:46 +0000
  • fa9aad3ec4 feat: return expected types and append suffix drbh 2024-02-06 10:31:43 -0500
  • cade8dbc2b feat: accept legacy request format and response drbh 2024-02-02 09:57:31 -0500
  • dc049e8a7b fix: skip serializing if empty and remove log drbh 2024-02-20 15:49:39 +0000
  • c9f4c1af31
    fix: refactor syntax to correctly include structs (#1580) drbh 2024-02-20 10:38:35 -0500
  • 3fe6c45132 fix: refactor syntax to correctly include VertexInstance drbh 2024-02-20 15:26:44 +0000
  • e0c9441a86 fix: avoid default message drbh 2024-02-20 15:22:19 +0000
  • 1ab935c72c
    Merge branch 'main' into fix/openapi OlivierDehaene 2024-02-20 16:13:34 +0100
  • 767d21cbf8 add integration tests OlivierDehaene 2024-02-20 16:09:21 +0100
  • 5da4b01cc5 rename OlivierDehaene 2024-02-20 15:43:44 +0100
  • f4eec092c6 fix OlivierDehaene 2024-01-25 19:34:14 +0100
  • 885591acdb feat: add support for golden gate OlivierDehaene 2024-01-23 10:30:17 +0100
  • 19dd0a8fde parse string to json OlivierDehaene 2024-02-20 15:16:02 +0100
  • bada345055 ??? OlivierDehaene 2024-02-20 14:55:30 +0100
  • df23062574
    improve endpoint support (#1577) drbh 2024-02-20 08:04:51 -0500
  • 0533e67ea6 use Draft202012 OlivierDehaene 2024-02-19 17:33:23 +0100
  • 111a3f6809 feat: improve syntax and conditional openapi docs drbh 2024-02-19 16:16:41 +0000
  • 2251f5b16f fix(router): fix openapi and add jsonschema validation OlivierDehaene 2024-02-19 16:56:09 +0100
  • d19c768cb8
    Fix mistral with length > window_size for long prefills (rotary doesn't create long enough cos, sin). (#1571) Nicolas Patry 2024-02-19 15:23:12 +0100
  • 384b4eaec4 fix: adjust typo and incorrect function drbh 2024-01-26 14:30:21 -0500
  • 7513c057db fix: prefer defaults, only overwrite if set and print log of feature drbh 2024-01-26 12:39:53 -0500
  • 7abb5ae4fa fix: dynamically set routes when using google feature drbh 2024-01-26 11:49:05 -0500
  • 5e38c4bfda feat: support google feature and read env vars drbh 2024-01-17 18:57:43 -0500
  • f4fd89b224 feat: support vertex api drbh 2024-01-16 15:05:44 -0500
  • c74c7fc65b
    fixes fatal error in generate() Simon Stone 2024-02-19 09:08:54 -0500
  • b189342170 Let it work. Nicolas Patry 2024-02-19 10:06:56 +0000
  • a4d3a00d98
    Fix dependencies (#56) regisss 2024-02-19 10:19:23 +0100
  • dca9ac6508 Revert "Solve dependency issue" regisss 2024-02-19 07:28:04 +0000
  • ea2b93dd75 Solve dependency issue regisss 2024-02-19 07:26:37 +0000
  • 2060bb58bf
    Fix trust remote code (#55) regisss 2024-02-19 07:53:24 +0100
  • 2804a74276 Fix mistral with length > window_size for long prefills (rotary doesn't create long enough cos, sin). Nicolas Patry 2024-02-16 18:08:02 +0000
  • 4139054b82
    v1.4.1 (#1568) v1.4.1 OlivierDehaene 2024-02-16 17:50:57 +0100
  • 346bc74acd fix OlivierDehaene 2024-02-16 17:18:15 +0100
  • a337182b43 rollback to torch 2.1 OlivierDehaene 2024-02-16 16:40:16 +0100
  • af23c432e8 remove torch from requirements.txts and re-format OlivierDehaene 2024-02-16 11:24:46 +0100
  • 14b5ff0b82 v1.4.1 OlivierDehaene 2024-02-16 11:03:31 +0100
  • 0f2daad8b9
    feat: add chat template struct to avoid tuple ordering errors (#1570) OlivierDehaene 2024-02-16 16:37:32 +0100
  • dc10a89434 feat: add chat template struct to avoid tuple ordering errors OlivierDehaene 2024-02-16 12:01:21 +0100
  • 9946165ee0
    chore: add pre-commit (#1569) OlivierDehaene 2024-02-16 11:58:58 +0100
  • 142cdabed3
    Bugfix: eos and bos tokens positions are inconsistent (#1567) Aaron Mihalik 2024-02-16 05:44:04 -0500
  • 3281f37a7e fix OlivierDehaene 2024-02-16 11:41:32 +0100
  • 2af328f945 chore: add pre-commit OlivierDehaene 2024-02-16 11:33:57 +0100
  • 1e51fdc1a5
    Bugfix: eos and bos tokens positions are inconsistent Aaron Mihalik 2024-02-16 03:30:44 -0500
  • c55abac384
    Added name field to OpenAI compatible API Messages (#1563) Aaron Mihalik 2024-02-15 13:30:31 -0500
  • 5cdee2a591 Merge branch 'amihalik-update-chat-completion-messages' into ci-amihalik-update-chat-completion-messages ci-amihalik-update-chat-completion-messages drbh 2024-02-15 17:50:14 +0000
  • 1135de23d1
    fix: add name arg to tests drbh 2024-02-15 12:29:22 -0500
  • cef0553d59
    Outlines guided generation (#1539) drbh 2024-02-15 04:28:10 -0500
  • 9f67f4226b feat: added name field to openai message Aaron Mihalik 2024-02-14 16:43:30 -0500
  • 12c7aae3ff fix: add disable_grammar_support to docker_launcher args drbh 2024-02-14 17:49:08 +0000
  • c791187b02 feat: support spectulative decoding grammar advances drbh 2024-02-14 17:18:47 +0000
  • 63c52fb22d feat: prefer disable grammar drbh 2024-02-14 17:18:04 +0000
  • f0cdd9c8ea fix: remove unnecessary code, avoid copies and make deser safer drbh 2024-02-14 15:54:32 +0000
  • be7835475b fix: add missing grammar type typo drbh 2024-02-14 00:47:28 +0000
  • 3974e4dcaa fix: adjust next type typo drbh 2024-02-14 00:36:08 +0000
  • 8b13c47d63 fix: adjust type typo drbh 2024-02-14 00:31:50 +0000
  • d849641b28 feat: include grammar type in request, avoid alloc and improve proto types drbh 2024-02-14 00:03:56 +0000
  • 8b9430fb68 feat: improve client typing and update tests drbh 2024-02-13 23:59:56 +0000
  • 3df37fa941 feat: fix advance_grammar sig, add comment and move advance call drbh 2024-02-13 17:26:01 +0000
  • d39e45abc3 fix: adjust client and conftest for grammar drbh 2024-02-13 02:51:28 +0000
  • 5ba1baccb0 feat: add grammar_support to test launcher drbh 2024-02-13 01:36:47 +0000
  • 95e577b971 fix: add grammar_support to validation tests drbh 2024-02-13 00:50:45 +0000
  • 8f1651f16d fix: update docs drbh 2024-02-13 00:36:35 +0000
  • fe787d1361 feat: add --grammar-support cli flag and validation error drbh 2024-02-13 00:34:36 +0000
  • 91a114a490 feat: advance grammars in all models drbh 2024-02-13 00:33:36 +0000
  • 8f14019053 feat: improve grammar advance logic to avoid blocking GPU drbh 2024-02-13 00:01:19 +0000
  • d0d7cd9e92 feat: improve grammar init drbh 2024-02-12 18:42:09 +0000
  • 13e07b8257 feat: remove states from proto and simplify logit processor drbh 2024-02-12 17:33:30 +0000
  • a28ba7212c feat: address syntactically comments drbh 2024-02-12 11:08:35 -0500
  • f94fc831f4 feat: add grammar tests and typo tweaks drbh 2024-02-10 03:11:59 +0000
  • 74a586b457 fix: resolve typos and docs drbh 2024-02-10 02:32:47 +0000
  • 565f246fc7 fix: re update docs drbh 2024-02-10 02:21:14 +0000
  • 518ebfb13c fix: adjust docs drbh 2024-02-10 02:17:45 +0000
  • 8413cc45ec fix: adjust protos and router drbh 2024-02-10 02:17:28 +0000
  • ffc228831c fix: remove small unneeded changes drbh 2024-02-10 01:43:31 +0000
  • f1d43f2df4 fix: prefer grammar as logit processor drbh 2024-02-10 01:41:22 +0000
  • a1c630d5c1 fix: update grammar states after filter drbh 2024-02-09 17:40:19 +0000
  • ff6e8d9e23 feat: support grammars in batch drbh 2024-02-09 15:58:00 +0000
  • 8fd2664a3c feat: support other models and add fsm caching drbh 2024-02-08 19:56:16 +0000
  • 56e919e459 fix: update all NextTokenChoosers drbh 2024-02-08 11:33:00 -0500
  • 066d3d4872 feat: support JSON schemas drbh 2024-02-08 11:24:34 -0500
  • 0245506718 feat: support simple grammars via outlines drbh 2024-02-07 19:35:39 -0500
  • b013cb4f4a feat: first draft constraining generation via outlines drbh 2024-02-01 23:13:34 +0000
  • 4c2848b24b
    Small cleanup. (#1560) Nicolas Patry 2024-02-14 15:30:07 +0100
  • ab60d15962 Desperate attempt. Nicolas Patry 2024-02-14 10:27:21 +0000
  • 584c5fa0a0 Tailscale. Nicolas Patry 2024-02-14 10:22:36 +0000
  • 212e1cbcbe no sudo. Nicolas Patry 2024-02-14 10:19:46 +0000
  • ffa1804a34 .. Nicolas Patry 2024-02-14 10:16:24 +0000
  • 5b0befee43 Test. Nicolas Patry 2024-02-14 10:13:45 +0000
  • df91f105e8 Ofc. Nicolas Patry 2024-02-14 10:11:22 +0000
  • 7f0a816a22 Maybe XML wasn't so bad after all. Nicolas Patry 2024-02-14 10:10:28 +0000