Commit Graph

  • ed024ed433 add integration test OlivierDehaene 2023-09-27 19:40:23 +0200
  • 5b1aaeceb2 update flash OlivierDehaene 2023-09-27 19:17:39 +0200
  • 630e417ca0 add window size in proto OlivierDehaene 2023-09-27 12:20:20 +0200
  • 2811ec9bff wip OlivierDehaene 2023-09-27 10:38:46 +0200
  • 259a230028 Automatic docs for TGI (#1045) Merve Noyan 2023-09-27 16:01:38 +0200
  • 397726c9df Adding a title Nicolas Patry 2023-09-27 13:53:27 +0000
  • 97bac44f39 Adding it to the octree Nicolas Patry 2023-09-27 12:09:09 +0000
  • 82213c9d59 Remove custom env value. Nicolas Patry 2023-09-27 10:58:05 +0000
  • ae0775084a Should be green now. Nicolas Patry 2023-09-27 10:52:07 +0000
  • c09eb2f8bf Adding the script Nicolas Patry 2023-09-27 10:43:12 +0000
  • 9a3cb6fba8 Update to use simpler python script. Nicolas Patry 2023-09-27 10:41:30 +0000
  • 137a5d28c2 Updat Nicolas Patry 2023-09-27 10:40:44 +0000
  • 1e3ec3c91f Complete FastLinear.load parameters in OPTDecoder initialization (#1060) zhangsibo1129 2023-09-27 18:25:59 +0800
  • 47954b81e9 feat: format code (#1070) OlivierDehaene 2023-09-27 12:22:09 +0200
  • 07ad048277 feat: format code OlivierDehaene 2023-09-27 12:21:30 +0200
  • b32e9ce9d5 Remove the stripping of the prefix space (and any other mangling that tokenizers might do). (#1065) Nicolas Patry 2023-09-27 12:13:45 +0200
  • 95a4bb696a Support eetq weight only quantization (#1068) Nicolas Patry 2023-09-27 11:42:57 +0200
  • 3cf9093b97 Preserve newlines. Nicolas Patry 2023-09-27 09:33:20 +0000
  • bee42de54b Remove git port. Nicolas Patry 2023-09-27 09:21:14 +0000
  • efd0ce3cba Fix. Nicolas Patry 2023-09-27 09:18:30 +0000
  • 6de3b949d8 Update the script. Nicolas Patry 2023-09-27 09:13:32 +0000
  • 6bc580d5bb Update autodocs.yml Merve Noyan 2023-09-22 11:51:53 +0200
  • 57ee8c707b Update autodocs.yml Merve Noyan 2023-09-22 11:49:49 +0200
  • 0fade03308 Update autodocs.yml Merve Noyan 2023-09-22 11:47:50 +0200
  • a6b053845f Update autodocs.yml Merve Noyan 2023-09-22 11:36:08 +0200
  • 4f9a3287b1 Update autodocs.yml Merve Noyan 2023-09-22 11:30:21 +0200
  • 073d8528d4 Update autodocs.yml Merve Noyan 2023-09-22 11:29:20 +0200
  • ff13ab8e67 Update autodocs.yml Merve Noyan 2023-09-22 11:17:25 +0200
  • ee4c029d9f Update autodocs.yml Merve Noyan 2023-09-22 11:16:02 +0200
  • e34cdba4d7 Update autodocs.yml Merve Noyan 2023-09-22 11:12:50 +0200
  • 87e9078634 Update autodocs.yml Merve Noyan 2023-09-22 11:10:45 +0200
  • ae6c25852c Update autodocs.yml Merve Noyan 2023-09-22 11:07:12 +0200
  • 2f11bacf03 Update autodocs.yml Merve Noyan 2023-09-22 11:00:12 +0200
  • 41be28b9d6 Update autodocs.yml Merve Noyan 2023-09-22 10:54:50 +0200
  • cd1c674099 Update launcher.md Merve Noyan 2023-09-22 10:31:17 +0200
  • 59bb9042a5 Purposefully failing Merve Noyan 2023-09-22 10:30:46 +0200
  • d2babff311 Added token Merve Noyan 2023-09-21 19:21:27 +0200
  • a0e319d0ae Update autodocs.yml Merve Noyan 2023-09-21 19:08:25 +0200
  • 14f59ae2ea Update paths Merve Noyan 2023-09-21 19:04:38 +0200
  • d13d61fbaf Update launcher.md Merve Noyan 2023-09-21 19:03:31 +0200
  • 2d16cea90d Update launcher.md Merve Noyan 2023-09-21 19:01:29 +0200
  • 5118f9d41f Update autodocs.yml Merve Noyan 2023-09-21 18:45:53 +0200
  • 85bc6c1f3f Update autodocs.yml Merve Noyan 2023-09-21 18:33:51 +0200
  • c646d74ba7 Update autodocs.yml Merve Noyan 2023-09-21 18:30:30 +0200
  • 0ae1aaff82 Simplified Merve Noyan 2023-09-21 18:26:38 +0200
  • 6572edc5ba Minor fix in path Merve Noyan 2023-09-21 17:54:10 +0200
  • 10a88586a6 missing fi Merve Noyan 2023-09-21 17:51:17 +0200
  • 95d5302960 Update autodocs.yml Merve Noyan 2023-09-21 17:47:40 +0200
  • 99c937337f Update autodocs.yml Merve Noyan 2023-09-21 17:44:29 +0200
  • e74ce5a70c Added check for initial case where launcher doesn't exist in main repo Merve Noyan 2023-09-21 17:36:27 +0200
  • 75e49b8abd Added check for file not existing Merve Noyan 2023-09-21 17:26:15 +0200
  • 048d77da7a Create empty launcher.md as a test Merve Noyan 2023-09-21 17:21:27 +0200
  • 87c076e1c1 Update autodocs.yml Merve Noyan 2023-09-21 17:16:09 +0200
  • 7384e06dcd Update autodocs.yml Merve Noyan 2023-09-21 17:04:10 +0200
  • 3ab4c199cd Update autodocs.yml Merve Noyan 2023-09-21 16:59:55 +0200
  • d79760aaa2 Update autodocs.yml Merve Noyan 2023-09-21 16:57:03 +0200
  • adfeec63f1 Check if content is updated Merve Noyan 2023-09-21 14:16:14 +0200
  • 0e884aa267 installation Merve Noyan 2023-09-21 13:48:46 +0200
  • ea458581ee Remove if merged for now Merve Noyan 2023-09-21 13:13:24 +0200
  • a0844641d5 trigger action Merve Noyan 2023-09-21 13:11:53 +0200
  • ebfea7205c Automatic docs for TGI Merve Noyan 2023-09-21 13:09:12 +0200
  • 36c2868853 Added note on weight-cache-override (#994) Merve Noyan 2023-09-27 11:06:07 +0200
  • e93705e272 Fmt. Nicolas Patry 2023-09-27 09:05:39 +0000
  • bbafcee44d Update the tests with the breaking change. Nicolas Patry 2023-09-27 09:04:16 +0000
  • 086d62dbe3 Put back Cargo.lock... Nicolas Patry 2023-09-27 08:47:04 +0000
  • 085c43243d Putting the deprecation notice on bnb (8bit). Nicolas Patry 2023-09-27 08:39:40 +0000
  • 2d13b6ff6c Support weight only quantization zhaosida 2023-09-27 10:33:55 +0800
  • a049864270 Preping 1.1.0 (#1066) Nicolas Patry 2023-09-27 10:40:18 +0200
  • f29af8f38e Support weight only quantization zhaosida 2023-09-27 10:33:55 +0800
  • 45bf7597ac dtype default to None instead of float16 and each model could set it's default type according to the platform Wang, Yi A 2023-09-21 19:20:41 -0700
  • e0df456edb Fmt. Nicolas Patry 2023-09-26 20:24:08 +0000
  • 853f09035c Preping 1.1.0 Nicolas Patry 2023-09-26 20:04:38 +0000
  • 6378303d23 Revert test change. Nicolas Patry 2023-09-26 16:46:06 +0200
  • 8a16c48595 Ignoring special tokens + updating 1 test case. Nicolas Patry 2023-09-26 16:35:53 +0200
  • 8672cad2cb Fix top_n_tokens returning non-log probs for some models (#1023) Vincent Brouwers 2023-09-26 16:16:43 +0200
  • 76d5bbb0aa Remove the stripping of the prefix space (and any other mangling that tokenizers might do). Nicolas Patry 2023-09-26 14:12:25 +0000
  • 1fff6746ab Fix position ids logic instantiation of idefics vision part (#1064) Victor SANH 2023-09-26 15:41:15 +0200
  • ae623b8d2d Install curl to be able to perform more advanced healthchecks (#1033) oOraph 2023-09-26 15:23:47 +0200
  • bf2b92217f Apply suggestions from code review Nicolas Patry 2023-09-26 15:07:38 +0200
  • eba6ab1c5d fix discard_names bug in safetensors convertion (#1052) zhangsibo1129 2023-09-26 21:05:40 +0800
  • 9c0f679d1d Simpler fix. Nicolas Patry 2023-09-26 13:03:45 +0000
  • edc95a0e7d support local model config file (#1058) zhangsibo1129 2023-09-26 20:57:53 +0800
  • 1053e5d09a Apply suggestions from code review Nicolas Patry 2023-09-26 14:36:08 +0200
  • 5a6c5725ed Fix position ids logic instantiation of idefics vision part Victor SANH 2023-09-26 14:22:23 +0200
  • 57433201b2 Fix shared weights load bug and T5 loading zhangsibo1129 2023-09-26 17:57:59 +0800
  • 2f51645ad7 Fix GQA llama + AWQ (#1061) Nicolas Patry 2023-09-26 08:27:50 +0200
  • 1ab173a260 Fix GQA llama + AWQ Nicolas Patry 2023-09-26 06:26:23 +0000
  • 99da7ce121 Complete OPT load parameters zhangsibo1129 2023-09-26 13:41:29 +0800
  • 7d0aaede63 Complete OPTDecoder load parameters zhangsibo1129 2023-09-26 13:24:05 +0800
  • 5f76dae04a support local model config file zhangsibo1129 2023-09-26 10:50:24 +0800
  • c5de7cd886 Add AWQ quantization inference support (#1019) (#1054) Nicolas Patry 2023-09-25 15:31:27 +0200
  • e27438aac0 Fix dockerfile. Nicolas Patry 2023-09-25 12:46:26 +0000
  • fef36cea42 Fixing t5 loading. (#1042) Nicolas Patry 2023-09-25 12:22:28 +0200
  • 97292ec21c Fix and test sharded version. Nicolas Patry 2023-09-25 10:21:46 +0000
  • cbf047b4ae Support TheBloke exported models. Nicolas Patry 2023-09-25 10:02:49 +0000
  • 2d8c034df3 Adding target list. Nicolas Patry 2023-09-25 09:58:52 +0000
  • ce8eaaf2be Better fix. Nicolas Patry 2023-09-25 09:50:42 +0000
  • 4a29074291 Update dockerfile with new build. Nicolas Patry 2023-09-25 09:41:25 +0000
  • 02d4f62a1f Make awq install optional + integration tests values. Nicolas Patry 2023-09-25 09:19:12 +0000
  • a8f870aa75 Change deploy. Nicolas Patry 2023-09-25 09:01:07 +0000