Commit Graph

  • e0abb576ae remove requirements_common Félix Marty 2023-11-23 15:31:34 +0100
  • 5e6356793e Merge branch 'main' into tgi-rocm Félix Marty 2023-11-23 15:21:38 +0100
  • c6bb76703f
    Fix IDEFICS dtype (#1214) Vince Jankovics 2023-11-23 14:00:09 +0000
  • a8da815b81 Fixing Idefics dtype. Nicolas Patry 2023-11-23 13:57:28 +0000
  • 861acdeab1 Revert "Idefics force bfloat16" Nicolas Patry 2023-11-23 13:57:02 +0000
  • a11e4c06c3 Merge branch 'tgi-rocm' of https://github.com/huggingface/text-generation-inference into tgi-rocm Felix Marty 2023-11-23 13:01:24 +0000
  • bdb6c9d1ed fix bug on cuda build Felix Marty 2023-11-23 13:01:19 +0000
  • 35509ff5de
    chore: update to torch 2.1.0 (#1182) OlivierDehaene 2023-11-23 13:38:50 +0100
  • e12c34bd25
    Load PEFT weights from local directory (#1260) Traun Leyden 2023-11-23 12:56:17 +0100
  • b5790224a5
    Apply suggestions from code review Nicolas Patry 2023-11-23 12:55:18 +0100
  • 3126055e3d Merge branch 'main' into tgi-rocm Félix Marty 2023-11-23 12:48:31 +0100
  • 91111a0dc2
    Fix missing trust_remote_code flag for AutoTokenizer in utils.peft (#1270) Diwank Singh Tomer 2023-11-23 03:41:05 -0800
  • e5bf35435c conda install OlivierDehaene 2023-11-23 12:30:26 +0100
  • 289eee535e 12.1 OlivierDehaene 2023-11-23 11:28:21 +0100
  • a9348eb5e0 update deps Shannon Phu 2023-11-22 17:49:36 +0000
  • 703302ae19 python3.10 OlivierDehaene 2023-11-22 17:22:15 +0100
  • 217abbe95a remove double torch install OlivierDehaene 2023-11-22 16:32:39 +0100
  • 902fd945a4 revert to 11.8 OlivierDehaene 2023-11-22 15:30:34 +0100
  • 2fd1156d5a tp optims HuaYZhao 2023-11-22 19:58:40 +0800
  • dad29f7299 tp optims HuaYZhao 2023-11-22 17:22:47 +0800
  • 1503de86ac limit concurrency OlivierDehaene 2023-11-21 17:39:02 +0100
  • ebc0a7152b WIP: Added custom_logits_processors API and connecting it to flow Noam Gat 2023-11-21 17:43:47 +0200
  • 4ca2c5c945 WIP: Adding logits processors to protocol Noam Gat 2023-11-21 16:56:24 +0200
  • b226e469c9
    Update README.md (#1272) Nicolas Patry 2023-11-21 10:39:18 +0100
  • d1460e6b4f
    Update README.md Nicolas Patry 2023-11-21 10:38:14 +0100
  • abb498a907 pray OlivierDehaene 2023-11-20 19:03:32 +0100
  • 2e75027187 update to 12.1 OlivierDehaene 2023-11-20 14:21:09 +0100
  • a99ee04348 add shm_size OlivierDehaene 2023-11-20 12:16:24 +0100
  • 3e4ab6c553 chore: update to torch 2.1.0 OlivierDehaene 2023-10-20 09:45:29 +0200
  • 1b0236cb3c github.job is null only available within steps Felix Marty 2023-11-20 10:38:41 +0000
  • bb638fe559 remove broken cosign Felix Marty 2023-11-20 10:28:50 +0000
  • b6e5c28dff +rocm as a docker tag seems illegal Felix Marty 2023-11-20 10:24:20 +0000
  • 2a644b5397 is concurrency the reason for cancelled job? Felix Marty 2023-11-20 10:18:34 +0000
  • 7052827be4 Merge branch 'main' into tgi-rocm Félix Marty 2023-11-20 10:51:19 +0100
  • 3dbc649b11
    fix: do not leak inputs on error (#1228) OlivierDehaene 2023-11-20 10:33:44 +0100
  • b1581ccb55
    Fix missing trust_remote_code flag for AutoTokenizer in utils.peft Diwank Singh Tomer 2023-11-19 18:44:34 -0800
  • 2679613d22 add support for CUDA 118 (suports H100s) Andres Restrepo 2023-11-18 06:10:20 +0000
  • eb0e93789d update doc Felix Marty 2023-11-16 17:36:24 +0000
  • 8acdc1fae7 hotfix 1.1.1 v1.1.1 OlivierDehaene 2023-11-16 18:35:09 +0100
  • ddf0c85836 Merge branch 'tgi-rocm' of https://github.com/huggingface/text-generation-inference into tgi-rocm Felix Marty 2023-11-16 17:30:01 +0000
  • 736e199737 use modified vllm rope kernel when using rocm Felix Marty 2023-11-16 17:29:27 +0000
  • 8e07a2487e
    Update .github/workflows/build.yaml fxmarty 2023-11-17 01:19:34 +0900
  • 457e72c386 v1.1.1 OlivierDehaene 2023-11-16 13:54:58 +0100
  • 6a25c14f46 Load local PEFT model Traun Leyden 2023-11-14 15:22:03 +0100
  • 223cd4954f Revert "Load local peft model and add print stmts" Traun Leyden 2023-11-14 15:20:12 +0100
  • 52c0e0e53b add image build workflow step Felix Marty 2023-11-14 13:53:00 +0000
  • 7903ad72a8 Merge branch 'main' into tgi-rocm Felix Marty 2023-11-14 13:37:06 +0000
  • 3be198c698 Load local peft model and add print stmts Traun Leyden 2023-11-14 12:46:53 +0100
  • a992084b9b fix wrong refactor of rope Felix Marty 2023-11-14 10:43:02 +0000
  • b7a825d0b5
    Introduce basic helm chart Wilfried Roset 2023-11-09 21:19:45 +0100
  • 8617d4795a move controlflow in forward Felix Marty 2023-11-09 09:38:32 +0000
  • cdd293f227
    Merge 84dd432a43 into a5def7c222 Florian Zimmermeister 2023-11-09 01:41:34 +0900
  • a5def7c222
    Fix link in quantization guide (#1246) Omar Sanseviero 2023-11-08 10:34:38 -0600
  • ed53f202ff
    Fix link in quantization guide Omar Sanseviero 2023-11-08 09:06:34 -0600
  • 80ce8910f1 update on review Felix Marty 2023-11-08 11:01:10 +0000
  • 6353a87f4c
    Update server/text_generation_server/models/custom_modeling/idefics_modeling.py fxmarty 2023-11-08 19:09:30 +0900
  • 0eea83be44
    Update server/text_generation_server/models/custom_modeling/flash_mistral_modeling.py fxmarty 2023-11-08 19:07:58 +0900
  • 891fe74099
    Update server/text_generation_server/models/custom_modeling/flash_mistral_modeling.py fxmarty 2023-11-08 19:07:45 +0900
  • f746b8e0ae
    Update server/text_generation_server/models/custom_modeling/flash_llama_modeling.py fxmarty 2023-11-08 19:06:59 +0900
  • 0f7b9bba33 fix tests Felix Marty 2023-11-07 15:12:33 +0000
  • 2446928768 cleaning Felix Marty 2023-11-07 15:01:30 +0000
  • ea8438a5a0 clean rocm support Felix Marty 2023-11-07 14:56:11 +0000
  • 7b5c167487
    Update README.md (#1242) Nicolas Patry 2023-11-07 10:24:53 +0100
  • 3229f8b6a8
    Update README.md Nicolas Patry 2023-11-07 10:24:33 +0100
  • b9184093d9
    Narsil patch 1 (#1241) Nicolas Patry 2023-11-07 10:13:09 +0100
  • 07f3f37d53
    Update README.md Nicolas Patry 2023-11-07 10:03:58 +0100
  • bfe43d9cc4
    Update README.md Nicolas Patry 2023-11-07 10:03:16 +0100
  • 794770d191
    Update README.md Nicolas Patry 2023-11-07 10:02:51 +0100
  • 323a60c050
    Update README.md Nicolas Patry 2023-11-07 10:01:02 +0100
  • 6dfb772029
    Update README.md Nicolas Patry 2023-11-07 10:00:13 +0100
  • 1d3b7ec4a5
    Update README.md Nicolas Patry 2023-11-07 09:56:31 +0100
  • c9da3fe409
    Update README.md Nicolas Patry 2023-11-07 09:55:58 +0100
  • ec994ac0c9
    Update README.md Nicolas Patry 2023-11-07 09:55:37 +0100
  • 065cd904c9
    Update README.md Nicolas Patry 2023-11-07 09:54:57 +0100
  • fe31ff8f16
    Update README.md Nicolas Patry 2023-11-07 09:54:26 +0100
  • eb59503f8f
    Update README.md Nicolas Patry 2023-11-07 09:50:48 +0100
  • fca3899390
    Update README.md Nicolas Patry 2023-11-07 09:47:04 +0100
  • 686c7fcd79
    Update README.md Nicolas Patry 2023-11-07 09:43:21 +0100
  • bfc4500846
    Update README.md Nicolas Patry 2023-11-07 09:40:46 +0100
  • b7723a727f
    Update README.md Nicolas Patry 2023-11-07 09:39:35 +0100
  • 74618bce15
    Update README.md Nicolas Patry 2023-11-07 09:38:32 +0100
  • 414a911b34
    Adding the video -> moving the architecture picture lower (#1239) Nicolas Patry 2023-11-07 01:01:40 +0100
  • 1fdfaf89c6 Adding the video -> moving the architecture picture lower Nicolas Patry 2023-11-06 15:13:50 +0100
  • ca9fc44cb5 fix: do not leak inputs on error OlivierDehaene 2023-11-03 14:42:00 +0100
  • 2200b7e6c4 docs: set tag to 1.1. Pedro Cuenca 2023-11-02 15:12:03 +0100
  • e8ed11be3e Client modified to accept new string. Suvro Ghosh 2023-11-01 11:15:07 -0400
  • b8952b2b32 Idefics force bfloat16 Vince Jankovics 2023-10-31 12:51:13 +0000
  • 897ed20842 Included model_id in response. Suvro Ghosh 2023-10-30 19:23:19 -0400
  • 52bdcf797d working rocm build Felix Marty 2023-10-30 10:42:34 +0000
  • b90660ddbc tutorials: fix links Stefan Schweter 2023-10-26 23:38:02 +0200
  • 330e506e88
    Improvement: Update Dockerfile Lukas Pfahler 2023-10-25 13:07:04 +0200
  • 84dd432a43
    use exllamav2QuantLinear instead of exllama1 Florian Zimmermeister 2023-10-25 12:26:20 +0200
  • 6a0a378c0c
    Merge branch 'huggingface:main' into main Florian Zimmermeister 2023-10-25 12:18:33 +0200
  • 96a982ad8f fix: better warmup error OlivierDehaene 2023-10-25 10:18:58 +0200
  • f9910d13e2
    feat: remove flume (#1184) OlivierDehaene 2023-10-23 15:51:12 +0200
  • b2a5dd64c1 back to atomicbool OlivierDehaene 2023-10-23 12:47:16 +0200
  • 3f1cc9bad7 add logs OlivierDehaene 2023-10-20 17:53:27 +0200
  • f40f02fc25 remove atomicbool OlivierDehaene 2023-10-20 17:39:28 +0200
  • 2c3772528d fix tests OlivierDehaene 2023-10-20 17:15:33 +0200
  • d2f7751d39 remove flume OlivierDehaene 2023-10-20 16:55:17 +0200