Commit Graph

  • 2f7473b454
    Update README.md Adarsh Shirawalmath 2023-08-15 21:17:22 +0530
  • defc477a03 Hardcode remove image tokens. Nicolas Patry 2023-08-15 15:24:36 +0000
  • 51348713d6 More import sanitation. Nicolas Patry 2023-08-15 15:09:19 +0000
  • 4ff509948a Load later to make tests pass. Nicolas Patry 2023-08-15 14:50:35 +0000
  • 5469316ed8 Fix specific transformers version. Nicolas Patry 2023-08-15 14:34:19 +0000
  • bfc998eb9d Requirements add Pillow. Nicolas Patry 2023-08-15 13:10:15 +0000
  • c428add692 Don't profile by default. Nicolas Patry 2023-08-15 12:49:44 +0000
  • 8030d66f0d Further speedup. Nicolas Patry 2023-08-15 12:47:02 +0000
  • fc02d99e57 Much better code with correct RMS + rotary: Nicolas Patry 2023-08-15 11:10:52 +0000
  • 21c15d576d Tp ready. Nicolas Patry 2023-08-14 17:05:21 +0000
  • eaf9448b48 Adding Idefics multi modal model. Nicolas Patry 2023-08-14 16:05:47 +0000
  • 3a86e4f519 Fmt. Nicolas Patry 2023-08-14 18:28:57 +0200
  • d73204d72c Fix benchmark Nicolas Patry 2023-08-14 18:19:03 +0200
  • 278e387d8c
    Update README.md Pasquale Minervini 2023-08-14 15:27:55 +0200
  • e149656194 Fix tokenizers==0.13.4 . Nicolas Patry 2023-08-14 12:38:00 +0000
  • 933f7adbf4 Remove vllm Nicolas Patry 2023-08-14 09:22:51 +0000
  • 0ad99f512c Remove dead dirs. Nicolas Patry 2023-08-14 09:21:59 +0000
  • 4ddeea2551 Version 1.0.1 Nicolas Patry 2023-08-14 09:18:54 +0000
  • 86e2ec4952 modified readme SawanRawatJr 2023-08-12 19:30:19 +0530
  • 89a4e723d2 Attempting to fix torch leak. fix_leak Nicolas Patry 2023-08-12 09:06:49 +0200
  • f21c507ebf
    Update docs/source/basic_tutorials/consuming_tgi.md Omar Sanseviero 2023-08-11 17:01:20 +0200
  • c7b26c5453
    Update consuming_tgi.md Merve Noyan 2023-08-11 17:52:58 +0300
  • 607238a4ce
    Update docs/source/basic_tutorials/consuming_tgi.md Merve Noyan 2023-08-11 17:33:26 +0300
  • 40876c6887
    Added ChatUI Screenshot Merve Noyan 2023-08-11 17:32:38 +0300
  • 36a34cc458 Bypassing config trust remote code.. Nicolas Patry 2023-08-11 14:06:08 +0000
  • 76dbf190d4 Fix rope dynamic + factor Nicolas Patry 2023-08-11 15:30:01 +0200
  • db2dd5229b
    Added streaming Merve Noyan 2023-08-11 16:21:11 +0300
  • 4a9615e8ff Add to ToC streaming_conceptual osanseviero 2023-08-11 15:05:10 +0200
  • 6daee77c09 Add embedded space osanseviero 2023-08-11 15:03:56 +0200
  • a19e49b4a1
    Merge branch 'main' into nice-snippets Omar Sanseviero 2023-08-11 11:44:02 +0200
  • 5980565091
    [docs] Build docs only when doc files change Mishig 2023-08-10 21:33:32 +0200
  • 247af2d1a8 removed uneeded diff marcusdunn 2023-08-10 11:09:21 -0700
  • d67a2e22fa formatting marcusdunn 2023-08-10 10:58:06 -0700
  • 1c9d953962 formatting marcusdunn 2023-08-10 10:57:34 -0700
  • 9a0262f38c tokenized inputs from pb marcusdunn 2023-08-10 10:16:47 -0700
  • a64c2a6f89 fixed HeterogeneousNextTokenChooser by using HeterogeneousProcessorWrapper with SequenceBiasLogitsProcessor marcusdunn 2023-08-10 09:49:55 -0700
  • 8453eca41b added logit_bias to the tgi server using SequenceBiasLogitsProcessor marcusdunn 2023-08-10 09:32:49 -0700
  • 20b05bc8ba added logit_bias param to REST and GRPC marcusdunn 2023-08-09 16:31:01 -0700
  • 6523e8f2a7
    Update using_cli.md Merve Noyan 2023-08-10 15:55:44 +0300
  • d369adfa8e
    Update docs/source/quicktour.md Omar Sanseviero 2023-08-10 14:52:33 +0200
  • 4d5fc1235f
    Update installation.md Merve Noyan 2023-08-10 15:52:22 +0300
  • d90425eaa7
    Merge branch 'main' into added_cli_docs Merve Noyan 2023-08-10 15:51:11 +0300
  • 3fb7803ed9
    adjust toctree and shell to bash Merve Noyan 2023-08-10 15:28:28 +0300
  • 6fa3d9963c
    Update docs/source/basic_tutorials/using_cli.md Merve Noyan 2023-08-10 15:26:15 +0300
  • 7f0ff8d7a9
    Update docs/source/basic_tutorials/using_cli.md Merve Noyan 2023-08-10 15:26:10 +0300
  • d1c1621d40
    Update docs/source/basic_tutorials/using_cli.md Merve Noyan 2023-08-10 15:26:05 +0300
  • 5b920b6a4c Have snippets in Python/JavaScript in quicktour osanseviero 2023-08-10 14:12:41 +0200
  • 712393a702
    Added back CLI md Merve Noyan 2023-08-10 14:09:34 +0300
  • 58092d33a3
    Update docs/source/installation.md Merve Noyan 2023-08-10 14:02:03 +0300
  • 3723d1bbca
    Update docs/source/installation.md Merve Noyan 2023-08-10 14:01:57 +0300
  • 46a794e635
    Remove redundant CLI md Merve Noyan 2023-08-10 13:48:15 +0300
  • 15ef2bc082
    sync changes and refactor Merve Noyan 2023-08-10 13:47:07 +0300
  • 29129dc660
    Merge branch 'main' into added_cli_docs Merve Noyan 2023-08-10 13:42:02 +0300
  • 198437f66d
    Update docs/source/_toctree.yml Merve Noyan 2023-08-10 13:31:50 +0300
  • 263f41f8da Minor style fixes osanseviero 2023-08-10 11:28:07 +0200
  • 1b4a1d8260 Fix snippet osanseviero 2023-08-10 11:21:29 +0200
  • 2d8b31a4b4 Fix gated docs osanseviero 2023-08-10 11:14:17 +0200
  • 43ed6c217a Dummy commit test_docs osanseviero 2023-08-10 10:33:52 +0200
  • 9304e07423 Move launch locally to an installation section osanseviero 2023-08-10 10:19:55 +0200
  • 33e6d264ab Rename docker launch to quicktour osanseviero 2023-08-10 10:15:31 +0200
  • 50fd663b9b Fix docker build, pinning the pytorch version. Nikola Borisov 2023-08-09 14:43:08 -0700
  • 744d78d88e Add docker run --help flag + add curl example in docker_launch osanseviero 2023-08-09 23:14:23 +0200
  • ae7c21af5b
    Update README with fork info Nikola Borisov 2023-08-09 13:56:39 -0700
  • 38652c9da5 Replace CLI command with docker run IMG --help, it gets improved after #799 osanseviero 2023-08-09 21:43:05 +0200
  • c48c2cc1fe
    Add gated docs (#797) Merve Noyan 2023-08-09 21:35:47 +0300
  • 72cc292611
    Update using_cli.md Merve Noyan 2023-08-09 21:09:20 +0300
  • c4c7e6d80d
    Added CLI docs and rename docker launch Merve Noyan 2023-08-09 21:03:18 +0300
  • f5bef02850
    Update gated_model_access.md Merve Noyan 2023-08-09 20:15:32 +0300
  • bb1a5e64a1
    Update docs/source/basic_tutorials/gated_model_access.md Merve Noyan 2023-08-09 20:15:11 +0300
  • 3c648e54f1 Use destructuring in router arguments to avoid '.0' Ivar Flakstad 2023-08-09 18:09:43 +0200
  • e2910acedf
    Added note about gated models Merve Noyan 2023-08-09 18:09:13 +0300
  • 271e6393bc
    Update _toctree.yml Merve Noyan 2023-08-09 18:03:15 +0300
  • ec592d550e Small rearrangement in index page osanseviero 2023-08-09 16:37:02 +0200
  • 862db1be0c
    Wording nit Merve Noyan 2023-08-09 17:11:11 +0300
  • 15175839b4
    Added note to install huggingface-hub Merve Noyan 2023-08-09 17:09:13 +0300
  • cd226354fb Rebased and removed backend. Nicolas Patry 2023-08-09 16:08:51 +0200
  • 25b6694da1 Cargo fmt. Nicolas Patry 2023-08-18 14:35:58 +0200
  • 730e5938f5 Removing dead code + "Fix" test. Nicolas Patry 2023-08-18 12:41:10 +0200
  • e0b197ea09 Fix after rebase. Nicolas Patry 2023-08-17 15:18:14 +0000
  • 25d6ddf6ea Fix typo in batch concatination Vincent Brouwers 2023-08-09 08:39:18 +0000
  • 66705831a9 Only return top_tokens field when requested Vincent Brouwers 2023-08-02 13:03:19 +0000
  • 8b2847fcf8 Add max_top_n_tokens CLI argument Vincent Brouwers 2023-08-02 12:42:59 +0000
  • af0adb8c71 Defer building top-token objects to Rust Vincent Brouwers 2023-08-01 15:02:30 +0000
  • 6429695228 Skip top-n tokens in prefill Vincent Brouwers 2023-08-01 13:55:38 +0000
  • 1b5fdf7000 Allocate top_n_token tensor in Batch Vincent Brouwers 2023-07-31 13:09:45 +0000
  • 65c0d9c19d Return more top-n-tokens when probabilities are equal Vincent Brouwers 2023-07-28 14:21:11 +0000
  • 8515999b1d Implement top-n-tokens for all models Vincent Brouwers 2023-07-26 15:12:57 +0000
  • 38691f8a28 Share computation for top-n-token decoding Vincent Brouwers 2023-07-25 14:55:32 +0000
  • dbb92c20e7 Add batched top-n-tokens to FlashCausalLM Vincent Brouwers 2023-07-25 14:17:25 +0000
  • 0facd94738 Add top-n-tokens support to benchmark Vincent Brouwers 2023-07-24 14:02:56 +0000
  • 8a4d2076a6 Add WIP support for returning top tokens Vincent Brouwers 2023-07-14 19:48:15 +0000
  • e605c2a43e
    Supporting code llama. (#918) Nicolas Patry 2023-08-24 18:54:47 +0200
  • 97444f9367
    Added gradio example to docs (#867) Merve Noyan 2023-08-24 00:50:12 +0300
  • 888c029114
    Upgrade version number in docs. (#910) Nicolas Patry 2023-08-23 13:45:28 +0200
  • c4422e5678
    Adding small benchmark script. (#881) v1.0.2 Nicolas Patry 2023-08-18 19:28:56 +0200
  • bfa070611d
    Add streaming guide (#858) Omar Sanseviero 2023-08-18 13:27:08 +0200
  • bce5e22444
    Adding Idefics multi modal model. (#842) Nicolas Patry 2023-08-17 14:38:49 +0200
  • b9e33c4953
    Upgrading versions of python client. (#862) Nicolas Patry 2023-08-17 09:15:35 +0200
  • 2e68ac01c0
    "Fix" for rw-1b. (#860) Nicolas Patry 2023-08-17 09:05:41 +0200
  • d9bceb8e6b
    Misc improvements for InferenceClient docs (#852) Omar Sanseviero 2023-08-16 14:29:54 +0200