Commit Graph

75 Commits

Author SHA1 Message Date
Guillaume LEGENDRE
c1095bb61a
add debug 2024-03-18 11:54:31 +01:00
Guillaume LEGENDRE
ece6b94118
remove proxy 2024-03-18 11:01:09 +01:00
Guillaume LEGENDRE
6de10b659d
new tailscale action 2024-03-18 10:42:14 +01:00
Guillaume LEGENDRE
c64866e05a
exclude ubuntu.com domain 2024-02-21 19:45:45 +01:00
Guillaume LEGENDRE
e61f124f63
fix 2024-02-21 19:33:37 +01:00
Guillaume LEGENDRE
710b760602
fix typo 2024-02-21 19:28:48 +01:00
Guillaume LEGENDRE
3a85f1bd54
try fixing buildx proxy 2024-02-21 19:27:28 +01:00
Guillaume LEGENDRE
d0d0fd24a8
update tailscale action version 2024-02-21 15:43:58 +01:00
Guillaume LEGENDRE
92ab9d2ee6
change runner and remove tailscale userspace for amd 2024-02-21 15:41:05 +01:00
Guillaume LEGENDRE
383478758b
fix tailscale 2024-02-21 15:36:48 +01:00
Nicolas Patry
ab60d15962 Desperate attempt. 2024-02-14 10:27:21 +00:00
Nicolas Patry
584c5fa0a0 Tailscale. 2024-02-14 10:22:36 +00:00
Nicolas Patry
212e1cbcbe no sudo. 2024-02-14 10:19:46 +00:00
Nicolas Patry
ffa1804a34 .. 2024-02-14 10:16:24 +00:00
Nicolas Patry
5b0befee43 Test. 2024-02-14 10:13:45 +00:00
Nicolas Patry
df91f105e8 Ofc. 2024-02-14 10:11:22 +00:00
Nicolas Patry
7f0a816a22 Maybe XML wasn't so bad after all. 2024-02-14 10:10:28 +00:00
Nicolas Patry
b1aff577a0 Worse invention ever. 2024-02-14 10:09:00 +00:00
Nicolas Patry
0523031ffb ... 2024-02-14 10:05:29 +00:00
Nicolas Patry
69d1d3cde6 Bahs in yaml is not our friend. 2024-02-14 10:02:53 +00:00
Nicolas Patry
e36887cbf5 Install docker manually. 2024-02-14 10:00:33 +00:00
Nicolas Patry
05aef4dd1a Upgrade install buildx. 2024-02-14 09:57:15 +00:00
Nicolas Patry
85bf172653 Our runner docker in docker. 2024-02-14 09:52:34 +00:00
Nicolas Patry
c54b5c7f04 Remove tailscale. 2024-02-13 17:51:12 +01:00
Nicolas Patry
a83772c87b Self hosted for nvidia too. 2024-02-13 17:31:39 +01:00
Nicolas Patry
31d965bf17 Our runner. 2024-02-13 17:15:45 +01:00
drbh
c5ef81bed5
chore: bump ci rust version (#1543)
This PR bumps the rust toolchain in CI to resolve the CI build issue

```bash
  Downloaded crossbeam-utils v0.8.19
  Downloaded crc32fast v1.3.2
error: failed to compile `text-generation-router v1.4.0 (/home/runner/work/text-generation-inference/text-generation-inference/router)`, intermediate artifacts can be found at `/home/runner/work/text-generation-inference/text-generation-inference/target`

Caused by:
  package `clap_lex v0.7.0` cannot be built because it requires rustc 1.74 or newer, while the currently active rustc version is 1.71.0
  Either upgrade to rustc 1.74 or newer, or use
  cargo update -p clap_lex@0.7.0 --precise ver
  where `ver` is the latest version of `clap_lex` supporting rustc 1.71.0
make: *** [Makefile:12: install-router] Error 101
```
2024-02-09 10:32:04 +01:00
OlivierDehaene
c2d4a3b5c7
v1.4.0 (#1494) 2024-01-26 19:04:57 +01:00
OlivierDehaene
9b56d3fbf5
feat: relax mistral requirements (#1351)
Close #1253 
Close #1279
2023-12-15 12:52:24 +01:00
Nicolas Patry
3238c49121
Add a stale bot. (#1313) 2023-12-05 14:42:55 +01:00
fxmarty
b2b5df0e94
Add RoCm support (#1243)
This PR adds support for AMD Instinct MI210 & MI250 GPUs, with paged
attention and FAv2 support.

Remaining items to discuss, on top of possible others:
* Should we have a
`ghcr.io/huggingface/text-generation-inference:1.1.0+rocm` hosted image,
or is it too early?
* Should we set up a CI on MI210/MI250? I don't have access to the
runners of TGI though.
* Are we comfortable with those changes being directly in TGI, or do we
need a fork?

---------

Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
Co-authored-by: Your Name <you@example.com>
2023-11-27 14:08:12 +01:00
OlivierDehaene
8acdc1fae7 hotfix 1.1.1 2023-11-16 18:35:09 +01:00
Remy
72b8f88be8
fix: remove useless token (#1179)
This token is not used by your action.
Secret is removed from the repository.
2023-10-19 14:04:44 +02:00
Merve Noyan
259a230028
Automatic docs for TGI (#1045)
I had to open this PR since I initially worked from my fork, and it
requires a handful of work to trigger a new github action on my fork's
specific branch (couldn't find a way, at least, despite trying all of
them).

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-09-27 16:01:38 +02:00
Mishig
5df4c7c0d7
[docs] Build docs only when doc files change (#812)
Build docs only when change happens in `docs/source`

See for example
https://github.com/huggingface/api-inference/blob/main/.github/workflows/build_documentation.yml#L3-L8
2023-08-11 07:07:53 +02:00
Merve Noyan
647ae7a7d3
Setup for doc-builder and docs for TGI (#740)
I added ToC for docs v1 & started setting up for doc-builder. cc @Narsil
@osanseviero

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: osanseviero <osanseviero@gmail.com>
Co-authored-by: Mishig <mishig.davaadorj@coloradocollege.edu>
2023-08-10 10:24:52 +02:00
Nicolas Patry
92bb56b0c1
Local gptq support. (#738)
# What does this PR do?

Redoes #719

<!--
Congratulations! You've made it this far! You're not quite done yet
though.

Once merged, your PR is going to appear in the release notes with the
title you set, so make sure it's a great title that fully reflects the
extent of your awesome contribution.

Then, please replace this with a description of the change and which
issue is fixed (if applicable). Please also include relevant motivation
and context. List any dependencies (if any) that are required for this
change.

Once you're done, someone will review your PR shortly (see the section
"Who can review?" below to tag some potential reviewers). They may
suggest changes to make the code even better. If no one reviewed your PR
after a week has passed, don't hesitate to post a new comment
@-mentioning the same persons---sometimes notifications get lost.
-->

<!-- Remove if not applicable -->

Fixes # (issue)


## Before submitting
- [ ] This PR fixes a typo or improves the docs (you can dismiss the
other checks if that's the case).
- [ ] Did you read the [contributor
guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests),
      Pull Request section?
- [ ] Was this discussed/approved via a Github issue or the
[forum](https://discuss.huggingface.co/)? Please add a link
      to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes?
Here are the
[documentation
guidelines](https://github.com/huggingface/transformers/tree/main/docs),
and
[here are tips on formatting
docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation).
- [ ] Did you write any new necessary tests?


## Who can review?

Anyone in the community is free to review the PR once the tests have
passed. Feel free to tag
members/contributors who may be interested in your PR.

<!-- Your PR will be replied to more quickly if you can figure out the
right person to tag with @


@OlivierDehaene OR @Narsil

 -->
2023-07-31 10:32:52 +02:00
Nicolas Patry
f063ebde10
chore: migrate ci region for more availability. (#581) 2023-07-12 10:01:01 +02:00
OlivierDehaene
e3e487dc71
feat(server): support trust_remote_code (#363) 2023-05-23 20:40:39 +02:00
OlivierDehaene
5f67923cac
feat: add nightly load testing (#358) 2023-05-23 17:42:19 +02:00
oOraph
0a6494785c
fix(ci): fix security group (#359)
# What does this PR do?
Switch security group used for ci
(open outbound rules)

Signed-off-by: Raphael <oOraph@users.noreply.github.com>
Co-authored-by: Raphael <oOraph@users.noreply.github.com>
2023-05-23 16:49:11 +02:00
OlivierDehaene
5a58226130
fix(server): fix decode token (#334)
Fixes #333

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-05-16 23:23:27 +02:00
OlivierDehaene
dbdc587ddd
feat(integration-tests): improve comparison and health checks (#336) 2023-05-16 20:22:11 +02:00
OlivierDehaene
e71471bec9
feat: add snapshot testing (#282) 2023-05-15 23:36:30 +02:00
OlivierDehaene
66b277321d
feat(ci): custom gpu runners (#328) 2023-05-15 15:53:08 +02:00
Nicolas Patry
411b0d4e1f
chore(github): add templates (#264) 2023-05-02 15:43:19 +02:00
Ehsan M. Kermani
f092ba9b22
feat(server): add watermarking tests (#248) 2023-04-27 19:16:35 +02:00
Nicolas Patry
db2b4e0754
feat(router): new healthcheck that skips the queue (#244)
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
2023-04-26 20:23:54 +02:00
Nicolas Patry
c4fb09f2ae
feat(router): add tests to validation (#237) 2023-04-26 16:14:40 +02:00
OlivierDehaene
274513e6a3
fix(ci): fix sha in docker image (#212) 2023-04-20 18:50:47 +02:00