Commit Graph

  • 98923b0783
    Remove duplicated RUN in Dockerfile Alvaro Bartolome 2024-09-21 10:37:51 +0200
  • 397297251e
    Simplify crossterm imports Orhun Parmaksız 2024-09-21 09:48:20 +0200
  • 38c625bfeb
    Release 2.3.0 git_v2.3.0 Nicolas Patry 2024-09-20 18:15:06 +0200
  • 169178b937
    Preparing for release. (#2540) v2.3.0 Nicolas Patry 2024-09-20 17:42:04 +0200
  • 4fbe0f3791
    Upgrade version in docs. Nicolas Patry 2024-09-20 17:29:34 +0200
  • 7e2d18877e
    fix: wrap python basic logs in debug assertion in launcher (#2539) OlivierDehaene 2024-09-20 16:59:31 +0200
  • 43af75b531
    Preparing for release. Nicolas Patry 2024-09-20 16:53:51 +0200
  • e6df56d070
    use level filters instead OlivierDehaene 2024-09-20 15:17:45 +0200
  • 6c9b5de2b7
    fix: wrap python basic logs in debug assertion in launcher OlivierDehaene 2024-09-20 14:31:55 +0200
  • f10144cd9e Add DenseMoELayer and wire it up in Mixtral/Deepseek V2 Daniël de Kok 2024-09-20 11:29:48 +0000
  • 4b3922ed2f nix: remove unused _server.nix file Daniël de Kok 2024-09-20 12:02:49 +0000
  • 21d1b0cd8b fix conflict Mohit Sharma 2024-09-20 08:59:17 +0000
  • f478aa77ad
    hotfix: ipex fails since cuda moe kernel is not supported (#2532) Wang, Yi 2024-09-20 16:02:55 +0800
  • abd24dd385
    doc: clarify that --quantize is not needed for pre-quantized models (#2536) Daniël de Kok 2024-09-19 22:17:15 +0200
  • c103760172
    Update to moe-kenels 0.3.1 (#2535) Daniël de Kok 2024-09-19 22:16:32 +0200
  • f512021e77
    Stream options. (#2533) Nicolas Patry 2024-09-19 20:50:37 +0200
  • 905b3db5f8
    Workflow Nicolas Patry 2024-09-19 19:24:07 +0200
  • 650ff012fe
    Fixes. Nicolas Patry 2024-09-19 19:20:02 +0200
  • fdc1f897ed Attempt to fix apt failure Daniël de Kok 2024-09-19 14:39:01 +0000
  • b6794da3bf Merge branch 'fix_rocm_fa' into rocm_6.2_fixes Mohit Sharma 2024-09-19 14:28:46 +0000
  • 4fb947d2aa fixed style Mohit Sharma 2024-09-19 14:28:21 +0000
  • ef7acd4452 doc: clarify that --quantize is not needed for pre-quantized models Daniël de Kok 2024-09-19 14:12:49 +0000
  • bb4101ea5b Update to moe-kenels 0.3.1 Daniël de Kok 2024-09-19 10:57:43 +0000
  • 41b297a26b Merge branch 'fix_rocm_fa' into rocm_6.2_fixes Mohit Sharma 2024-09-19 08:35:31 +0000
  • 1c58942133
    Optional usage. Nicolas Patry 2024-09-18 14:31:45 +0200
  • e6d07a6d34 euff Mohit Sharma 2024-09-18 12:03:52 +0000
  • 42fc45be62
    develop. Nicolas Patry 2024-09-18 13:26:00 +0200
  • bf8e8b5307
    Impure test because we need network. Nicolas Patry 2024-09-18 13:17:14 +0200
  • d495a8ac3d
    Update the docs. Nicolas Patry 2024-09-18 13:09:02 +0200
  • df287fe758
    Only send the usage when asked for. Nicolas Patry 2024-09-18 12:56:59 +0200
  • 4716bd51ad
    Adding the assert. Nicolas Patry 2024-09-18 12:41:21 +0200
  • 162549f37f
    Fetch stuff from nix integration test for easier testing. Nicolas Patry 2024-09-18 12:09:26 +0200
  • 678721bcf0
    Stream options. Nicolas Patry 2024-09-18 11:14:53 +0200
  • 098d313394 hotfix: ipex fails since cuda moe kernel is not supported Wang, Yi A 2024-09-17 19:52:42 -0700
  • 3184bb70cb
    bump hf hub to 0.25.0 Nicholas Broad 2024-09-17 10:03:52 -0700
  • ce85efa968
    Move to moe-kernels package and switch to common MoE layer (#2511) Daniël de Kok 2024-09-17 18:08:58 +0200
  • 86984e3236
    fix: metrics unbounded memory (#2528) OlivierDehaene 2024-09-17 18:01:28 +0200
  • fe920d5c76
    Merge branch 'huggingface:main' into tylertitsworth/numba-cache-fix Tyler Titsworth 2024-09-17 09:00:09 -0700
  • 662e073668
    priv-cache. nix_integration_tests Nicolas Patry 2024-09-17 17:26:37 +0200
  • 7e5a9cc533
    fix: metrics unbounded memory OlivierDehaene 2024-09-17 17:11:45 +0200
  • 2d3afb3274
    Wtf state. Nicolas Patry 2024-09-17 16:31:06 +0200
  • 0dd6eef748 Update runner feature/moe-kernels Daniël de Kok 2024-09-17 14:01:52 +0000
  • 911f82a34b
    Using the cache on both jobs. Nicolas Patry 2024-09-17 14:05:06 +0200
  • 71e4268600
    nix: pure Rust check/fmt/clippy/test (#2525) Daniël de Kok 2024-09-17 12:14:30 +0200
  • 110b9a0b4c
    Longer timeout Nicolas Patry 2024-09-17 12:13:51 +0200
  • 748a8090cd nix: pure Rust check/fmt/clippy/test Daniël de Kok 2024-09-17 08:24:36 +0000
  • c821a0ff76
    Tmp dump. prefix_chunk Nicolas Patry 2024-09-03 12:30:12 +0200
  • 2f0fde1055
    TMP chunking. Nicolas Patry 2024-09-02 11:46:36 +0200
  • df4b1ec936
    Remove NCCL debug. Nicolas Patry 2024-09-17 11:13:11 +0200
  • cd9fc66058 Make cargo check pass Daniël de Kok 2024-09-17 08:52:39 +0000
  • 666d946ed7
    Give me the rights. Nicolas Patry 2024-09-17 10:43:45 +0200
  • bec5c94714
    No capsys inside docker. Nicolas Patry 2024-09-17 10:35:18 +0200
  • 06cee05d44
    O bind what ? Nicolas Patry 2024-09-17 10:34:32 +0200
  • 1333c58b62
    Syntax ? Nicolas Patry 2024-09-17 10:33:23 +0200
  • 123a59531d
    Attempt a bind instead of symlink. Nicolas Patry 2024-09-17 10:30:14 +0200
  • c584443373
    Symlink doesn't work Nicolas Patry 2024-09-17 10:27:08 +0200
  • dc2e1a36e0
    Fix ? Nicolas Patry 2024-09-17 10:23:01 +0200
  • b74f335b02
    Give access to runner. Nicolas Patry 2024-09-17 10:21:32 +0200
  • 54e703cc5a
    Create /nix before the action creates it. Nicolas Patry 2024-09-17 10:19:22 +0200
  • 1ff5b64b1c
    OMG. Nicolas Patry 2024-09-17 10:16:26 +0200
  • e680a57147
    Disabling the sharding please. Nicolas Patry 2024-09-17 10:12:52 +0200
  • 5827137a29
    Wtf ? Nicolas Patry 2024-09-17 10:04:43 +0200
  • a34dbb0ca1
    Force this stuff. Nicolas Patry 2024-09-17 09:59:26 +0200
  • c859663f98
    NCCL attempts Nicolas Patry 2024-09-17 09:43:28 +0200
  • 7a5855ff01
    NCCL ? Nicolas Patry 2024-09-17 09:37:05 +0200
  • fb7e8c8970
    Add the cache. Nicolas Patry 2024-09-17 09:20:12 +0200
  • 2aa2851e01
    use runners with cache Guillaume LEGENDRE 2024-09-17 08:12:19 +0200
  • a18e071690 speculative decoding complete guide added Shirin Yamani 2024-09-16 18:17:37 -0600
  • 87c85fdc38
    Standard setup. Nicolas Patry 2024-09-16 17:04:11 +0200
  • 69c20a9d3f
    Tmate let's find with ldconfig ? Nicolas Patry 2024-09-16 16:18:29 +0200
  • c784cb401d
    Let's try a compat drvier ? Nicolas Patry 2024-09-16 12:16:47 +0200
  • fe533dc57b
    Back to failing version Nicolas Patry 2024-09-16 11:58:44 +0200
  • 2f1f082abe
    Tmate. Nicolas Patry 2024-09-16 11:30:18 +0200
  • 1a6b9926f6
    missing lib. Nicolas Patry 2024-09-16 11:22:17 +0200
  • 332e42f59a
    Attempt. Nicolas Patry 2024-09-16 11:16:03 +0200
  • ec6fe324c6
    Link to nix owned lib Nicolas Patry 2024-09-16 10:56:32 +0200
  • 83ee55a617
    Trye somethign. Nicolas Patry 2024-09-16 10:40:21 +0200
  • 047530216c
    No idea where the shared disk is. Nicolas Patry 2024-09-15 15:34:33 +0200
  • 9f548fa82a
    Change the home location ? Nicolas Patry 2024-09-15 14:58:51 +0200
  • 3ff12084b7
    Revert "No tmate." Nicolas Patry 2024-09-15 14:49:41 +0200
  • 26634f9697
    No tmate. Nicolas Patry 2024-09-15 14:34:09 +0200
  • a533d086f0
    Tmate to find cache. Nicolas Patry 2024-09-15 00:43:18 +0200
  • a5b81ab457
    Home. Nicolas Patry 2024-09-15 00:24:29 +0200
  • 98f2241a88
    Put back libnvidia-ml Nicolas Patry 2024-09-14 23:29:12 +0200
  • 72a805d50d
    Remove tmate. Nicolas Patry 2024-09-14 21:37:47 +0200
  • 45c0129976
    Attempting something. Nicolas Patry 2024-09-14 21:08:04 +0200
  • 2b18537f85
    More tmate. Nicolas Patry 2024-09-14 18:54:01 +0200
  • 12b88204b0
    Putting the cuda package in the flake. Nicolas Patry 2024-09-14 18:43:12 +0200
  • d7333830b5
    Tmate. Nicolas Patry 2024-09-14 18:31:59 +0200
  • f8a7df41c4
    More timeout to the healthcheck. Nicolas Patry 2024-09-14 17:20:27 +0200
  • 68f7d75f23
    Fixing the update to outlines. Nicolas Patry 2024-09-14 15:13:00 +0200
  • c4bbe06bf1
    Simpler command Nicolas Patry 2024-09-14 15:02:56 +0200
  • d0ae24a167
    Release tests. Nicolas Patry 2024-09-13 23:16:14 +0200
  • 5c4b2eaa30
    Seeing the damage on the release tests. Nicolas Patry 2024-09-13 23:06:13 +0200
  • 2bd0117c27
    Remove unused code. Nicolas Patry 2024-09-13 21:50:59 +0200
  • 70f910bba6
    Remove tmate. Nicolas Patry 2024-09-13 21:49:31 +0200
  • b4654a36dc
    Fixing up the tests ? Nicolas Patry 2024-09-13 21:48:12 +0200
  • 5adece6313
    This doesn't seem needed. Nicolas Patry 2024-09-13 19:16:13 +0200
  • dd4b774e0d
    New cargo lock Nicolas Patry 2024-09-13 18:56:49 +0200
  • b7cb8d5145
    Let's figure out the issue... Nicolas Patry 2024-09-13 18:49:27 +0200