Fix Falcon weight mapping for H2O.ai checkpoints

During the safetensor conversion, duplicate weights are removed.
However, which of the duplicates gets removed differs per checkpoint.
In some, like `h2oai/h2ogpt-oig-oasst1-falcon-40b`, the weight
`transformer.word_embeddings.weight` gets removed. In others,
`lm_head.weight` gets removed. Long story short, we need to support both.

Originally, f018143 mapped `lm_head` to `word_embeddings`. Then ac736fd
switched this around. This commit merges them and allows for both.
Vincent Brouwers 2023-08-30 09:50:49 +00:00
parent 7c2e0af2a6
commit e864b95656

@@ -54,7 +54,10 @@ class FlashRWSharded(FlashCausalLM):
             device,
             dtype,
             process_group=self.process_group,
-            aliases={"lm_head.weight": ["transformer.word_embeddings.weight"]},
+            aliases={
+                "lm_head.weight": ["transformer.word_embeddings.weight"],
+                "transformer.word_embeddings.weight": ["lm_head.weight"],
+            },
         )
 
         config.quantize = quantize
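
To see why the symmetric mapping works, here is a minimal sketch of alias
fallback during weight lookup, assuming a simplified in-memory loader. The
`Weights` class and `get_tensor` method below are illustrative stand-ins,
not the exact text-generation-inference API:

from typing import Any, Dict, List

class Weights:
    def __init__(self, tensors: Dict[str, Any], aliases: Dict[str, List[str]]):
        # name -> tensor, as actually stored in the safetensors checkpoint
        self._tensors = tensors
        # requested name -> alternative names to try when it is missing
        self._aliases = aliases

    def get_tensor(self, name: str) -> Any:
        if name in self._tensors:
            return self._tensors[name]
        # The conversion may have deduplicated this weight away; try aliases.
        for alias in self._aliases.get(name, []):
            if alias in self._tensors:
                return self._tensors[alias]
        raise KeyError(f"weight {name!r} not found and no alias matched")

# With the symmetric aliases from this commit, either surviving duplicate
# satisfies a request for the other name:
aliases = {
    "lm_head.weight": ["transformer.word_embeddings.weight"],
    "transformer.word_embeddings.weight": ["lm_head.weight"],
}
checkpoint = {"lm_head.weight": "embedding-matrix"}  # word_embeddings was dropped
weights = Weights(checkpoint, aliases)
assert weights.get_tensor("transformer.word_embeddings.weight") == "embedding-matrix"

Because each name lists the other as its alias, it no longer matters which
of the two duplicates the safetensors conversion kept.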