mirror of
				https://github.com/huggingface/text-generation-inference.git
				synced 2025-10-20 12:25:23 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			31 lines
		
	
	
		
			551 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			31 lines
		
	
	
		
			551 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| <div align="center">
 | |
| 
 | |
| # Text Generation Inference benchmarking tool
 | |
| 
 | |
| 
 | |
| 
 | |
| </div>
 | |
| 
 | |
| A lightweight benchmarking tool based inspired by [oha](https://github.com/hatoo/oha)
 | |
| and powered by [tui](https://github.com/tui-rs-revival/ratatui).
 | |
| 
 | |
| ## Install
 | |
| 
 | |
| ```shell
 | |
| make install-benchmark
 | |
| ```
 | |
| 
 | |
| ## Run
 | |
| 
 | |
| First, start `text-generation-inference`:
 | |
| 
 | |
| ```shell
 | |
| text-generation-launcher --model-id bigscience/bloom-560m
 | |
| ```
 | |
| 
 | |
| Then run the benchmarking tool:
 | |
| 
 | |
| ```shell
 | |
| text-generation-benchmark --tokenizer-name bigscience/bloom-560m
 | |
| ```
 |