
microsoft/phi-1_5

Text Generation
Transformers PyTorch English mixformer-sequential custom_code
License: other

Inference time is much longer than reported

#25 opened about 3 hours ago by jeff-gao

Use DirectML for microsoft/phi-1_5

#24 opened 1 day ago by shiqi1031

raise error when `use_cache = True`

#23 opened 1 day ago by wjfwzzc

Adding _set_gradient_checkpointing for compatibility

#22 opened 1 day ago by vriveras

Any plan to release phi-1.5 web mentioned in your paper?

#21 opened 1 day ago by sanqiang

Adding `safetensors` variant of this model

#18 opened 5 days ago by SFconvertbot

Is a RetNet (Retentive Network) version planned?

#16 opened 6 days ago by guyko81

Adding tf lite variant

#13 opened 7 days ago by 0xrk

Could one potentially train a mini-model based on this concept on synthetic structural data?

4
#11 opened 7 days ago by Mr8BitHK

tokenizer.model file

4
#10 opened 8 days ago by hanisaf

Attention mask for generation function in the future?

3
#7 opened 8 days ago by rchan26

Unofficial dataset

3
#2 opened 8 days ago by SinanAkkoyun