Commit Graph

  • d2fcbbc72e Add Mixtral models (#553) Artem Chumachenko 2024-03-29 12:07:42 +0100
  • 91b1e2a27b fix order of init Artem Chumachenko 2024-03-29 10:37:20 +0100
  • 5fe8fa4a45 Update src/petals/models/mixtral/model.py Artem Chumachenko 2024-03-29 10:32:11 +0100
  • 866927d88c remove unnes Artem Chumachenko 2024-03-12 12:05:09 +0400
  • 81a5e70c89 fix bloom Artem Chumachenko 2024-02-27 15:43:48 +0400
  • 08bbbd38f0 style Artem Chumachenko 2024-02-27 15:10:33 +0400
  • 7b6224d0cf Choose right attn Artem Chumachenko 2024-02-27 15:08:44 +0400
  • 4852cd5bf0 Fixes Artem Chumachenko 2024-02-06 19:20:59 +0400
  • 4cdd57cf49 Fix generation Artem Chumachenko 2024-02-06 13:59:44 +0400
  • d275d79b72 Add somehow workable version Artem Chumachenko 2024-01-29 13:27:40 +0400
  • 23e1803f1a Merge 63282afb4e into 2ad0b2b936 Alexander Borzunov 2024-03-19 12:43:07 +0400
  • 07e64c2993 Merge 6626f8ecc2 into 2ad0b2b936 Denis Mazur 2024-03-19 12:43:07 +0400
  • 2ad0b2b936 Fix p2p pushing in rpc_inference (by @miaoqijun), support transformers 4.38.2 (#563) justheuristic 2024-03-18 00:11:47 +0300
  • 18ec785b31 test cache_position safely Your Name 2024-03-18 00:01:49 +0300
  • 6278395227 support transformers 4.38.2 Your Name 2024-03-17 23:45:58 +0300
  • 6dc7794557 try bump transformers version Your Name 2024-03-17 22:45:03 +0300
  • ac48d6670a Forward step_metadata to fix p2p pushing in rpc_inference; found by @miaoqijun in #550 Yingtong Dou 2024-03-17 22:35:24 +0300
  • 26d4cd855d 2 forward_backward Denis Mazur 2024-03-17 18:50:29 +0300
  • 16619f4564 some stuff works Denis Mazur 2024-03-09 22:10:11 +0300
  • f584f72bc7 minimal rpc call notebook Denis Mazur 2024-03-09 21:23:02 +0300
  • f8b922ca34 initial Denis Mazur 2024-03-09 21:18:39 +0300
  • 5ec7f373c7 Merge branch 'main' into forward_backward Denis Mazur 2024-03-10 01:14:09 +0700
  • fe076fcf06 initial Denis Mazur 2024-03-09 21:10:36 +0300
  • efee5d1fa8 Clean disk space in push-docker-image.yaml (#558) justheuristic 2024-03-02 20:25:03 +0300
  • 3bd142d35a Clean disk space in push-docker-image.yaml justheuristic 2024-02-28 14:36:47 +0300
  • 6626f8ecc2 Update push-docker-test2.yaml fix-docker justheuristic 2024-02-28 14:32:07 +0300
  • f96be67364 Create push-docker-test2.yaml justheuristic 2024-02-19 21:11:53 +0300
  • faf1e71fe4 try bumping buld-push-action version Denis Mazur 2024-02-17 20:24:53 +0300
  • 0d91bbdac3 Bump transformers and accelerate versions (#554) Denis Mazur 2024-02-15 15:24:37 +0300
  • 4a892e2b95 actually assert transformers version Denis Mazur 2024-02-15 11:21:03 +0300
  • 103b437770 versions check Denis Mazur 2024-02-14 01:28:48 +0300
  • 3dab138f57 bump accelerate Denis Mazur 2024-02-13 20:34:29 +0300
  • 72c8ce9443 remove skips Denis Mazur 2024-02-13 19:15:40 +0300
  • 38a03bcc6a forked Denis Mazur 2024-02-13 19:08:47 +0300
  • 10cca1f152 try skipping bot remote seq tests Denis Mazur 2024-02-13 17:18:32 +0300
  • 0bb67e864f try skipping flaky test Denis Mazur 2024-02-12 02:45:57 +0300
  • 61c8d6eda6 revert to older peft Denis Mazur 2024-02-12 01:53:00 +0300
  • 25ee8ecfd0 remove falcon from ci tests Denis Mazur 2024-02-11 23:42:12 +0300
  • 72e1f00599 bloom Denis Mazur 2024-02-11 23:06:17 +0300
  • 7ff5952499 fix type annotation Denis Mazur 2024-02-06 21:24:40 +0300
  • 876b83868b fix remote generation Denis Mazur 2024-02-03 00:47:37 +0300
  • e9c81b2beb make test_optimized_block forked again Denis Mazur 2024-02-02 21:56:09 +0300
  • a945711f58 refactor llama/model.py Denis Mazur 2024-02-02 21:54:15 +0300
  • 3b00036ec6 fix newline in setup.cfg Denis Mazur 2024-02-02 21:51:08 +0300
  • 027b506508 remove repr from remote kv cache Denis Mazur 2024-02-02 21:49:55 +0300
  • 61456d9968 fix optimized layers test Denis Mazur 2024-02-02 21:38:33 +0300
  • 1e6bd07bf6 fix peft Denis Mazur 2024-02-02 00:45:00 +0300
  • 321b0483a1 first attempt Denis Mazur 2024-02-02 00:14:46 +0300
  • 382f267952 first attempt Denis Mazur 2024-01-25 19:04:09 +0300
  • 08b83d27c9 Create agent-examples.ipynb Roman Ageev 2023-12-12 06:29:27 +0800
  • 875dc81c4c Merge 3195579620 into d59c15c578 justheuristic 2023-12-02 21:00:48 +0000
  • 3195579620 Merge remote-tracking branch 'origin/main' into forward_kwargs forward_kwargs Your Name 2023-12-02 23:57:55 +0300
  • 3251f36359 Merge 401e791700 into d59c15c578 Poedator 2023-11-22 13:00:49 +0000
  • 401e791700 black younesbelkada 2023-11-22 13:00:35 +0000
  • 5aebd3e8fc edits to bloom, falcon inputs_embeds arg younesbelkada 2023-11-22 12:47:28 +0000
  • bd0ca0fbae llama block reformatted younesbelkada 2023-11-22 12:43:08 +0000
  • e5a3fe64e2 attn_mask fixes for falcon younesbelkada 2023-11-22 12:36:33 +0000
  • 3810049411 _prepare_4d_causal_attention_mask in tests younesbelkada 2023-11-22 12:26:27 +0000
  • 741b5394cc past_key_values to None if zero shape younesbelkada 2023-11-22 09:48:44 +0000
  • fa254cff02 Llama rotary dims from 4 to 2 younesbelkada 2023-11-22 08:30:08 +0000
  • 2bdbf2da58 mask to bool for bloom fwd younesbelkada 2023-11-22 08:28:30 +0000
  • c6db638433 upd Bloom _prepare_attn_mask() younesbelkada 2023-11-21 23:40:48 +0000
  • d315467aab upped transformers ver to 4.35 younesbelkada 2023-11-21 23:00:23 +0000
  • 76479fdc43 replaced call to _prepare_4d_causal_attention_mask younesbelkada 2023-11-21 22:24:34 +0000
  • 313992eb38 (temporary) bump version bump justheuristic 2023-11-21 01:46:07 +0300
  • 4dcb735732 install script to help the project deployment or for easy use PS. i hate vim unknown 2023-11-16 19:10:45 +0100
  • d59c15c578 Bump version for inference diagnostics (#543) justheuristic 2023-11-16 06:12:30 +0300
  • 8fa2993481 bump version for inference diagnostics justheuristic 2023-11-16 05:59:56 +0300
  • 03cbe90234 Optimize LLaMA for inference (#513) Max Ryabinin 2023-11-14 18:14:19 +0100
  • 31dfea4657 Merge branch 'main' into optimize_llama justheuristic 2023-11-14 15:18:02 +0300
  • 947bf2387b Merge 4159e557bf into 25a0796b39 Max Ryabinin 2023-11-10 13:50:00 -0700
  • 25a0796b39 Hotfix: require peft version 0.5.0 (#539) justheuristic 2023-11-07 19:05:54 +0300
  • ce6b85232c Merge branch 'main' into test-adapter justheuristic 2023-11-07 18:48:29 +0300
  • 8be7040f55 Peft: strict version check for now Your Name 2023-11-07 18:46:38 +0300
  • dcce43670f Hotfix: set transformers version <=4.34 temporarily (#538) justheuristic 2023-11-07 18:19:19 +0300
  • aae440a7b4 fix transformers version for now Your Name 2023-11-07 17:50:15 +0300
  • d85a91e195 fix transformers version for now Your Name 2023-11-07 17:33:14 +0300
  • b6eb38eb85 test rebuild test_main Your Name 2023-11-07 17:20:38 +0300
  • 113e57b11d Add OpenAI compatible API Roman Ageev 2023-11-07 02:24:58 +0800
  • a19ccea233 Fix model type detection in tests Max Ryabinin 2023-09-20 15:40:30 +0300
  • 94b052db21 Fix optimizations, test LLaMA properly Max Ryabinin 2023-10-20 17:49:04 +0300
  • 78c7f58200 Optimize LLaMa for inference Max Ryabinin 2023-09-18 12:25:24 +0300
  • 63282afb4e Try 5% fails fix-inference-retry Aleksandr Borzunov 2023-09-28 18:38:56 +0000
  • 82a97d6e9e Fix beam search in GPU clients (#531) Alexander Borzunov 2023-10-23 20:13:13 +0400
  • 36688c294e Add --device cpu to CI servers, refactor run-tests.yaml Aleksandr Borzunov 2023-10-23 12:44:43 +0000
  • 13df222da6 Fix hypo_ids on cuda Aleksandr Borzunov 2023-10-23 09:58:10 +0000
  • 47d50e1e29 Improve default arguments for clients and servers (#530) Alexander Borzunov 2023-10-23 05:26:40 +0600
  • 5c63682ef4 Limit max number of retries in tests Aleksandr Borzunov 2023-10-22 21:10:08 +0000
  • f4ee2ab511 Show "bf16 on CPU" warning even if there's not enough RAM Aleksandr Borzunov 2023-10-22 20:58:09 +0000
  • 8d41e0ed3d Use float32 as default client dtype Aleksandr Borzunov 2023-10-22 20:56:14 +0000
  • 46aef2cf2e Make default --attn_cache_tokens 2x smaller Aleksandr Borzunov 2023-10-22 20:52:40 +0000
  • ae19b65095 Add position_ids argument to DistributedFalconModel (#525) Max Ryabinin 2023-10-08 22:09:46 +0300
  • 4a06994efa Add position_ids argument to DistributedFalconModel Max Ryabinin 2023-10-08 20:29:49 +0200
  • 1d9401ddce Update README.md (#520) Alexander Borzunov 2023-09-22 06:16:32 +0400
  • 0ab2c206cb Update README.md Alexander Borzunov 2023-09-22 03:32:14 +0400
  • af798091e6 Update README.md Alexander Borzunov 2023-09-22 03:31:32 +0400
  • a88fdfb506 Update README.md Alexander Borzunov 2023-09-22 03:25:19 +0400
  • 92c3d76ac5 Update README.md Alexander Borzunov 2023-09-22 03:24:03 +0400
  • d7335cef79 Update README.md Alexander Borzunov 2023-09-22 03:21:39 +0400
  • b3c8b62ebe Update README.md Alexander Borzunov 2023-09-22 03:21:22 +0400