Commit Graph

  • e1e478d45c Merge 4dcb735732 into e268c99a6b elsolo5000-2 2024-05-06 02:47:40 -0400
  • 5744205357 added setup.sh and added .env.example mike dupont 2024-04-29 20:11:40 +0000
  • a13e0986f5 Merge pull request #1 from jmikedupont2/feature/dht1 Richard Bakobiibizo 2024-04-29 11:26:26 -0700
  • db5cb42d40 update config mike dupont 2024-04-29 14:21:31 -0400
  • e268c99a6b Restrict PyTorch version to <2.3.0 to resolve import error (#577) main Priyanshupareek 2024-04-27 19:05:07 +0530
  • 88e1d6a990 Update setup.cfg Priyanshupareek 2024-04-27 18:06:34 +0530
  • 0d437de2cf Pin PyTorch version to 2.2.2 to resolve import error Priyanshupareek 2024-04-25 16:31:06 +0530
  • dd4a171acd Merge 08b83d27c9 into 30f522d1a0 Romech 2024-04-19 15:07:09 +0800
  • 088ba3f74b Merge 26d4cd855d into 30f522d1a0 Denis Mazur 2024-04-19 15:06:51 +0800
  • 1644e89385 reformat black mike dupont 2024-04-17 12:56:48 -0400
  • 3365e2efbf Merge branch 'feature/docker-compose' mike dupont 2024-04-17 12:50:40 -0400
  • 6124622667 mixtral working mike dupont 2024-04-17 12:46:20 -0400
  • c98b5328cb docker changes mike dupont 2024-04-17 12:35:10 -0400
  • be69f3117f grok mike dupont 2024-04-16 14:58:27 -0400
  • 661545f0ff tiny mixtral running locally mike dupont 2024-04-16 14:05:02 -0400
  • c76e447ac7 infrastructure j mike dupont 2024-04-15 20:05:36 +0000
  • 30f522d1a0 Fix dummy cache allocation (#574) Artem Chumachenko 2024-04-16 19:25:29 +0200
  • 1222e172ef github actions and docker compose Ubuntu 2024-03-28 19:14:24 +0000
  • 0fda7da816 pip freeze mike dupont 2024-04-15 16:32:49 -0400
  • 2a522d01f3 update j mike dupont 2024-04-15 20:05:36 +0000
  • 0ca54a5e76 Rechain reloc Artem Chumachenko 2024-04-11 11:41:16 +0200
  • e5dddfe0b6 Try mps device selecting Artem Chumachenko 2024-04-11 11:37:27 +0200
  • f02bd578b7 Fix dummy cache allocation Artem Chumachenko 2024-04-11 10:54:32 +0200
  • 2d3b63ed15 first docker compose mike dupont 2024-04-10 13:58:28 -0400
  • 2325875186 moving to my org mike dupont 2024-04-10 11:20:42 -0400
  • d6f4f80f3f Fix Mixtral-related issues (#570) Artem Chumachenko 2024-04-10 13:49:50 +0200
  • 5ca2a031f5 Comments Artem Chumachenko 2024-04-10 10:48:44 +0200
  • 63a421bad9 Add llama to non-bs mix Artem Chumachenko 2024-04-10 10:43:46 +0200
  • e3539c0759 Comments Artem Chumachenko 2024-04-10 10:34:48 +0200
  • 6a1596b633 Update tests/test_full_model.py Artem Chumachenko 2024-04-10 10:32:04 +0200
  • 1b4bb1a743 Another fix Artem Chumachenko 2024-04-10 07:34:10 +0200
  • 16d97fcbce Fix benchmarks Artem Chumachenko 2024-04-10 07:23:18 +0200
  • a447cbe16d Add assert about BS Artem Chumachenko 2024-04-10 07:12:11 +0200
  • 5f91793f8c Skip BS for mixtral for now Artem Chumachenko 2024-04-09 15:26:49 +0200
  • ba271dc626 Fix get_model_block Artem Chumachenko 2024-04-08 19:41:59 +0200
  • 2ca531623c Return compatibility with tests Artem Chumachenko 2024-04-08 19:40:03 +0200
  • 0f498814be Fix cache again Artem Chumachenko 2024-04-08 19:34:47 +0200
  • 46e29b230e Fix cache in tests Artem Chumachenko 2024-04-08 19:27:10 +0200
  • d41ff56047 Fix type Artem Chumachenko 2024-04-08 19:13:00 +0200
  • db3087aee9 Style Artem Chumachenko 2024-04-08 19:02:47 +0200
  • 9eb7928bef Add mixtral in tests Artem Chumachenko 2024-04-08 19:00:56 +0200
  • 204855c751 fix cache and throughput Artem Chumachenko 2024-04-08 18:59:13 +0200
  • c6dd015ea3 petals inference John Doe 2024-04-08 16:17:30 +0000
  • f06cfd2b97 fix imports Artem Chumachenko 2024-04-08 18:08:57 +0200
  • aecf074f25 fix block init Artem Chumachenko 2024-04-08 18:04:50 +0200
  • 5b7e7c7375 update mike dupont 2024-04-08 11:48:52 -0400
  • dfcdaca2ac run Ubuntu 2024-03-28 19:14:24 +0000
  • 84dd3a1a5d Merge b945e388e5 into d2fcbbc72e Alexander Borzunov 2024-04-08 10:22:01 +0800
  • c44c6cba92 working as root John Doe 2024-03-30 15:48:57 +0000
  • 998cb34722 adding steps John Doe 2024-03-30 14:56:26 +0000
  • afe06e1757 update Ubuntu 2024-03-28 19:34:10 +0000
  • 80443ec10a run Ubuntu 2024-03-28 19:14:24 +0000
  • 03e49d93f2 Fix p2p pushing in rpc_inference (by @miaoqijun ) , support transformers 4.38.2 (#563) justheuristic 2024-03-18 00:11:47 +0300
  • b638b046ee Clean disk space in push-docker-image.yaml (#558) justheuristic 2024-03-02 20:25:03 +0300
  • f7e13c40db Bump transformers and accelerate versions (#554) Denis Mazur 2024-02-15 15:24:37 +0300
  • b2b3f974b2 Bump version for inference diagnostics (#543) justheuristic 2023-11-16 06:12:30 +0300
  • 29363912c4 Optimize LLaMA for inference (#513) Max Ryabinin 2023-11-14 18:14:19 +0100
  • 83c7b5639f Hotfix: require peft version 0.5.0 (#539) justheuristic 2023-11-07 19:05:54 +0300
  • 3e6c09ffd7 Hotfix: set transformers version <=4.34 temporarily (#538) justheuristic 2023-11-07 18:19:19 +0300
  • d1ec0076fc Fix beam search in GPU clients (#531) Alexander Borzunov 2023-10-23 20:13:13 +0400
  • 376de004fe Improve default arguments for clients and servers (#530) Alexander Borzunov 2023-10-23 05:26:40 +0600
  • 90780d6da2 Add position_ids argument to DistributedFalconModel (#525) Max Ryabinin 2023-10-08 22:09:46 +0300
  • 5b09563e84 Update README.md (#520) Alexander Borzunov 2023-09-22 06:16:32 +0400
  • 96d5cee0fe Fix file locks in NFS-mounted directories (#517) FYY 2023-09-19 20:01:23 -0400
  • 9157e13b94 Store (start_block, end_block) in each DHT record for reliability (#510) Alexander Borzunov 2023-09-15 23:53:57 +0400
  • 9b83ef6b1d Bump version to 2.2.0 (#502) Alexander Borzunov 2023-09-06 19:43:30 +0400
  • edcd8ff56f Optimize the Falcon block for inference (#500) Max Ryabinin 2023-09-04 14:38:32 +0200
  • 5b5285fc67 Fix prompt tuning after #464 (#501) Alexander Borzunov 2023-09-04 12:25:29 +0400
  • 5f36029534 Add Falcon support (#499) Alexander Borzunov 2023-09-04 01:45:37 +0400
  • b3952e16ee Force use_cache=True in config only (#497) Alexander Borzunov 2023-09-03 01:16:00 +0400
  • 85c0db6883 Force use_cache=True (#496) Alexander Borzunov 2023-09-02 22:57:18 +0400
  • 6065bc7140 Create model index in DHT (#491) Alexander Borzunov 2023-08-31 10:31:03 +0400
  • e3e9cbe358 Replace dots in repo names when building DHT prefixes (#489) Alexander Borzunov 2023-08-30 23:31:39 +0400
  • ad7cab39d6 Fix race condition in MemoryCache (#487) Alexander Borzunov 2023-08-30 14:13:43 +0400
  • 0034745f13 Wait for DHT storing state OFFLINE on shutdown (#486) Alexander Borzunov 2023-08-30 07:48:11 +0400
  • f68cced72a Fix `.generate(input_ids=...)` (#485) Alexander Borzunov 2023-08-30 06:59:33 +0400
  • 6205344c3d Remove no-op process in PrioritizedTaskPool (#484) Alexander Borzunov 2023-08-30 06:07:04 +0400
  • 20d04b73a8 Support macOS (#477) Alexander Borzunov 2023-08-29 07:49:27 +0400
  • f70d2a550c Refactor readme (#482) Alexander Borzunov 2023-08-28 19:09:13 +0400
  • c1bec34dd7 Rewrite MemoryCache alloc_timeout logic (#434) justheuristic 2023-08-28 16:01:50 +0300
  • 44f4498883 Fix requiring transformers>=4.32.0 (#480) Alexander Borzunov 2023-08-26 05:32:16 +0400
  • 1498e2936e Require transformers>=4.32.0 (#479) Alexander Borzunov 2023-08-25 01:37:30 +0400
  • f4ae9dcffc Don't install cpufeature on non-x86_64 machines (#478) Alexander Borzunov 2023-08-24 19:57:15 +0400
  • fe64dfcbf5 Bump version to 2.1.0 (#474) Alexander Borzunov 2023-08-24 19:42:19 +0400
  • 210ac116d7 Hide excess key message (#476) Alexander Borzunov 2023-08-24 00:41:40 +0400
  • 5eea7f04ab Update peft to 0.5.0 version (#475) Artem Chumachenko 2023-08-23 20:21:28 +0400
  • 76b7e0248d Support loading weights from Safetensors on server (#473) Alexander Borzunov 2023-08-23 01:43:29 +0400
  • da6e1e8084 Change transformers version assert (#472) justheuristic 2023-08-22 20:53:14 +0300
  • dcdc7f84e9 Support transformers 4.32.x (#471) justheuristic 2023-08-22 20:10:29 +0300
  • 005033c995 Temporarily require peft<0.5.0, transformers<4.32.0 (#470) justheuristic 2023-08-22 19:45:37 +0300
  • d2c68d92e7 Merge 7fdbc84b95 into d2fcbbc72e Daniel Ahern 2024-03-30 13:49:52 +0100
  • 94445cd0c7 Merge 3f66c3615a into d2fcbbc72e Artem Chumachenko 2024-03-29 14:26:47 +0300
  • 3f66c3615a Remove unnesc todo peft_update Artem Chumachenko 2024-03-29 12:26:41 +0100
  • 96cf244fae Update comments Artem Chumachenko 2024-03-29 12:23:51 +0100
  • b143055547 style Artem Chumachenko 2024-03-05 13:45:41 +0400
  • e90612221b Fix versions Artem Chumachenko 2024-03-05 13:45:16 +0400
  • 7d9afb0fb4 Fix trainability Artem Chumachenko 2024-03-05 13:08:06 +0400
  • a93221a9f3 Fix inference without adapter Artem Chumachenko 2024-03-04 13:48:33 +0400
  • cf4f80a020 lib number Artem Chumachenko 2024-02-27 16:53:59 +0400
  • 83c95518d9 Make fixes Artem Chumachenko 2024-02-27 16:50:14 +0400