Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/falcon #71

Open
wants to merge 40 commits into
base: develop
Choose a base branch
from
Open

Feature/falcon #71

wants to merge 40 commits into from

Conversation

xeon27
Copy link
Collaborator

@xeon27 xeon27 commented Sep 15, 2023

PR Type ([Feature | Fix | Documentation | Test])

Feature

Short Description

  1. Enabled Accelerate DeepSpeed for multi-GPU
  2. Bumped up GPUs for both falcon versions (7B-2, 40B-4) enabling longer output sequences

echo "SLURM_GPUS_ON_NODE"=$SLURM_GPUS_ON_NODE
echo "SLURM_CPUS_ON_NODE"=$SLURM_CPUS_ON_NODE
echo "SLURM_PROCID"=$SLURM_PROCID

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Recommend you just run env instead so you get all the Slurm variables, along with other environment variables tha tmight be helpful

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am just printing these for debug purposes, can you elaborate further on using env to print them?

Copy link
Collaborator

@markcoatsworth markcoatsworth left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've added a bunch of comments. Couple points of confusion, nothing too serious. Also there is lots of commented-out code here that creates clutter, this should be deleted.

web/utils/triton.py Outdated Show resolved Hide resolved
web/utils/triton.py Outdated Show resolved Hide resolved
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants