Conversation

@stevhliu
Member

cleans up the Optimization section to remove redundant content

  • removes the hardware-specific sections (GPU/CPU), since many of the optimizations in those docs are already covered elsewhere
  • gives Optimum its own section in the docs for more visibility and combines its GPU and CPU usage there
  • gives assisted decoding its own section in the docs for more visibility as well, since it isn't really a decoding strategy in the way the other search/sampling methods are
  • removes the Optimizing inference doc, since it also contains redundant content and will be replaced by the newer overview in [docs] optimizations quickstart #42538

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@Cyrilvallez Cyrilvallez left a comment

LGTM!

3 participants