Skip to content

Add DataFrame API Documentation for DataFusion Python #1132

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

kosiew
Copy link
Contributor

@kosiew kosiew commented May 23, 2025

Which issue does this PR close?

Rationale for this change

This PR introduces comprehensive documentation for the DataFrame API in the DataFusion Python library. It enhances the usability and discoverability of the DataFrame functionalities, providing developers with structured, example-rich content on how to create, manipulate, and render DataFrames using DataFusion.

What changes are included in this PR?

  • Added dataframe.rst containing:
    • Overview of the DataFrame abstraction
    • Examples of DataFrame creation from various sources
    • Common operations (select, filter, aggregate, join, etc.)
    • Terminal operations and output formats
    • HTML rendering features in Jupyter with customization options
    • Advanced formatting examples and best practices
  • Added api/index.rst as a new section in the documentation to include the DataFrame API reference
  • Updated the root index.rst to link to the new API documentation section

Are these changes tested?

Since this is a documentation-only PR, traditional unit tests do not apply. The added examples can be used interactively to verify correctness.

Are there any user-facing changes?

Yes. This PR introduces a new "DataFrame API" documentation section that users can navigate to from the main documentation index. It improves the developer experience by offering detailed and formatted guides on using the DataFusion Python API.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add a dataframe.rst for #1100
1 participant