Added Dtypes function to DataFrame #112

Draft
alexott wants to merge 1 commit into apache:master from alexott:dtypes-dataframe-function
Conversation

@alexott
Contributor

alexott commented Jan 4, 2025

What changes were proposed in this pull request?

This PR adds the Dtypes function to DataFrame.

Why are the changes needed?

To bring the implementation on par with the Python version.

Does this PR introduce any user-facing change?

How was this patch tested?

Added an integration test.

@alexott
Contributor Author

alexott commented Jan 4, 2025

@grundprinzip a good question is the compatibility of type names: PySpark returns names like bigint for Long, int for Integer, etc.
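
To make the naming question concrete, here is a minimal sketch of translating type names into the short names PySpark reports in `DataFrame.dtypes`. Everything in it is hypothetical and for illustration only: the `Field` struct, the `pySparkNames` table, and the `simpleName` and `dtypes` helpers are not part of this PR or of the spark-connect-go API, and the input spellings ("Long", "Integer", ...) are assumed.

```go
package main

import "fmt"

// Field is a hypothetical (column name, type name) pair used only for this
// sketch; it is not the schema type used by spark-connect-go.
type Field struct {
	Name     string
	TypeName string // assumed spelling, e.g. "Long" or "Integer"
}

// pySparkNames maps the assumed type names to the short names that PySpark
// shows in DataFrame.dtypes.
var pySparkNames = map[string]string{
	"Byte":    "tinyint",
	"Short":   "smallint",
	"Integer": "int",
	"Long":    "bigint",
	"Float":   "float",
	"Double":  "double",
	"Boolean": "boolean",
	"String":  "string",
}

// simpleName returns the PySpark-style short name for a type name.
// Names without a mapping are returned unchanged.
func simpleName(typeName string) string {
	if short, ok := pySparkNames[typeName]; ok {
		return short
	}
	return typeName
}

// dtypes returns one (column name, short type name) pair per field,
// mirroring the shape of PySpark's DataFrame.dtypes.
func dtypes(fields []Field) [][2]string {
	out := make([][2]string, 0, len(fields))
	for _, f := range fields {
		out = append(out, [2]string{f.Name, simpleName(f.TypeName)})
	}
	return out
}

func main() {
	fields := []Field{
		{Name: "id", TypeName: "Long"},
		{Name: "age", TypeName: "Integer"},
	}
	for _, pair := range dtypes(fields) {
		fmt.Println(pair[0], pair[1])
	}
}
```

A lookup table like this keeps the translation in one place, so nested types such as struct could be added later without changing callers.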

@grundprinzip
Contributor

So I was wondering how much we need this function. The PySpark API has some other interesting functionality, such as creating a DDL string from a DataFrame or, more generally, converting a schema to a SQL string and SQL types. Personally, I would maybe pause this PR until we have additional data type support (e.g. we're currently missing struct in the Arrow-to-Go conversion).

@alexott
Contributor Author

alexott commented Jan 6, 2025

Yep, I picked it up just because it's in the list, but I'm not 100% sure how much we need it.
