-
Notifications
You must be signed in to change notification settings - Fork 1.9k
GetColumn method is extension for IDataView but not mlContext. #2473
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Can I have @TomFinley approval of this issue? |
I think so. What is our intent with this API? I see a couple problems: Usage of Generally What do we think about that? |
Also, I think this extension like other accessors on columns should take the |
My intent is to have consistency in across our APIs. From what I see in code So from what I'm hearing from you, you more leaning towards leave it as part of |
You are thinking of #1098. But that is creation of components. But what is being created? Usually such things as estimators, some conveniences to make data views. Yet, we do not involve Likewise, and most relevant ot this point, we do not use Indeed, I have observed a disturbing misconception about the intent of As far as I can tell, there is no genuine, real need to have an Let me know if you feel differently, but I have some conviction on this point. |
#1708 moved CreateEnumerable from extension on top of DataView to become extension on mlContext object.
We still have GetColumn extension
machinelearning/src/Microsoft.ML.Data/Utilities/ColumnCursor.cs
Line 25 in 7fc7e50
which works on top of DataView. Shall we move it to mlContext?
var enumerable = mlContext.CreateEnumerable<SamplesUtils.DatasetUtils.SampleTemperatureData>(filteredData, reuseRowObject: true);
var originalColumnBack = transformedData_default.GetColumn<VBuffer<ReadOnlyMemory<char>>>(mlContext, defaultColumnName);
They look silly together.
The text was updated successfully, but these errors were encountered: