Skip to content

Release/4.0#95

Draft
RobbinBouwmeester wants to merge 46 commits intomasterfrom
release/4.0
Draft

Release/4.0#95
RobbinBouwmeester wants to merge 46 commits intomasterfrom
release/4.0

Conversation

@RobbinBouwmeester
Copy link
Member

No description provided.

Alirezak2n and others added 30 commits March 14, 2025 15:23
I fix bugs and now it works perfectly
Add fine tuning class, dataset class, and make changes to doc string.
Remove TensorFlow and add PyTorch.
Remove redundancy.
PyTorch version of DeepLC
Cleanup and preliminary refactoring of feature extractor
- Remove ensemble-model mode (different kernel sizes).
- Split of model loading to separate function.
- Make dataset take peptidoforms instead of PSMs.
- Get unique peptidoforms before predicting and keep inverse index for mapping back predictions to input PSM list.
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO - meant to take in a PSM list and calibrate separate runs to a single run

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Skip GUI for now and focus mostly on streamlit/webserver?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Whowa, pytorch models are so much smaller

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems that some functionality is still only available through "combined" functions. Such as "predict_and_calibrate" shouldn't there also be a "calibrate" functionality?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add CCS feature logic can be removed, check if n-terminal (or c-terminal I guess) modifications are now handled correctly. We used to put them on either the first or last AA without regards to where they were located (n-terminal or residual). I think it is also fine to add an additional feature field in the matrix that allocates them specifically on the termini, but this would require retraining, so lets keep it as is for now...

raise CalibrationError("Source values have zero or invalid range; cannot calibrate.")

boundaries = np.linspace(cal_min, cal_max, self.number_of_splits + 1, dtype=np.float32)
starts: np.ndarray = np.searchsorted(source, boundaries[:-1], side="left") # type: ignore[var-annotated]
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is this type?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CCS feature logic can be removed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

4 participants