reprozip: ReproZip hangs on fit_transform() method from sklearn.decomposition and phate

When running reprozip trace on an experiment that uses dimensionality reduction algorithms such as PCA or t-SNE from scikit-learn, or PHATE from the phate library, the trace hangs while executing the fit_transform() method.

Versions of libraries: scikit-learn 1.2.0, phate 1.0.10, pandas 1.3.5

System: Ubuntu 18.04.6 LTS

Sample code to reproduce the issue:

from pathlib import Path
import pandas as pd
from sklearn.decomposition import PCA

def main():
    path = Path("path to a csv file")
    X = pd.read_csv(path)

    print("Creating PCA object")
    pca = PCA(random_state=123, n_components=3)
    print("PCA object created")

    print("fitting PCA transform")
    X_pca = pca.fit_transform(X)  # It hangs here. The print call below is never reached.
    print("PCA transform fitted")

if __name__ == "__main__":
    main()

About this issue

  • State: open
  • Created a year ago
  • Comments: 15 (7 by maintainers)

Most upvoted comments

Thanks @milech, that should help narrow it down. Hopefully I can find where the issue is. ReproZip should not interfere with OpenMP like this.
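
Since the comment above points at OpenMP, one way to narrow this down might be to run the traced command with the native thread pools limited to a single thread and a timeout, and see whether it still hangs. The sketch below is only a diagnostic aid, not a confirmed fix: the helper names single_threaded_env and run_with_timeout are hypothetical, and it is an assumption (not something established in this thread) that limiting threads changes the behavior.

```python
import os
import subprocess
import sys

# Environment variables commonly honored by OpenMP, OpenBLAS and MKL thread pools.
THREAD_ENV_VARS = ("OMP_NUM_THREADS", "OPENBLAS_NUM_THREADS", "MKL_NUM_THREADS")


def single_threaded_env(base=None):
    """Return a copy of the environment with native thread pools limited to 1."""
    env = dict(os.environ if base is None else base)
    for name in THREAD_ENV_VARS:
        env[name] = "1"
    return env


def run_with_timeout(cmd, timeout_s, env=None):
    """Run cmd; return True if it exits within timeout_s seconds, else False.

    On timeout, subprocess.run kills the direct child process before raising.
    """
    try:
        subprocess.run(cmd, env=env, timeout=timeout_s, check=False)
        return True
    except subprocess.TimeoutExpired:
        return False


if __name__ == "__main__":
    # Harmless demonstration: a trivial Python command that exits immediately.
    finished = run_with_timeout(
        [sys.executable, "-c", "pass"], 30, env=single_threaded_env()
    )
    print("finished within timeout:", finished)
```

To apply it to this issue, the command could be replaced with something like ["reprozip", "trace", "python", "script.py"], where script.py stands in for the sample code above and the timeout is an arbitrary choice. If the run completes with one thread but times out with the default environment, that would be consistent with a threading-related hang, though it would not prove the cause.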