Increasingly, we turn to instructional videos for accomplishing everyday tasks and learning new skills such as sewing a face mask, cooking, and household repairs. Improving Remote Learning via Hierarchical Decomposition of Instructional Videos will facilitate the creation of instructional video to allow better navigation by hierarchically segmenting tasks into steps and providing voice-based navigation commands for accessing the steps; accomplishing this with algorithms that can automatically learn shared action steps from videos across different tasks by explicitly leveraging the conjugate constraints between actions and states.