-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Description
Although chapter 4 addresses the implementation of dispatchers and handlers for cleaning, chunking and embedding posts, articles and repositories, the current version of the pipeline only includes articles from different sources.
I guess it must have something to do with Linkedin and Github difficulting to create crawlers as they may protect their endpoints with user and password. But I'd expect the "import data warehouse from JSON" to work without internet connection. In particular, this command:
poetry poe run-import-data-warehouse-from-jsonTo achieve that, can the corresponding files...
be populated?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels