I can answer my own question. Jsoup can do what I need:
- Extract the content (tested)
- Change the link for Spring (tested)
- Save HTML (tested), images (tested), videos (not yet tested)
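
For anyone landing here, the steps above can be sketched roughly as follows. This is a minimal illustration, not my exact code: the `article` selector, the `/old/` and `/posts/` link prefixes, and the class and method names are all hypothetical placeholders, and it assumes Jsoup is on the classpath and Java 11+ for `Files.writeString`.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

public class PostImporter {

    // Extract the element holding the post content.
    // The selector is an assumption; adjust it to the source site's markup.
    static Element extractContent(Document doc, String selector) {
        return doc.selectFirst(selector);
    }

    // Rewrite links that start with oldPrefix so they point at newPrefix
    // (for example, the route the Spring application serves).
    static void rewriteLinks(Element content, String oldPrefix, String newPrefix) {
        for (Element link : content.select("a[href]")) {
            String href = link.attr("href");
            if (href.startsWith(oldPrefix)) {
                link.attr("href", newPrefix + href.substring(oldPrefix.length()));
            }
        }
    }

    // Collect absolute image URLs, resolved against the document's base URI.
    static List<String> imageUrls(Element content) {
        List<String> urls = new ArrayList<>();
        for (Element img : content.select("img[src]")) {
            urls.add(img.absUrl("src"));
        }
        return urls;
    }

    // Save the extracted HTML fragment to disk.
    static void saveHtml(Element content, Path target) throws IOException {
        Files.writeString(target, content.outerHtml());
    }

    // Download one binary resource (image, video) to disk. Requires network access.
    static void download(String url, Path target) throws IOException {
        byte[] body = Jsoup.connect(url).ignoreContentType(true).execute().bodyAsBytes();
        Files.write(target, body);
    }

    public static void main(String[] args) throws IOException {
        // Inline HTML keeps the demo offline; a real run would use Jsoup.connect(url).get().
        String html = "<html><body><article><a href=\"/old/post-1\">Post</a>"
                + "<img src=\"/img/a.png\"></article></body></html>";
        Document doc = Jsoup.parse(html, "https://example.com/");

        Element content = extractContent(doc, "article");
        rewriteLinks(content, "/old/", "/posts/");
        saveHtml(content, Files.createTempFile("post", ".html"));
        System.out.println(imageUrls(content));
    }
}
```

Videos should work the same way as images through `download`, since `ignoreContentType(true)` lets Jsoup fetch non-HTML responses, but as noted above I have not tested that part yet. Large files may also hit Jsoup's default body size limit, which can be raised with `maxBodySize`.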