Page MenuHomePhabricator

cscott (C. Scott Ananian)
Parser whisperer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Oct 21 2014, 6:47 PM (552 w, 1 d)
Availability
Available
IRC Nick
cscott
LDAP User
C. Scott Ananian
MediaWiki User
Cscott [ Global Accounts ]

Editor since 2005; WMF developer since 2013. I work on Parsoid and OCG, and dabble with VE, real-time collaboration, and OOjs.

On github: https://github.com/cscott

See https://en.wikipedia.org/wiki/User:cscott for more.

Recent Activity

Yesterday

cscott added a comment to T394946: Extract wfDeprecated into a separate library.

a9f2434e6f88afafbe1ec099bd21a2363a0df692 is one example where a uniform mechanism would have been useful, instead we manually recreated the trigger_error mechanism.

Wed, May 21, 7:41 PM · Librarization
cscott created T394946: Extract wfDeprecated into a separate library.
Wed, May 21, 7:38 PM · Librarization
cscott closed T392118: Add metrics to track async content (Wikifunctions) SLO, a subtask of T390548: Establish an SLO for the Wikifunctions integration into Wikimedia projects' wikitext pages, to assure reader experience quality is maintained during roll-out, as Resolved.
Wed, May 21, 5:31 PM · Workstreams, OKR-Work, Abstract Wikipedia team (25Q4 (Apr–Jun))
cscott closed T392118: Add metrics to track async content (Wikifunctions) SLO as Resolved.
Wed, May 21, 5:31 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), OKR-Work, Abstract Wikipedia team (25Q4 (Apr–Jun)), Content-Transform-Team (Work In Progress)
cscott added a comment to T392118: Add metrics to track async content (Wikifunctions) SLO.

"Extra update MedaiWiki jobs due to Wikifunctions content" will be the mediawiki_refreshlinks_parsercache_operations_total metric with status=cache_miss and has_async_content=true. This the total number of refresh links jobs with async content. If you look at the label async_not_ready=true then these are jobs which are going to need to be repeated once the async content is ready, and so they are "extra" update jobs. In addition, there will be a few extra update jobs with async_not_ready=false when entries fall out of the parser cache, do to the way we currently handle updating async content. So the range of "extra" jobs is between the lower bound of the # of jobs with async_not_ready=true and the upper bound of the # of jobs with has_async_content=true. We can refine this metric further if/when the upper bound gets close to our SLI limit.

Wed, May 21, 5:30 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), OKR-Work, Abstract Wikipedia team (25Q4 (Apr–Jun)), Content-Transform-Team (Work In Progress)
cscott closed T365036: JSON serialization failures on media files as Resolved.
Wed, May 21, 5:03 PM · Content-Transform-Team (Work In Progress), Patch-Needs-Improvement, Essential-Work, MW-1.43-notes (1.43.0-wmf.6; 2024-05-21), JsonCodec, MediaWiki-Parser
cscott closed T392133: Async content needs !misermode as Resolved.
Wed, May 21, 5:02 PM · Essential-Work, MediaWiki-Parser, Abstract Wikipedia team, Content-Transform-Team (Work In Progress)
cscott closed T373480: Display categories in sorted order by default as Resolved.

Closing this task as (1) category sorting was enabled as an option in core, and (2) investigation revealed that many editors currently expect categories to be an ordered list *not* a set. So we'll have to apply that learning to metadata update when we work on selective update.

Wed, May 21, 5:02 PM · Content-Transform-Team (Work In Progress), MediaWiki-Categories, MediaWiki-User-Interface, Parsoid-Read-Views (Phase 1 - DiscussionTools support), MW-1.43-notes (1.43.0-wmf.21; 2024-09-03), User-notice
cscott closed T373383: Parsoid doesn't generate tracking categories for the use of magic words (ISBN, RFC, PMID) as Resolved.

Verified in parser tests; can't verify in production because production doesn't (yet) use Parsoid for the refreshlinksjob.

Wed, May 21, 4:58 PM · Essential-Work, Content-Transform-Team (Work In Progress), Parsoid
cscott closed T391788: Parser limit reporting doesn't work on FlaggedRevs pages as Resolved.

Verified that limit report appears on https://de.wikipedia.org/wiki/Johanne_Karoline_Wilhelmine_Spazier

Wed, May 21, 4:57 PM · Content-Transform-Team (Work In Progress), MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), FlaggedRevs
cscott moved T380530: Add Parsoid-compatible <link> tag to legacy parser output for redirects from To Verify to Code Review on the Content-Transform-Team (Work In Progress) board.
Wed, May 21, 4:55 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), Essential-Work, Content-Transform-Team (Work In Progress), Patch-For-Review, Accessibility, MediaWiki-Redirects
cscott added a comment to T380530: Add Parsoid-compatible <link> tag to legacy parser output for redirects.

Added a patch to add 'href'.

Wed, May 21, 4:55 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), Essential-Work, Content-Transform-Team (Work In Progress), Patch-For-Review, Accessibility, MediaWiki-Redirects
cscott added a comment to T380530: Add Parsoid-compatible <link> tag to legacy parser output for redirects.

Currently
https://en.wikipedia.org/w/index.php?title=H2O&redirect=no&useparsoid=1 contains <link rel="mw:PageProp/redirect" href="./Water" id="mwAg"> and
https://en.wikipedia.org/w/index.php?title=H2O&redirect=no&useparsoid=0 contains <link rel="mw:PageProp/redirect"> I'm not sure why the href is missing?

Wed, May 21, 4:44 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), Essential-Work, Content-Transform-Team (Work In Progress), Patch-For-Review, Accessibility, MediaWiki-Redirects
cscott closed T392436: Parsoid-only parser functions aren't specifically listed on Special:Version as Resolved.

https://www.wikifunctions.org/wiki/Special:Version lists {{#function}} as a Parsoid-only module, seems to work.

Wed, May 21, 4:41 PM · Essential-Work, MediaWiki-Special-pages, MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), Content-Transform-Team (Work In Progress), Parsoid, Abstract Wikipedia team

Tue, May 20

cscott created T394836: Use refactored grammar for `{{....}}` constructs.
Tue, May 20, 7:14 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott renamed T393391: Refactor PEG grammar for transclusions from Refactor PEG grammar for transclusion arguments to Refactor PEG grammar for transclusions.
Tue, May 20, 7:13 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott renamed T394834: Use refactored PEG grammar for `{{#....}}` constructs from Use refactored PEG grammer for `{{#....}}` constructs to Use refactored PEG grammar for `{{#....}}` constructs.
Tue, May 20, 7:12 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
Jdforrester-WMF awarded T394834: Use refactored PEG grammar for `{{#....}}` constructs a Like token.
Tue, May 20, 7:10 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott created T394834: Use refactored PEG grammar for `{{#....}}` constructs.
Tue, May 20, 7:09 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Mon, May 19

cscott added a comment to T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.

The goal here is to move to the more standards-compliant PHP8.4 implementations as soon as WMF production is ready for them. We intend to switch to the PHP8.4 DOM classes in CI right away (https://gerrit.wikimedia.org/r/c/mediawiki/services/parsoid/+/704745), so we don't regress.

Mon, May 19, 8:41 PM · Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid, RemexHtml
cscott assigned T391655: Proposal: allow clients to control caching behavior of HTML endpoints using a cache-control header to daniel.
Mon, May 19, 3:14 PM · Content-Transform-Team, Essential-Work, Patch-For-Review, MW-Interfaces-Team, RESTBase-API, MediaWiki-Parser
cscott added a comment to T391655: Proposal: allow clients to control caching behavior of HTML endpoints using a cache-control header.

I wonder about the security (DoS) implications of allowing clients to control cacheability. I think 'no-store' is the potentially problematic one, given that you are interpreting 'no-cache' as "no stale" not actually "no cache". But "no-store" could be used to DoS us, and I feel like we should put some guardrails around its use.

Mon, May 19, 3:14 PM · Content-Transform-Team, Essential-Work, Patch-For-Review, MW-Interfaces-Team, RESTBase-API, MediaWiki-Parser
cscott moved T394059: Post-cache output transforms are expensive on large pages from To Deploy to Code Review on the Content-Transform-Team (Work In Progress) board.
Mon, May 19, 3:08 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Patch-For-Review, Content-Transform-Team (Work In Progress), MediaWiki-Platform-Team (Radar), Wikimedia-Performance-recommendation, Performance Issue, MediaWiki-Parser
cscott assigned T394270: LogicException: Title not found! to ssastry.
Mon, May 19, 3:07 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
cscott assigned T376943: Linter bogus file option not detected when the File: invocation is wrapped in indicator tags to Arlolra.
Mon, May 19, 3:06 PM · MW-1.45-notes (1.45.0-wmf.3; 2025-05-27), Patch-For-Review, Content-Transform-Team (Work In Progress), Essential-Work, Parsoid
CDanis awarded T387130: CVE-2025-32699: Potential javascript injection attack enabled by Unicode normalization in Action API a Pirate Logo token.
Mon, May 19, 2:35 PM · MW-Interfaces-Team, Essential-Work, Content-Transform-Team (Work In Progress), SecTeam-Processed, Vuln-Inject, Vuln-XSS, MediaWiki-Action-API, Security, Security-Team
kamila awarded T387130: CVE-2025-32699: Potential javascript injection attack enabled by Unicode normalization in Action API a Pirate Logo token.
Mon, May 19, 11:31 AM · MW-Interfaces-Team, Essential-Work, Content-Transform-Team (Work In Progress), SecTeam-Processed, Vuln-Inject, Vuln-XSS, MediaWiki-Action-API, Security, Security-Team

Fri, May 16

andrea.denisse awarded T387130: CVE-2025-32699: Potential javascript injection attack enabled by Unicode normalization in Action API a Burninate token.
Fri, May 16, 7:00 PM · MW-Interfaces-Team, Essential-Work, Content-Transform-Team (Work In Progress), SecTeam-Processed, Vuln-Inject, Vuln-XSS, MediaWiki-Action-API, Security, Security-Team

Thu, May 15

cscott moved T373252: Ensure Parsoid can issue batch precompute requests for external services like wikifunctions from Backlog to Later on the Content-Transform-Team board.
Thu, May 15, 2:39 PM · Content-Transform-Team, Abstract Wikipedia team, OKR-Work, Wikifunctions, Parsoid
cscott moved T348722: Indicators in templates (PTag - phab page indicator; Geo (wikivoyage maps)) broken with Parsoid (mediawiki, metawiki, enwikivoyage) from Backlog to Later on the Content-Transform-Team board.
Thu, May 15, 2:39 PM · Parsoid-Read-Views (Small Size Wikipedias), Content-Transform-Team, Parsoid
cscott moved T366986: MW 1.41 parser locked error when using languages with grammatical gender from Backlog to Later on the Content-Transform-Team board.
Thu, May 15, 2:38 PM · Content-Transform-Team, MW-1.44-release, MW-1.43-release, MediaWiki-Parser, MW-1.41-release
cscott added a comment to T366986: MW 1.41 parser locked error when using languages with grammatical gender.

Tag #ctt-attention on gerrit if you upload a patch for this.

Thu, May 15, 2:38 PM · Content-Transform-Team, MW-1.44-release, MW-1.43-release, MediaWiki-Parser, MW-1.41-release
cscott claimed T372387: Wrong section numbering if Parsoid is used and wikitext is invalid.
Thu, May 15, 2:36 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid
cscott edited projects for T372387: Wrong section numbering if Parsoid is used and wikitext is invalid, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 15, 2:35 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid
cscott moved T376943: Linter bogus file option not detected when the File: invocation is wrapped in indicator tags from Backlog to To Verify on the Content-Transform-Team (Work In Progress) board.
Thu, May 15, 2:34 PM · MW-1.45-notes (1.45.0-wmf.3; 2025-05-27), Patch-For-Review, Content-Transform-Team (Work In Progress), Essential-Work, Parsoid
cscott edited projects for T376943: Linter bogus file option not detected when the File: invocation is wrapped in indicator tags, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 15, 2:34 PM · MW-1.45-notes (1.45.0-wmf.3; 2025-05-27), Patch-For-Review, Content-Transform-Team (Work In Progress), Essential-Work, Parsoid
cscott closed T393904: Bump memory of testreduce1002 as Resolved.
Thu, May 15, 2:33 PM · Content-Transform-Team, serviceops
cscott moved T394059: Post-cache output transforms are expensive on large pages from Backlog to To Deploy on the Content-Transform-Team (Work In Progress) board.
Thu, May 15, 2:33 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Patch-For-Review, Content-Transform-Team (Work In Progress), MediaWiki-Platform-Team (Radar), Wikimedia-Performance-recommendation, Performance Issue, MediaWiki-Parser
cscott edited projects for T394059: Post-cache output transforms are expensive on large pages, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 15, 2:33 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Patch-For-Review, Content-Transform-Team (Work In Progress), MediaWiki-Platform-Team (Radar), Wikimedia-Performance-recommendation, Performance Issue, MediaWiki-Parser
cscott moved T394270: LogicException: Title not found! from Backlog to To Verify on the Content-Transform-Team (Work In Progress) board.
Thu, May 15, 2:32 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
cscott edited projects for T394270: LogicException: Title not found!, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 15, 2:32 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Parsoid, Wikimedia-production-error
cscott edited projects for T394296: parsoid pcache keys get are too big, added: Content-Transform-Team (Work In Progress); removed Content-Transform-Team.
Thu, May 15, 2:31 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott added a project to T394296: parsoid pcache keys get are too big: Parsoid-Read-Views (Performance and Cache research Q4 FY24-25).
Thu, May 15, 2:30 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid

Wed, May 14

cscott added a comment to T394053: PHP Warning: Invalid label key: 'same-wt'.

I am recommending a rollback of https://gerrit.wikimedia.org/r/c/mediawiki/core/+/1133094 rather than a new release of Parsoid to address this for this week's train, as the strict label checking will cause a stats dropout of other affected code in addition to causing logspam.

Wed, May 14, 1:55 PM · MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), Content-Transform-Team, Parsoid, Wikimedia-production-error

Tue, May 13

cscott added a comment to T349310: DOMException: Invalid Character Error.

It turns out this is an upstream spec bug: see https://github.com/whatwg/dom/issues/849 and https://github.com/whatwg/dom/issues/769.

Tue, May 13, 10:40 PM · MW-1.42-notes (1.42.0-wmf.23; 2024-03-19), Content-Transform-Team-WIP, Patch-For-Review, Essential-Work, User-brennen, Parsoid, Wikimedia-production-error
cscott added a comment to T393983: `Error: Class "Wikimedia\Parsoid\Wt2Html\XMLSerializer" not found` in PHPUnit tests.

There is a class_alias, but Composer doesn't support generating autoload data for class aliases, so the class is not loaded when the alias is used.

Tue, May 13, 1:27 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), DiscussionTools, ci-test-error (WMF-deployed Build Failure), Parsoid

Mon, May 12

cscott updated the task description for T393916: CTT tasks week of 2025-05-09.
Mon, May 12, 10:35 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Essential-Work
cscott updated the task description for T393916: CTT tasks week of 2025-05-09.
Mon, May 12, 10:30 PM · MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Content-Transform-Team (Work In Progress), Essential-Work
cscott closed T236205: Clean up Parsoid metrics as Resolved.

Closing this task as we renamed a bunch of these with the prometheus migration.

Mon, May 12, 9:36 PM · Parsoid
cscott closed T94219: Add Additional Parsoid Performance Metrics as Resolved.

We've added a lot of new performance metrics as part of the integration with core, inheriting most of core's limit report metrics, new metrics from the parser cache, etc. Closing.

Mon, May 12, 9:35 PM · Parsoid
cscott added a parent task for T122390: Is RDFa metadata in Parsoid HTML head actually useful to you / no user name & edit comment suppression in Parsoid <head> metadata: T393925: Parsoid should generate the <head> on the core side, from the ParserOutput metadata.
Mon, May 12, 4:05 PM · Platform Team Legacy (Watching / External), Product-Infrastructure-Team-Backlog-Deprecated, Services (watching), Mobile-Content-Service, RESTBase, VisualEditor, Parsoid
cscott added a parent task for T128525: Strip old metadata from old Parsoid content <head>: mw:TimeUuid, user, comment: T393925: Parsoid should generate the <head> on the core side, from the ParserOutput metadata.
Mon, May 12, 4:05 PM · Platform Team Legacy (Later), User-Clarakosi, Services (next), RESTBase
cscott added a parent task for T178037: Parsoid uses non-canonical URL encoding in <link> in <head>: T393925: Parsoid should generate the <head> on the core side, from the ParserOutput metadata.
Mon, May 12, 4:05 PM · Platform Team Legacy (Later), Parsoid, Services (later), RESTBase
cscott added subtasks for T393925: Parsoid should generate the <head> on the core side, from the ParserOutput metadata: T128525: Strip old metadata from old Parsoid content <head>: mw:TimeUuid, user, comment, T122390: Is RDFa metadata in Parsoid HTML head actually useful to you / no user name & edit comment suppression in Parsoid <head> metadata, T178037: Parsoid uses non-canonical URL encoding in <link> in <head>.
Mon, May 12, 4:05 PM · MW-1.43-notes (1.43.0-wmf.23; 2024-09-17), Parsoid-Read-Views (Phase 4 - Parsoid generates metadata needed by core), Parsoid
cscott created T393925: Parsoid should generate the <head> on the core side, from the ParserOutput metadata.
Mon, May 12, 4:04 PM · MW-1.43-notes (1.43.0-wmf.23; 2024-09-17), Parsoid-Read-Views (Phase 4 - Parsoid generates metadata needed by core), Parsoid
cscott claimed T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.
Mon, May 12, 3:50 PM · Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid, RemexHtml
cscott created T393922: Make Parsoid/Zest/Remex safe for use with PHP8.4 Dom\Document.
Mon, May 12, 3:48 PM · Patch-For-Review, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid, RemexHtml
cscott moved T373472: Make ParserMigration indicator optional from Backlog to Q4 FY24-25 on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:22 PM · Content-Transform-Team (Work In Progress), MW-1.44-notes (1.44.0-wmf.6; 2024-12-03), OKR-Work, MW-1.43-notes (1.43.0-wmf.6; 2024-05-21), MediaWiki-extensions-ParserMigration
cscott moved T365371: ParserMigration: Add "report visual bug" link from Backlog to Q4 FY24-25 on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:21 PM · Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Small Size Wikipedias), OKR-Work, MediaWiki-extensions-ParserMigration
cscott moved T393391: Refactor PEG grammar for transclusions from Backlog to In Progress on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:21 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott moved T389623: Parsoid fails to parse [[{}]] single curly brackets inside double rectangular brackets from Backlog to In Progress on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:21 PM · OKR-Work, Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress), Parsoid
cscott claimed T389623: Parsoid fails to parse [[{}]] single curly brackets inside double rectangular brackets.
Mon, May 12, 3:21 PM · OKR-Work, Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress), Parsoid
cscott added a comment to T363484: Update ParserMigration notice.

That design seems feasible. Just a note that in addition to the "report visual bug" UX we're also looking for a replacement design for the "parsoid icon" indicator to indicate that parsoid is in use on the page -- the replacement could be "no replacement", a message in the footer, a revised indicator, etc -- we should just document what the proposed replacement is.

Mon, May 12, 3:15 PM · Parsoid-Read-Views (Small Size Wikipedias), Content-Transform-Team (Work In Progress), OKR-Work, MediaWiki-extensions-ParserMigration
cscott claimed T384151: DiscussionTools gives error for the second message being written.
Mon, May 12, 3:10 PM · OKR-Work, Patch-For-Review, MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid, Content-Transform-Team (Work In Progress), DiscussionTools
cscott moved T346196: Wikimedia\Assert\InvariantException: Invariant failed: Expected valid DSR from Code Review to To Verify on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:09 PM · Content-Transform-Team (Work In Progress), OKR-Work, Patch-For-Review, Parsoid, Wikimedia-production-error
cscott moved T391109: Consider parsing transcluded special pages as raw HTML from Code Review to In Progress on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:07 PM · Essential-Work, Patch-For-Review, Content-Transform-Team (Work In Progress), MediaWiki-Parser, MediaWiki-Special-pages
cscott moved T380530: Add Parsoid-compatible <link> tag to legacy parser output for redirects from To Deploy to To Verify on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:06 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), Essential-Work, Content-Transform-Team (Work In Progress), Patch-For-Review, Accessibility, MediaWiki-Redirects
cscott moved T392118: Add metrics to track async content (Wikifunctions) SLO from To Deploy to To Verify on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:06 PM · MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), OKR-Work, Abstract Wikipedia team (25Q4 (Apr–Jun)), Content-Transform-Team (Work In Progress)
cscott moved T392133: Async content needs !misermode from To Deploy to To Verify on the Content-Transform-Team (Work In Progress) board.
Mon, May 12, 3:05 PM · Essential-Work, MediaWiki-Parser, Abstract Wikipedia team, Content-Transform-Team (Work In Progress)
cscott added a comment to T219543: UX review of Special:SpecialPages.

Without "category" being sortable, you can't recreate the old organization of the page (grouped by category), which seems like a regression in functionality.

Mon, May 12, 2:01 AM · MW-1.45-notes (1.45.0-wmf.2; 2025-05-20), User-notice, Wikimedia-Hackathon-2025, Design, Wikimedia-Design, MW-1.41-notes (1.41.0-wmf.22; 2023-08-15), MediaWiki-Special-pages

Fri, May 9

cscott created T393795: Hash extensions for extension tags.
Fri, May 9, 3:28 PM · MediaWiki-Parser-Templates

Thu, May 8

cscott added a comment to T393726: Cache WikiLink processing in WikiLinkHandler.

Note that the work I'm doing for T393391: Refactor PEG grammar for transclusions also can result in "parsed less" [[...]] and [...] tokens. So maybe it makes sense to wait to tackle this one until I've landed that task, so we're not creating conflicts?

Thu, May 8, 8:41 PM · Patch-For-Review, Content-Transform-Team (Work In Progress), Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Parsoid
cscott added a comment to T325322: Performance implications of using dynamic properties in NodeData in newer versions of PHP.

I did a quick test in the above patch of removing dynamic properties in DataMw and replacing them with an associative array and __get/__set magic methods.

Thu, May 8, 3:47 PM · Patch-For-Review, OKR-Work, Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), Content-Transform-Team (Work In Progress), Parsoid
cscott closed T390799: Create Parsoid 0.21.0 for MW 1.44.0, a subtask of T390686: Release MediaWiki 1.44.0, as Resolved.
Thu, May 8, 2:49 PM · MW-1.44-release, MW-1.44-notes, Release, MediaWiki-Releasing
cscott closed T390799: Create Parsoid 0.21.0 for MW 1.44.0 as Resolved.
Thu, May 8, 2:49 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Essential-Work, Content-Transform-Team (Work In Progress), MW-1.44-release
cscott moved T353697: Parsoid/legacy parser {{Pre}} template rendering difference from Q4 FY24-25 to Backlog on the Content-Transform-Team (Work In Progress) board.
Thu, May 8, 2:48 PM · Content-Transform-Team (Work In Progress), OKR-Work, Patch-For-Review, Parsoid-Read-Views (Phase 1 - DiscussionTools support), Parsoid
cscott claimed T389687: Difference in TOC metadata with some broken markup.
Thu, May 8, 2:47 PM · Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress), Parsoid
cscott moved T389687: Difference in TOC metadata with some broken markup from Q4 FY24-25 to In Progress on the Content-Transform-Team (Work In Progress) board.
Thu, May 8, 2:47 PM · Parsoid-Read-Views (Wiktionary Q3 FY2024-25), Content-Transform-Team (Work In Progress), Parsoid
cscott moved T363484: Update ParserMigration notice from Q4 FY24-25 to Blocked on the Content-Transform-Team (Work In Progress) board.
Thu, May 8, 2:46 PM · Parsoid-Read-Views (Small Size Wikipedias), Content-Transform-Team (Work In Progress), OKR-Work, MediaWiki-extensions-ParserMigration
cscott moved T390799: Create Parsoid 0.21.0 for MW 1.44.0 from Q4 FY24-25 to In Progress on the Content-Transform-Team (Work In Progress) board.
Thu, May 8, 2:45 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Essential-Work, Content-Transform-Team (Work In Progress), MW-1.44-release
cscott merged T330036: Linter updates in a Parsoid-in-ParserCache world into T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:33 PM · MediaWiki-extensions-Linter
cscott merged task T330036: Linter updates in a Parsoid-in-ParserCache world into T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:33 PM · MediaWiki-Platform-Team (Radar), RESTBase Sunsetting, MediaWiki-extensions-Linter
cscott added a comment to T330036: Linter updates in a Parsoid-in-ParserCache world.

At the last engineering offsite, we decided that putting lints into ParserOutput was the future direction here. I'm going to close this as a dupe of T393717: Put lints in ParserOutput/RefreshLinksJob to reflect this consensus.

Thu, May 8, 2:32 PM · MediaWiki-Platform-Team (Radar), RESTBase Sunsetting, MediaWiki-extensions-Linter
cscott merged T370624: Reduce concurrency of RecordLintJobs or shard it per section into T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:32 PM · MediaWiki-extensions-Linter
cscott merged task T370624: Reduce concurrency of RecordLintJobs or shard it per section into T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:32 PM · Content-Transform-Team, RESTBase Sunsetting, MediaWiki-extensions-Linter
cscott added a comment to T370624: Reduce concurrency of RecordLintJobs or shard it per section.

We decided at the last engineering offsite that Lints are going to be put into ParserOutput, and that we'll move the DB maintenance to RefreshLinksJob, although not until RLJ is powered by Parsoid (T393716). I'm going to close this as a duplicate of T393717: Put lints in ParserOutput/RefreshLinksJob to reflect this consensus.

Thu, May 8, 2:31 PM · Content-Transform-Team, RESTBase Sunsetting, MediaWiki-extensions-Linter
cscott added a subtask for T393717: Put lints in ParserOutput/RefreshLinksJob: T393716: RefreshLinksJob should use Parsoid-generated metadata.
Thu, May 8, 2:27 PM · MediaWiki-extensions-Linter
cscott added a parent task for T393716: RefreshLinksJob should use Parsoid-generated metadata: T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:27 PM · Content-Transform-Team
cscott created T393717: Put lints in ParserOutput/RefreshLinksJob.
Thu, May 8, 2:26 PM · MediaWiki-extensions-Linter
cscott added a parent task for T310511: Metadata comparison testing between Parsoid and the legacy parser: T393716: RefreshLinksJob should use Parsoid-generated metadata.
Thu, May 8, 2:22 PM · Parsoid-Read-Views (Phase 4 - Parsoid generates metadata needed by core), Parsoid
cscott added a parent task for T310512: Parsoid and the legacy parser should emit exactly the same ParserOutput metadata: T393716: RefreshLinksJob should use Parsoid-generated metadata.
Thu, May 8, 2:22 PM · MW-1.43-notes (1.43.0-wmf.23; 2024-09-17), Parsoid-Read-Views (Phase 4 - Parsoid generates metadata needed by core), Parsoid
cscott added subtasks for T393716: RefreshLinksJob should use Parsoid-generated metadata: T310512: Parsoid and the legacy parser should emit exactly the same ParserOutput metadata, T310511: Metadata comparison testing between Parsoid and the legacy parser.
Thu, May 8, 2:22 PM · Content-Transform-Team
cscott created T393716: RefreshLinksJob should use Parsoid-generated metadata.
Thu, May 8, 2:22 PM · Content-Transform-Team

Wed, May 7

cscott claimed T390799: Create Parsoid 0.21.0 for MW 1.44.0.
Wed, May 7, 4:57 PM · MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Essential-Work, Content-Transform-Team (Work In Progress), MW-1.44-release

Mon, May 5

cscott added a comment to T235295: MathML tags are missing xmlns attribute.

And just for the record, my response to this:

Mon, May 5, 10:47 PM · Parsoid-PHP
cscott added a comment to T363484: Update ParserMigration notice.

Design is going to synthesize a concrete recommendation here by end of May and CTT will implement it in June.

Mon, May 5, 5:52 PM · Parsoid-Read-Views (Small Size Wikipedias), Content-Transform-Team (Work In Progress), OKR-Work, MediaWiki-extensions-ParserMigration
cscott added a comment to T392775: Add link color for temporary usernames.

I think the work done in https://gerrit.wikimedia.org/r/c/mediawiki/core/+/1139193 for this task may already be covered by the patch https://gerrit.wikimedia.org/r/c/mediawiki/core/+/1138396 we are working on for T389474: CheckUser: Special:GlobalContributions should highlight temporary accounts, since we are moving the logic to determine the CSS classes to apply form CheckUser's GlobalContributions (https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CheckUser/+/1137000) to core.

Mon, May 5, 5:02 PM · Essential-Work, Patch-For-Review, MediaWiki-General, Content-Transform-Team (Work In Progress), Temporary accounts (Major pilot wiki deployment)
cscott closed T329457: IP Masking Considerations: services/parsoid, a subtask of T326874: Update Content Transform Team-owned products that may be affected by IP Masking, as Resolved.
Mon, May 5, 4:59 PM · Content-Transform-Team (Work In Progress), Trust and Safety Product Team, Essential-Work, Temporary accounts
cscott closed T329457: IP Masking Considerations: services/parsoid as Resolved.
Mon, May 5, 4:59 PM · Content-Transform-Team (Work In Progress), Trust and Safety Product Team, Essential-Work, Temporary accounts
cscott added a comment to T348254: Add ParserOutput::getHtmlHolder().

Yes, but the whole point of ContentHolder was to avoid having to boil the ocean in a single day. We have existing passes out there, and existing users of text-based passes via the extension postprocess hook (including DiscussionTools, and ParserOutput serialization/deserialization). By creating an abstraction we can work toward the goal of moving everything to DOM without having to do it all at once. And, as @ihurbain notes, the existing pipelines we have to work with are very sensitive to stage order (unfortunately!) so "just" moving all the DOM-based passes to the front/back isn't an easy solution.

Mon, May 5, 4:47 PM · Parsoid-Read-Views (Performance and Cache research Q4 FY24-25), MediaWiki-Parser, Parsoid