Hello, Jensen Head
There is a distinct separation between "accessibility tagging" with its application of the paragraph (and similar) "tags", and the "paragraph formatting" options, most of which are now present in the properties pane while editing text. I apologize for not explaining this clearly enough in my previous posts.
Jensen Head wrote: ↑Mon Jun 02, 2025 8:42 am
After the explanations in the
#44978, I realized that paragraphs are not the essence of the PDF format, they are not there, they are a convention introduced by the editor itself for the convenience of the user.
First, I should clarify, the "paragraph" tags in question from the more recent post, are "accessibility tags". They are not there for the convenience of the user, but to
enable functions through which third party software can properly interpret the document, in a pre-defined order. No aspect of these tags is intended as something directly visible for the reader of the file, nor do they have any direct relation to the text formatting options which would benefit someone editing the file (in most cases, their use makes later editing more cumbersome).
- Accessibility tags need to be drawn manually at this time. No it is not possible to simply "select all" text objects, and add the paragraph tag, then get a nicely separated set of tags. The reason accessibility needs to be done fully manually is because a computer is not great at determining reading order, and what information is most important, or which items are "cliffnotes" of otherwise unimportant side-notes. Eventually this may become a possibility, but for now, a human touch is needed.
Accessibility tagging is supposed to to be the final step performed after a document is completed, and no further editing will happen. It also only serves to help accessibility software (such as vision aiding tools), to operate in the intended consumption order. If you will not be sharing the document publicly, or with someone who needs these accessibility functions, there is no purpose to such tagging, and it will not make editing your file any easier. In-fact, if you may need to later make any level of extended changes to the file, after adding these, you will likely need to perform tedious maintenance to ensure the new content is correctly tagged and ordered, so it is very important to understand when it is time to use this function, and when it is not.
Before adding tags, you should ask yourself, Do you actually have any need to add these items, and do you need to do so right now? Will it benefit anyone else reading your docuemnts at this point in time? The tools are not easy to use, and while some improvements likely will be made in time, it is an ongoing process, and not a quick one. In the meantime, use of these functions is quite cumbersome, due to the complexity and strict accessibility requirements.
Relating to the later question the "paragraph options" menu.
- This is accessible when text is selected by pushing Ctrl_H (will only affect the current line, unless you actively select the text while in text editing mode). Or you can get to the same area via the right click context menu. However, this menu has no relation to the accessibility options which you are investigating here today.
image.png
As I mentioned in other posts, the concept of a paragraph does not actually exist in PDF. These settings are presented as a tool to reposition and reflow the selected text objects relevant to one another, it is "emulating" the existence of a paragraph structure, hence the name of the tool, but it is not a real paragraph object, since they simply do not exist. To see text in its actual native format in PDF, you need to use the "Edit Objects > text" tool, instead of the "Edit Text" tool (Since edit text enables text object grouping in an effort to make the unwieldy PDF format, more usable in a human capacity.
Moving on to your processes today:
Jensen Head wrote: ↑Mon Jun 02, 2025 8:42 am
If I save this document as .txt, the visually existing eight paragraphs are preserved, but six paragraphs are added to them that are not in the text (that is, additional paragraph breaks are inserted).
As before, the concept of tags only serves as an aid for third party accessibility tools, it does not, and cannot have an impact on the export functions. When you export to a txt file, the "paragraph" formatting follows the same rules as the "edit text" tool, with some special "post-processing" that is handled entirely by these options:
image(1).png
Once again, the concept of a paragraph is emulated, and assumed based on the proximity to other text items. If 4 "lines" of text are all positions in a similar pattern, within a few "points" of one another, they will be assumed and emulated as a congruent object for display purposes.
Similarly, there generally are not actual return characters present in a document (Though it is possible to insert them manually), and so the "insert line breaks" option does this automatically, based on the whether or not the following text item, according to the content order, is inline horizontally.
Kind regards,
You do not have the required permissions to view the files attached to this post.