Where can I find how to tex symbols for different fonts?Loading fonts in LuaTeX (plain TeX)How to install or replace fonts in old TeX files?Math symbols in plain TeXHow can I use times fonts in Plain TeX?Can I have a list of all fonts available in plain TeX?Using urw-garamond fonts in plain TeXHow is obeylines different from obeyspaces?Where can I find the plain TeX source file on my PC?LaTeX for plain TeX users?Where can I read about the TeX commands not LaTeX commands // Math commands
Expressing 'our' for objects belonging to our apartment
Why is the relative clause in the following sentence not directly after the noun and why is the verb not in the end of the sentence?
How should I tell my manager I'm not paying for an optional after work event I'm not going to?
How long would it take for people to notice a mass disappearance?
Would the Disguise Self spell be able to reveal hidden birthmarks/tattoos (of the person they're disguised as) to a character?
How can I support myself financially as a 17 year old with a loan?
Upside-Down Pyramid Addition...REVERSED!
Would Hubble Space Telescope improve black hole image observed by EHT if it joined array of telesopes?
How to change lightning-radio-group selected value programmatically?
Why are prions in animal diets not destroyed by the digestive system?
Point of the the Dothraki's attack in GoT S8E3?
Is latino sine flexione dead?
What matters more when it comes to book covers? Is it ‘professional quality’ or relevancy?
What was the design of the Macintosh II's MMU replacement?
Getting a W on your transcript for grad school applications
On which topic did Indiana Jones write his doctoral thesis?
How might a mountain bowl form?
Where can I go to avoid planes overhead?
Using a microphone from the 1930s
What are the differences between credential stuffing and password spraying?
I have a unique character that I'm having a problem writing. He's a virus!
Can hackers enable the camera after the user disabled it?
Why is Arya visibly scared in the library in S8E3?
Can a nothic's Weird Insight action discover secrets about a player character that the character doesn't know about themselves?
Where can I find how to tex symbols for different fonts?
Loading fonts in LuaTeX (plain TeX)How to install or replace fonts in old TeX files?Math symbols in plain TeXHow can I use times fonts in Plain TeX?Can I have a list of all fonts available in plain TeX?Using urw-garamond fonts in plain TeXHow is obeylines different from obeyspaces?Where can I find the plain TeX source file on my PC?LaTeX for plain TeX users?Where can I read about the TeX commands not LaTeX commands // Math commands
I'm aware of fonts such as cmr10, cmex10 and cmsy10. Right now I would like to know how to use TeX to produce symbols in the character tables such as http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
Is there a dictionary somewhere on the internet containing how to TeX every symbol in character tables given the font? (i.e. Given 'x41' and the font cmr10 I want to obtain 'A', given 'x00' and cmr10 I want to obtain 'textGamma').
P.S. This question is motivated by my attempts to extract text from .tex files. I end up deciding to first convert .tex files to DVI files and then use dviasm
to extract the text because it bypasses the need to essentially build another TeX engine.
plain-tex
add a comment |
I'm aware of fonts such as cmr10, cmex10 and cmsy10. Right now I would like to know how to use TeX to produce symbols in the character tables such as http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
Is there a dictionary somewhere on the internet containing how to TeX every symbol in character tables given the font? (i.e. Given 'x41' and the font cmr10 I want to obtain 'A', given 'x00' and cmr10 I want to obtain 'textGamma').
P.S. This question is motivated by my attempts to extract text from .tex files. I end up deciding to first convert .tex files to DVI files and then use dviasm
to extract the text because it bypasses the need to essentially build another TeX engine.
plain-tex
1
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
1
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such asmathcalA
as well.
– Ying Zhou
Apr 24 at 5:16
add a comment |
I'm aware of fonts such as cmr10, cmex10 and cmsy10. Right now I would like to know how to use TeX to produce symbols in the character tables such as http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
Is there a dictionary somewhere on the internet containing how to TeX every symbol in character tables given the font? (i.e. Given 'x41' and the font cmr10 I want to obtain 'A', given 'x00' and cmr10 I want to obtain 'textGamma').
P.S. This question is motivated by my attempts to extract text from .tex files. I end up deciding to first convert .tex files to DVI files and then use dviasm
to extract the text because it bypasses the need to essentially build another TeX engine.
plain-tex
I'm aware of fonts such as cmr10, cmex10 and cmsy10. Right now I would like to know how to use TeX to produce symbols in the character tables such as http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
Is there a dictionary somewhere on the internet containing how to TeX every symbol in character tables given the font? (i.e. Given 'x41' and the font cmr10 I want to obtain 'A', given 'x00' and cmr10 I want to obtain 'textGamma').
P.S. This question is motivated by my attempts to extract text from .tex files. I end up deciding to first convert .tex files to DVI files and then use dviasm
to extract the text because it bypasses the need to essentially build another TeX engine.
plain-tex
plain-tex
edited Apr 23 at 23:36
Ying Zhou
asked Apr 23 at 23:30
Ying ZhouYing Zhou
1109
1109
1
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
1
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such asmathcalA
as well.
– Ying Zhou
Apr 24 at 5:16
add a comment |
1
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
1
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such asmathcalA
as well.
– Ying Zhou
Apr 24 at 5:16
1
1
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
1
1
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such as
mathcalA
as well.– Ying Zhou
Apr 24 at 5:16
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such as
mathcalA
as well.– Ying Zhou
Apr 24 at 5:16
add a comment |
4 Answers
4
active
oldest
votes
An adjunct to the "Comprehensive LaTeX symbols list" is the "rawtables
" pdf file that contains font tables for all the fonts covered by that list, arranged in alphabetical order. The font table arrangement shows the location in the font presented to TeX; it does not identify the glyphs by Unicode ID.
The collection is on CTAN: http://mirrors.ctan.org/info/symbols/comprehensive
and the pdf listing comes in either lettersize or a4.
Despite the "LaTeX" in the title, these fonts can be used also with plain TeX.
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
add a comment |
While the user specifies TeX (for Plain TeX, see SUPPLEMENT), these tables are most easily obtainable via LaTeX, in the format described by the OP at http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
documentclassarticle
usepackagefonttable
begindocument
fonttablecmex10
enddocument
The same font table may be alternately obtained via xfonttableOMXcmexmn
.
To answer the OP's specific question about the letter A
in cmr10,
documentclassarticle
usepackagefonttable
begindocument
fonttablecmr10
enddocument
Just remember though, that for a given encoding scheme, one knows where to find various glyphs, even without printing the font table, especially for standard glyphs such as those available in ASCII.
SUPPLEMENT
For the Plain TeX alternative (fontchart.tex
, found at https://ctan.org/pkg/fontchart?lang=en), here is the result for cmr10:
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. TheTeX
code that can generate them also need to be identified automatically if possible.
– Ying Zhou
Apr 24 at 4:06
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at thecmr
table, I can see thatfontfamilycmrfontencodingOT1selectfontchar"03
will give me a capitallambda
. On the other hand, if visually presented with a$Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.
– Steven B. Segletes
Apr 24 at 9:55
add a comment |
LaTeX Font Encodings contains font tables for every legacy LaTeX encoding in common use. The modern toolchain with fontspec
simply uses the Unicode encoding (under the alias TU).
If you want to be able to copy-and-paste, or otherwise automatically convert, text from a PDF document compiled from LaTeX source, your best bet is to use unicode-math
. Then, all your glyphs are already encoded in Unicode.
A font using a non-standard encoding (such as U
) should come with documentation. For example, the masfonts
manual comes with tables of all its fonts in an appendix.
Really thanks for the book! I got itsTeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).
– Ying Zhou
Apr 24 at 4:08
add a comment |
You could look at the tex4ht files. There are lot of mappings of slot positions to unicode in the hf-fonts. E.g. texmf-disttex4htht-fontsunicodecmcmex.htf
contains mappings like this:
'∘' '' 112
'∘' '' 113
'∘' '' 114
'∘' '' 115
'∘' '' 116
'│' '' 117
'┌' '' 118
'║' '' 119
'↑' '' 120
'↓' '' 121
Mappings between slot positions and (la)tex commands are imho much more difficult to optain and maintain as every package/document can change or add commands.
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "85"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f486288%2fwhere-can-i-find-how-to-tex-symbols-for-different-fonts%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
An adjunct to the "Comprehensive LaTeX symbols list" is the "rawtables
" pdf file that contains font tables for all the fonts covered by that list, arranged in alphabetical order. The font table arrangement shows the location in the font presented to TeX; it does not identify the glyphs by Unicode ID.
The collection is on CTAN: http://mirrors.ctan.org/info/symbols/comprehensive
and the pdf listing comes in either lettersize or a4.
Despite the "LaTeX" in the title, these fonts can be used also with plain TeX.
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
add a comment |
An adjunct to the "Comprehensive LaTeX symbols list" is the "rawtables
" pdf file that contains font tables for all the fonts covered by that list, arranged in alphabetical order. The font table arrangement shows the location in the font presented to TeX; it does not identify the glyphs by Unicode ID.
The collection is on CTAN: http://mirrors.ctan.org/info/symbols/comprehensive
and the pdf listing comes in either lettersize or a4.
Despite the "LaTeX" in the title, these fonts can be used also with plain TeX.
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
add a comment |
An adjunct to the "Comprehensive LaTeX symbols list" is the "rawtables
" pdf file that contains font tables for all the fonts covered by that list, arranged in alphabetical order. The font table arrangement shows the location in the font presented to TeX; it does not identify the glyphs by Unicode ID.
The collection is on CTAN: http://mirrors.ctan.org/info/symbols/comprehensive
and the pdf listing comes in either lettersize or a4.
Despite the "LaTeX" in the title, these fonts can be used also with plain TeX.
An adjunct to the "Comprehensive LaTeX symbols list" is the "rawtables
" pdf file that contains font tables for all the fonts covered by that list, arranged in alphabetical order. The font table arrangement shows the location in the font presented to TeX; it does not identify the glyphs by Unicode ID.
The collection is on CTAN: http://mirrors.ctan.org/info/symbols/comprehensive
and the pdf listing comes in either lettersize or a4.
Despite the "LaTeX" in the title, these fonts can be used also with plain TeX.
answered Apr 24 at 0:23
barbara beetonbarbara beeton
70.5k9161383
70.5k9161383
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
add a comment |
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
Really thanks! I have read it! I still need to find an actual list of TeX code that can generate the respective glyphs. It seems that I need to manually do that. I will do that for at least the most popular fonts.
– Ying Zhou
Apr 24 at 4:12
add a comment |
While the user specifies TeX (for Plain TeX, see SUPPLEMENT), these tables are most easily obtainable via LaTeX, in the format described by the OP at http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
documentclassarticle
usepackagefonttable
begindocument
fonttablecmex10
enddocument
The same font table may be alternately obtained via xfonttableOMXcmexmn
.
To answer the OP's specific question about the letter A
in cmr10,
documentclassarticle
usepackagefonttable
begindocument
fonttablecmr10
enddocument
Just remember though, that for a given encoding scheme, one knows where to find various glyphs, even without printing the font table, especially for standard glyphs such as those available in ASCII.
SUPPLEMENT
For the Plain TeX alternative (fontchart.tex
, found at https://ctan.org/pkg/fontchart?lang=en), here is the result for cmr10:
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. TheTeX
code that can generate them also need to be identified automatically if possible.
– Ying Zhou
Apr 24 at 4:06
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at thecmr
table, I can see thatfontfamilycmrfontencodingOT1selectfontchar"03
will give me a capitallambda
. On the other hand, if visually presented with a$Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.
– Steven B. Segletes
Apr 24 at 9:55
add a comment |
While the user specifies TeX (for Plain TeX, see SUPPLEMENT), these tables are most easily obtainable via LaTeX, in the format described by the OP at http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
documentclassarticle
usepackagefonttable
begindocument
fonttablecmex10
enddocument
The same font table may be alternately obtained via xfonttableOMXcmexmn
.
To answer the OP's specific question about the letter A
in cmr10,
documentclassarticle
usepackagefonttable
begindocument
fonttablecmr10
enddocument
Just remember though, that for a given encoding scheme, one knows where to find various glyphs, even without printing the font table, especially for standard glyphs such as those available in ASCII.
SUPPLEMENT
For the Plain TeX alternative (fontchart.tex
, found at https://ctan.org/pkg/fontchart?lang=en), here is the result for cmr10:
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. TheTeX
code that can generate them also need to be identified automatically if possible.
– Ying Zhou
Apr 24 at 4:06
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at thecmr
table, I can see thatfontfamilycmrfontencodingOT1selectfontchar"03
will give me a capitallambda
. On the other hand, if visually presented with a$Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.
– Steven B. Segletes
Apr 24 at 9:55
add a comment |
While the user specifies TeX (for Plain TeX, see SUPPLEMENT), these tables are most easily obtainable via LaTeX, in the format described by the OP at http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
documentclassarticle
usepackagefonttable
begindocument
fonttablecmex10
enddocument
The same font table may be alternately obtained via xfonttableOMXcmexmn
.
To answer the OP's specific question about the letter A
in cmr10,
documentclassarticle
usepackagefonttable
begindocument
fonttablecmr10
enddocument
Just remember though, that for a given encoding scheme, one knows where to find various glyphs, even without printing the font table, especially for standard glyphs such as those available in ASCII.
SUPPLEMENT
For the Plain TeX alternative (fontchart.tex
, found at https://ctan.org/pkg/fontchart?lang=en), here is the result for cmr10:
While the user specifies TeX (for Plain TeX, see SUPPLEMENT), these tables are most easily obtainable via LaTeX, in the format described by the OP at http://www.math.union.edu/~dpvc/jsmath/symbols/cmex10.html
documentclassarticle
usepackagefonttable
begindocument
fonttablecmex10
enddocument
The same font table may be alternately obtained via xfonttableOMXcmexmn
.
To answer the OP's specific question about the letter A
in cmr10,
documentclassarticle
usepackagefonttable
begindocument
fonttablecmr10
enddocument
Just remember though, that for a given encoding scheme, one knows where to find various glyphs, even without printing the font table, especially for standard glyphs such as those available in ASCII.
SUPPLEMENT
For the Plain TeX alternative (fontchart.tex
, found at https://ctan.org/pkg/fontchart?lang=en), here is the result for cmr10:
edited Apr 24 at 2:23
answered Apr 24 at 0:26
Steven B. SegletesSteven B. Segletes
164k9209422
164k9209422
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. TheTeX
code that can generate them also need to be identified automatically if possible.
– Ying Zhou
Apr 24 at 4:06
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at thecmr
table, I can see thatfontfamilycmrfontencodingOT1selectfontchar"03
will give me a capitallambda
. On the other hand, if visually presented with a$Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.
– Steven B. Segletes
Apr 24 at 9:55
add a comment |
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. TheTeX
code that can generate them also need to be identified automatically if possible.
– Ying Zhou
Apr 24 at 4:06
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at thecmr
table, I can see thatfontfamilycmrfontencodingOT1selectfontchar"03
will give me a capitallambda
. On the other hand, if visually presented with a$Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.
– Steven B. Segletes
Apr 24 at 9:55
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. The
TeX
code that can generate them also need to be identified automatically if possible.– Ying Zhou
Apr 24 at 4:06
Really thanks for your detailed answer! However I need to identify more than just the glyphs even though they are also something I can only identify manually now. The
TeX
code that can generate them also need to be identified automatically if possible.– Ying Zhou
Apr 24 at 4:06
1
1
@YingZhou It depends on which direction of decode you need to go. For example, looking at the
cmr
table, I can see that fontfamilycmrfontencodingOT1selectfontchar"03
will give me a capital lambda
. On the other hand, if visually presented with a $Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.– Steven B. Segletes
Apr 24 at 9:55
@YingZhou It depends on which direction of decode you need to go. For example, looking at the
cmr
table, I can see that fontfamilycmrfontencodingOT1selectfontchar"03
will give me a capital lambda
. On the other hand, if visually presented with a $Lambda$
and then to figure out what font family it belongs to, which font encloding, and which glyph slot, that is a different problem.– Steven B. Segletes
Apr 24 at 9:55
add a comment |
LaTeX Font Encodings contains font tables for every legacy LaTeX encoding in common use. The modern toolchain with fontspec
simply uses the Unicode encoding (under the alias TU).
If you want to be able to copy-and-paste, or otherwise automatically convert, text from a PDF document compiled from LaTeX source, your best bet is to use unicode-math
. Then, all your glyphs are already encoded in Unicode.
A font using a non-standard encoding (such as U
) should come with documentation. For example, the masfonts
manual comes with tables of all its fonts in an appendix.
Really thanks for the book! I got itsTeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).
– Ying Zhou
Apr 24 at 4:08
add a comment |
LaTeX Font Encodings contains font tables for every legacy LaTeX encoding in common use. The modern toolchain with fontspec
simply uses the Unicode encoding (under the alias TU).
If you want to be able to copy-and-paste, or otherwise automatically convert, text from a PDF document compiled from LaTeX source, your best bet is to use unicode-math
. Then, all your glyphs are already encoded in Unicode.
A font using a non-standard encoding (such as U
) should come with documentation. For example, the masfonts
manual comes with tables of all its fonts in an appendix.
Really thanks for the book! I got itsTeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).
– Ying Zhou
Apr 24 at 4:08
add a comment |
LaTeX Font Encodings contains font tables for every legacy LaTeX encoding in common use. The modern toolchain with fontspec
simply uses the Unicode encoding (under the alias TU).
If you want to be able to copy-and-paste, or otherwise automatically convert, text from a PDF document compiled from LaTeX source, your best bet is to use unicode-math
. Then, all your glyphs are already encoded in Unicode.
A font using a non-standard encoding (such as U
) should come with documentation. For example, the masfonts
manual comes with tables of all its fonts in an appendix.
LaTeX Font Encodings contains font tables for every legacy LaTeX encoding in common use. The modern toolchain with fontspec
simply uses the Unicode encoding (under the alias TU).
If you want to be able to copy-and-paste, or otherwise automatically convert, text from a PDF document compiled from LaTeX source, your best bet is to use unicode-math
. Then, all your glyphs are already encoded in Unicode.
A font using a non-standard encoding (such as U
) should come with documentation. For example, the masfonts
manual comes with tables of all its fonts in an appendix.
answered Apr 23 at 23:39
DavislorDavislor
7,6791433
7,6791433
Really thanks for the book! I got itsTeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).
– Ying Zhou
Apr 24 at 4:08
add a comment |
Really thanks for the book! I got itsTeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).
– Ying Zhou
Apr 24 at 4:08
Really thanks for the book! I got its
TeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).– Ying Zhou
Apr 24 at 4:08
Really thanks for the book! I got its
TeX
code and am trying to figure out how to let them print the code (e.g. 'rightarrow') in addition to the glyphs (e.g. a right arrow).– Ying Zhou
Apr 24 at 4:08
add a comment |
You could look at the tex4ht files. There are lot of mappings of slot positions to unicode in the hf-fonts. E.g. texmf-disttex4htht-fontsunicodecmcmex.htf
contains mappings like this:
'∘' '' 112
'∘' '' 113
'∘' '' 114
'∘' '' 115
'∘' '' 116
'│' '' 117
'┌' '' 118
'║' '' 119
'↑' '' 120
'↓' '' 121
Mappings between slot positions and (la)tex commands are imho much more difficult to optain and maintain as every package/document can change or add commands.
add a comment |
You could look at the tex4ht files. There are lot of mappings of slot positions to unicode in the hf-fonts. E.g. texmf-disttex4htht-fontsunicodecmcmex.htf
contains mappings like this:
'∘' '' 112
'∘' '' 113
'∘' '' 114
'∘' '' 115
'∘' '' 116
'│' '' 117
'┌' '' 118
'║' '' 119
'↑' '' 120
'↓' '' 121
Mappings between slot positions and (la)tex commands are imho much more difficult to optain and maintain as every package/document can change or add commands.
add a comment |
You could look at the tex4ht files. There are lot of mappings of slot positions to unicode in the hf-fonts. E.g. texmf-disttex4htht-fontsunicodecmcmex.htf
contains mappings like this:
'∘' '' 112
'∘' '' 113
'∘' '' 114
'∘' '' 115
'∘' '' 116
'│' '' 117
'┌' '' 118
'║' '' 119
'↑' '' 120
'↓' '' 121
Mappings between slot positions and (la)tex commands are imho much more difficult to optain and maintain as every package/document can change or add commands.
You could look at the tex4ht files. There are lot of mappings of slot positions to unicode in the hf-fonts. E.g. texmf-disttex4htht-fontsunicodecmcmex.htf
contains mappings like this:
'∘' '' 112
'∘' '' 113
'∘' '' 114
'∘' '' 115
'∘' '' 116
'│' '' 117
'┌' '' 118
'║' '' 119
'↑' '' 120
'↓' '' 121
Mappings between slot positions and (la)tex commands are imho much more difficult to optain and maintain as every package/document can change or add commands.
answered Apr 24 at 8:05
Ulrike FischerUlrike Fischer
202k9310697
202k9310697
add a comment |
add a comment |
Thanks for contributing an answer to TeX - LaTeX Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f486288%2fwhere-can-i-find-how-to-tex-symbols-for-different-fonts%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
Have you considered just using the PDF directly, e.g. with pdf2htmlEX ? There's a comparison of other options here.
– ShreevatsaR
Apr 24 at 3:22
@ShreevatsaR No, that does not work for me because despite the fact that the HTML is almost perfect I can't extract any characters at all.
– Ying Zhou
Apr 24 at 4:03
1
I am able to copy characters from e.g. this demo. Of course it only works when they are present in the original PDF.
– ShreevatsaR
Apr 24 at 4:36
@ShreevatsaR Well, I need to automate the process. Moreover by "copying characters" I include copying non-Latin characters and things such as
mathcalA
as well.– Ying Zhou
Apr 24 at 5:16