# Steps to convert a published scientific paper into the Physics Derivation Graph

Published 2021-08-08T15:15:00.002Z by Physics Derivation Graph

This post documents the steps taken to get the derivations in the paper Speed of sound from fundamental physical constants into the Physics Derivation Graph.

1) Read the paper. I need to understand the point of the paper and where content is in the text.

2) Identify the primary result of the derivation. Which equation is the output? Is there more than one? In this paper, equation 1 is the primary result.

Tangent: I tested the question of whether the most popular equation is the most important.
This is measurable using the command

cat sound1.tex | tr " " "\n" |\          # replace spaces with newlines -- word tokenization
grep "ref{" |\                        # find internal references to labeled expressions
sed 's/.*\\ref{//' | sed 's/}.*//' |\ # remove the irrelevant latex "\ref{}" to isolate the expression label ID
sort | uniq -c | sort -n              # rank the popularity of expression labels
2 v001
3 abinitio
3 all
3 ratio
3 v0
3 v01
3 v1
5 elemental
6 bohr
6 rydberg
11 a
15 v00
15 v3

Decoding those latex labels to equations in the PDF, "v3" is equation 9, which is tied for most popular with "v00" -- equation 4. The second most popular label "a" is equation 10. The equation I'm claiming as the primary result, equation 1, is labeled as "v0" and is referenced 3 times.

3) What sections of the paper contain the derivation?
There are actually two derivations of the same result.

• one derivation is on page 2 column 1
• another derivation is on page 2 column 2

4) On paper, write out equations from the paper. Here I'm limiting the scope to one derivation on page 2 column 1. Fill in additional equations that are missing from the text. Use arrows to denote flow of derivation.

Often I'll make math mistakes or go down paths that turn out to be irrelevant. If I were a typical researcher, this is where the reconstruction process would stop.

5) On a separate paper, rewrite the graph and label the edges in the directed graph to describe the operation being carried out.

6) On a separate paper, rewrite the content from the previous step to refine the "operation being carried out" to a directed graph with inference rules available in the Physics Derivation Graph.

7) Enter the symbols used in this derivation in https://derivationmap.net/list_all_symbols#add%20symbol or identify pre-existing symbols and record their PDG ID.

8) If novel inference rules are needed for this derivation, enter those in https://derivationmap.net/list_all_inference_rules#add_inf_rule

9) Using https://derivationmap.net/start_new_derivation/, enter the expressions and inference rules for the steps of the derivation. For each step,

• ensure the symbols are referenced correctly in each expression
• ensure the SymPy version of the expression is correct

The output of this step is https://derivationmap.net/review_derivation/608598/

10) Offline, in the JSON file containing the derivation, edit the symbol IDs and SymPy representations as needed to get the step validations to be correct.

Review the modifications on https://derivationmap.net/review_derivation/608598/

TODO: The extensive linking of variables to definitions is not made accessible in the output, so there is no way for the user to leverage the linked information.

• expressions in the PDF/HTML output contain variables which have no link to their definitions
• variables in the text of the PDF/HTML output are not linked to the expressions or their definitions
As a result of the above observation, https://github.com/allofphysicsgraph/proofofconcept/issues/217 was opened.