curlとgrep でHTMLから指定IDを取り出す。/ 後方参照の代替

takuya_1st https://blog.hatena.ne.jp/takuya_1st/ それマグで！ https://takuya-1st.hatenablog.jp/ grep curl スクレイピング grep で HTMLタグの値を取り出す。 curl と組み合わせて戦う curl $URL | grep -oP '(?<=name="post_id" type="hidden" value=").+(?=" />)' grep では後方参照がいい感じに取れないので、「後読み（lookbehind）や先読み（lookahead）」を使って対応することになる。 HTMLのinputのvalueを取り出す例次のようなHTMLがあって、valueだけを取り出したい。とする、。 <input name="post_id" type="hidden" value="XxICIHRNcDFBr9Rl… 190 <iframe src="https://hatenablog-parts.com/embed?url=https%3A%2F%2Ftakuya-1st.hatenablog.jp%2Fentry%2F2023%2F05%2F23%2F170351" title=" curlとgrep でHTMLから指定IDを取り出す。/ 後方参照の代替 - それマグで！" class="embed-card embed-blogcard" scrolling="no" frameborder="0" style="display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;"></iframe> Hatena Blog https://hatena.blog 2023-05-23 17:03:51 curlとgrep でHTMLから指定IDを取り出す。/ 後方参照の代替 rich https://takuya-1st.hatenablog.jp/entry/2023/05/23/170351 1.0 100%