howard/news-analyze

Fork 0

mirror of https://github.com/hpware/news-analyze.git synced 2025-07-16 19:19:33 +08:00

元皓 b9ff1e73c9

Update README.md

2025-07-08 17:21:38 -07:00

9.6 KiB

Raw Blame History

News Analyze

English Version 繁體中文版

A Neighborhood project. For desktop users only, mobile is not supported (fn).

App Design: PDF Document

Reverse engineering documentation: about

Deploy: via docker compose

Goals before the next devlog: Markdown file

Video Guide: YouTube

Demo:

Production (Latest Docker Image): https://yhw.tw/news

Beta (Beta Docker Image): https://newsbeta.20090526.xyz

Video Guide

https://github.com/user-attachments/assets/29414c5d-3b2f-420d-93c0-95c14a15bbb7

Notes:

The enviroment vars are stored in the database, which is cursed, I know, but this is the only way to let the system access new envs sent by the user, so if you are trying to spin up a instence of this app you MUST put the postgres url in the .env & create a table using beekeeper studio (my choice for SQL editing, you can choose whatever you like), and after that you can create the entire database by using this api call, https://<<your_domain>>/api/create_database in your browser.

    CREATE TABLE IF NOT EXISTS global_vars (
    NAME TEXT PRIMARY KEY NOT NULL,
    VAR TEXT NOT NULL
    );
    INSERT INTO global_vars(name, var)
    VALUES ('groq_api_key', '<<YOUR_API_KEY_HERE>>');
    INSERT INTO global_vars(name, var)
    VALUES ('password_hash_salt', '<<YOUR_PASSWORD_SALT_HERE>>');

Replace <<YOUR_API_KEY_HERE>> with your actual api key, and also replace <<YOUR_PASSWORD_SALT_HERE>> with a random salt you get by running this command on your Mac/Linux device (Windows idk) openssl rand -base64 48.

Issues:

Onboarding:

Onboarding is a must for most people that are using the app for the first time, but I want to do to via a non-video like system, however implementing the function in a already large repo is kinda hard. So I just add a basic video onboarding system.

The current login DOES NOT see if you're logged in or not, it just prompts if the user wants to login or not. This NEEDS to be fixed

Windows with the wraping function `<BlurPageBeforeLogin></BlurPageBeforeLogin>`:

The wrapping function, <BlurPageBeforeLogin></BlurPageBeforeLogin>, is currently running a static value for testing use only, so for pages that reqire you to be logged in WILL NOT work for (even if you logged in). It is just a value in the blurPageBeforeLogin.vue function if (true)

Server Downtime

Use https://status.yhw.tw/ for checking down time, most of the time it will be up, but sometimes it just won't updated to the latest feature & update.

Scraping restrictions:

As LINE Today only loads & put the image file via JS in the browser, node-fetch is not working (yes, this platform uses node-fetch as the only way to scrape stuff). If LINE today became more problematic of this platform, those APIs will no longer work & most of the things will just not work, as it requires LINE Today to NOT patch these node-fetch things.

Developing enviroment lagging:

The desktop app alone has 700+ lines of code, and compiling on the fly is really slow & can really lag your computer (like my Macbook, which has lagged the entire time I'm trying to develop the app.).

Translating system:

A few pages now contains translations, like the news, aboutNewsOrg and newsView pages. This project currently is using Google Translate. However, muiti translate platform support is coming soon™ (If you login with your account). The translations are not accrate at all, like something that should be I just want to write about sports becomes I just want to write, like bro, what is even that?

Deploying:

This code is absolutly NOT designed to be spinned up at Vercel or Netlify, it has the scraping system now inside of the main website code, oh also the entire "caching feature" is based in memory, so please don't use those platforms, for Zeabur your cost might be expensive. idk, I haven't tried hit yet. The web url: https://news.yuanhau.com is hosted on my own infra, you should too. Please get a server off of yahoo 拍賣, 蝦皮 or eBay to do so.

The API returning outdated data from more than 5+ years:

Here is the GitHub Issue: https://github.com/hpware/news-analyze/issues/2

When using the desktop in the dev env it pops up an error

For some reasons, Nuxt's dev env prev does not display this error, but with the newer ones, it started displaying this error, please run ./wipedev.sh or ./wipedev.bat and restart the dev server. (And this is only a temp fix, I have no idea how can I fix this, if you have a fix, please submit a PR, thx.)

Stack:

Postgres
Tailwind
Nuxt
Animate.css
GSAP
Nuxt i18n
BunJS
Groq
Custom Infra
Docker
Docker Compose
GitHub Actions
Line Today (Unoffical APIs)
Cheerio
Sentry
Umami Analytics
Prettier

Mirrors:

Preview Images:

Home Page:

Desktop App:

Why Line Today?

LINE Advertising Marketing

According to LINE's marketing team, "LINE TODAY is an important portal for consumers to obtain various knowledge and information." Of course, it can let news media make money for its news, so many articles will be on LINE Today and they will be short, consise and easy to find differents.

FREE APIs:

NOTE: The returning data WILL BE in chinese, if you don't mind, you can use it.

API Info: https://news.yuanhau.com/apis

If you just want to throw to an LLM and tell it to do stuff, here is the endpoints (w/cors, but I (hpware) has given permission for you to use it for free.), you are welcome to build something better than mine. Just credit me :) thanks.

https://news.yuanhau.com/api/tabs for fetching Tabs

The API looks like this:

{
  "data": [
    {
      "text": "焦點",
      "url": "top",
      "default": true
    },
    ...
    {
      "text": "追蹤",
      "url": "subscription",
      "default": false
    }
  ],
  "cached": true
}

https://news.yuanhau.com/api/home/lt?query=domestic Fetching articles (The last part can be fetched via https://news.yuanhau.com/datainfo/linetodayjsondata.json and DON'T remove the ?query=)