discourse

mirror of https://github.com/discourse/discourse.git synced 2024-11-23 03:16:41 +08:00

Author	SHA1	Message	Date
Régis Hanol	823b940b9d	PERF: improve loading of indexes in bulk import Similar strategy as for `c52191d` in which we stream the results from the database into an automatically growing array instead of using a hash.	2020-06-18 16:32:27 +02:00
Régis Hanol	c52191d49e	PERF: improve loading a imported_ids in bulk imports - Stream the queries that load the imported_ids - Use an array instead of a hash for keeping the mapping between imported_ids and new ids - Ensure we always treat the imported_ids as integers instead of strings	2020-06-16 19:55:08 +02:00
Jarek Radosz	669c940ec3	Revert "DEV: Remove the remaining ENV["TRAVIS"] usage (#10041 )" This reverts commit `78aff841e3`. See https://review.discourse.org/t/dev-remove-the-remaining-env-travis-usage-10041/12737/4?u=cvx	2020-06-16 19:42:00 +02:00
Jarek Radosz	78aff841e3	DEV: Remove the remaining ENV["TRAVIS"] usage (#10041 )	2020-06-16 17:41:15 +02:00
Jarek Radosz	3d55f2e3b7	FIX: Improvements and fixes to the image downsizing script (#9950 ) Fixed bugs, added specs, extracted the upload downsizing code to a class, added support for non-S3 setups, changed it so that images aren't downloaded twice. This code has been tested on production and successfully resized ~180k uploads. Includes: * DEV: Extract upload downsizing logic * DEV: Add support for non-S3 uploads * DEV: Process only images uploaded by users * FIX: Incorrect usage of `count` and `exist?` typo * DEV: Spec S3 image downsizing * DEV: Avoid downloading images twice * DEV: Update filesizes earlier in the process * DEV: Return false on invalid upload * FIX: Download images that currently above the limit (If the image size limit is decreased, then there was no way to resize those images that now fall outside the allowed size range) * Update script/downsize_uploads.rb (Co-authored-by: Régis Hanol <regis@hanol.fr>)	2020-06-11 14:47:59 +02:00
Justin DiRose	be28fc73a0	DEV: Improvements to Drupal script (#10016 ) Refactors script to follow conventions of other importers and adds some features including like import, processing of post raw text, and, if needed, SSO import.	2020-06-10 10:59:17 -05:00
Arpit Jalan	a93d24501c	FIX: base import script was not updating first_post_created_at column FEATURE: new rake task to update first_post_created_at column The not-equal operator (`<>`) in PostgreSQL does not compare values with NULL. We should instead use `IS DISTINCT FROM` when comparing values with NULL.	2020-06-04 11:26:40 +05:30
Gerhard Schlager	f683c5d0e0	DEV: Check English locale for errors in CI Moves the most important checks into a linter. It gets executed by Lefthook as well as the docker rake task and Github actions. Doing those checks in rspec takes too long and it produces errors when the discourse:test Docker image contains old, invalid locale files.	2020-06-03 21:54:58 +02:00
Jarek Radosz	4babdf510b	DEV: Update facter usage `Facter.reset` (`65d167eac9/lib/facter.rb (L126-L137)`) clears `Facter::Options[:external_dir]` which seems to be the 4.x equivalent of `Facter::Util::Config.external_facts_dirs`. This commit also makes sure that version 4.0 or higher is installed.	2020-06-01 05:50:49 +02:00
Jarek Radosz	8c8391c742	DEV: Don't require wget in script/bench	2020-06-01 05:50:48 +02:00
Jarek Radosz	eb462bfb3d	FIX: Improve image downsizing script (#9549 ) Correctly handles more upload formats in posts, updates post custom fields, fixes more edge cases, adds debugging capabilities. (VERBOSE=1 and INTERACTIVE=1 flags) Includes these commits and some more: * DEV: Show the fixed image dimensions * FIX: Support more upload url formats * DEV: Remove the old upload after updating posts * FIX: Use the `process_post_#{id}` mutex * FIX: Avoid rebaking twice * DEV: Print out the link to the post * DEV: Process posts chronologically * DEV: Do a dry-run before saving, pause on any issue * FIX: Also process deleted posts * DEV: Make matchers case-insensitive * DEV: Pause on "detached" uploads, add more debug info * DEV: Print out time when finished * DEV: Add support for WORKER_ID/WORKER_COUNT * DEV: Fix the onebox in cooked text heuristic * DEV: Don't report already processed posts * DEV: Beep when done! * DEV: Ignore issues with deleted posts * DEV: Ignore issues with deleted topics * DEV: Multiline SQL * DEV: Use the bulk attribute assignment * DEV: Add ENV["INTERACTIVE"] mode * DEV: Handle post custom fields * DEV: Bail on non-S3 sites * DEV: Allow sizes smaller than 1 mpix	2020-05-26 15:38:23 +02:00
Kane York	869f9b20a2	PERF: Dematerialize topic_reply_count (#9769 ) * PERF: Dematerialize topic_reply_count It's only ever used for trust level promotions that run daily, or compared to 0. We don't need to track it on every post creation. * UX: Add symbol in TL3 report if topic reply count is capped * DEV: Drop user_stats.topic_reply_count column	2020-05-14 15:42:00 -07:00
Blake Erickson	ab919332dc	DEV: api documentation updates (#9612 ) * DEV: api documentation updates - Created a script to convert json responses to rswag - Documented several api endpoints - Switched rswag to use header based auth * Update script, fix some schema missmatches	2020-05-11 13:06:49 -06:00
Francesco Frassinelli	c6f68e4006	Fix syntax error in fluxbb.rb (#9727 )	2020-05-11 11:07:57 -04:00
Kyle E. Mitchell	6c968b7945	Add script for compiling copyright deposits (#9646 ) * Add script for compiling copyright deposits * git mv copyright-deposit script/	2020-05-06 12:51:45 -04:00
Krzysztof Kotlarek	9bff0882c3	FEATURE: Nokogumbo (#9577 ) * FEATURE: Nokogumbo Use Nokogumbo HTML parser.	2020-05-05 13:46:57 +10:00
Martin Brennan	867bc3b48e	FIX: Change base importer to create new Bookmark records (#9603 ) Also add a spec and fixture with a mock importer that we can use to test the create_X methods of the base importer	2020-05-01 11:34:55 +10:00
Sam Saffron	d0d5a138c3	DEV: stop freezing frozen strings We have the `# frozen_string_literal: true` comment on all our files. This means all string literals are frozen. There is no need to call #freeze on any literals. For files with `# frozen_string_literal: true` ``` puts %w{a b}[0].frozen? => true puts "hi".frozen? => true puts "a #{1} b".frozen? => true puts ("a " + "b").frozen? => false puts (-("a " + "b")).frozen? => true ``` For more details see: https://samsaffron.com/archive/2018/02/16/reducing-string-duplication-in-ruby	2020-04-30 16:48:53 +10:00
discoursehosting	094ddb1c1f	VBulletin5 importer improvements (#9477 ) - no more hard coded contenttypes - permalinks for topics, categories, subcategories - better uploads handling - tag support	2020-04-22 22:04:59 +02:00
Jarek Radosz	28c706bd09	DEV: Do less work in docker_test (#9470 ) * DEV: Update the working tree just once. `git pull` was effectively doing `git fetch` and `git merge FETCH_HEAD`, and only then we were checking out the desired branch/commit. This change will skip the the merge step. * DEV: Don't run lefthook in docker_test	2020-04-21 03:48:58 +02:00
Jay Pfaffman	3c72cbc5de	FIX: Google groups import changed login URL (#9432 ) I'm not clear why changing only the `wait_for_url` address was necessary and not also the `get` a few lines above, but this change seems to work for me on both literatecomputing.com Groups and a public group.	2020-04-15 16:45:14 -04:00
Gerhard Schlager	2b2584912a	Improve Telligent import script * Imports private messages * Replaces internal links for topics and replies * Allows incremental import of accepted answers	2020-04-03 18:10:52 +02:00
Robin Ward	b2b7afd310	Rename the server side widget hbs compiler	2020-03-27 12:06:14 -04:00
Gerhard Schlager	739430c01e	FIX: mbox import failed if no tags were configured	2020-03-26 16:41:11 +01:00
Gerhard Schlager	d216483c53	FIX: Importing with pgbouncer failed Checking if all records have been imported uses a temp table in PostgreSQL. This fails when pgbouncer is used unless the temp table is created inside a transaction.	2020-03-26 16:41:09 +01:00
Gerhard Schlager	c94b63bc75	DEV: Improve import of attachments from Telligent	2020-03-26 16:37:55 +01:00
Jarek Radosz	d21d80198c	DEV: Update rubocop-discourse (#9270 ) Includes: * DEV: Use `eq_time` matcher	2020-03-26 16:32:41 +01:00
Robin Ward	eaa324ecbd	Revert "Move the widget-hbs compiler to js from es6" This reverts commit `5d66a2c16e`.	2020-03-25 16:13:26 -04:00
Robin Ward	5d66a2c16e	Move the widget-hbs compiler to js from es6	2020-03-25 15:03:21 -04:00
Gerhard Schlager	5b2b769eb7	DEV: Ensure uploads aren't deleted during imports Sidekiq might delete uploads if you, for some reason, create upload records before using them in posts.	2020-03-24 17:14:16 +01:00
Gerhard Schlager	445b35381d	Improve Telligent import script * Detects mostly all attachments and it's a lot faster * Parses user properties in Ruby instead of the DB, because that's less errorprone * Imports user avatars * Imports topic views by users * Better handling of quotes and YouTube links	2020-03-23 09:18:12 +01:00
Gerhard Schlager	d27ece9ded	FIX: Method from Telligent import script was deleted by accident	2020-03-14 22:10:40 +01:00
Gerhard Schlager	e825f47daa	DEV: Better handling of incremental scrapes for Google Groups	2020-03-14 00:00:36 +01:00
Gerhard Schlager	0a88232e87	DEV: Improve mbox import script * Better documentation of settings * Add option to exclude trimmed parts of emails (enabled by default) to not revail email addresses	2020-03-14 00:00:36 +01:00
Gerhard Schlager	36062f43c8	DEV: Improve Telligent import script * Adds ability to map forums to categories and tags as well as ignore forums. * Fixes regular expression for detecting attachments in posts. * Handles "remote attachments" 😮 by inserting a link. * Imports view counts for topics. * Handles incorrect references of parent posts. * Better handling of quotes. * Finds a lot more attachments by trying to replace various Unicode characters in filenames.	2020-03-14 00:00:36 +01:00
Gerhard Schlager	ba1b840816	DEV: Don't deactivate suspended users during import Otherwise a cleanup job might delete those deactivated users.	2020-03-14 00:00:36 +01:00
Justin DiRose	6c948f27ea	FIX: Missing constant in SMF2 importer (#9178 )	2020-03-11 10:19:59 -05:00
Gerhard Schlager	0e752db411	DEV: Improve mbox import script * Customizable email subject prefixes to remove "Re" and "Fwd" as well as localized prefixes. * Configuration option for prefixes like [FOO] or (BAR) which can be replaced with tags during import. * Bugfix: Import script might have skipped some users due to missing ORDER BY.	2020-03-09 10:26:45 +01:00
Gerhard Schlager	edc8d58ac3	FEATURE: Add site setting to disable staged user cleanup ... and disabled the cleanup during imports, otherwise a running Sidekiq might delete users before posts are created	2020-03-09 10:26:41 +01:00
Jarek Radosz	f7ea2fdea5	FIX: Import posts of missing users from phpbb3 (#9085 ) Posts without a user probably shouldn't happen unless there was some direct database tampering, but data like that has been seen in the wild. The importer will assign those posts to the "system" user.	2020-03-06 22:54:40 +01:00
Gerhard Schlager	d7ccb58559	FIX: Google Groups scraper failed to login	2020-03-02 17:24:48 +01:00
Brad Morrical	ff5ff8d0d2	fix invalid byte sequence in UTF-8 (ArgumentError) (#9077 )	2020-02-28 10:26:18 -05:00
Justin DiRose	f35ee5e887	DEV: Improvements to SMF2 script (#9006 )	2020-02-24 12:51:45 -06:00
Sam Saffron	28292d2759	PERF: avoid shelling to get hostname aggressively Previously we had many places in the app that called `hostname` to get hostname of a server. This commit replaces the pattern in 2 ways 1. We cache the result in `Discourse.os_hostname` so it is only ever called once 2. We prefer to use Socket.gethostname which avoids making a shell command This improves performance as we are not spawning hostname processes throughout the app lifetime	2020-02-18 15:13:19 +11:00
David Taylor	5919618a87	DEV: Drop legacy OpenID 2.0 support (#8894 ) This is not used in core or official plugins, and has been printing a deprecation notice since v2.3.0beta4. All OpenID 2.0 code and dependencies have been dropped. The user_open_ids table remains for now, in case anyone has missed the deprecation notice, and needs to migrate their data. Context at https://meta.discourse.org/t/-/113249	2020-02-07 17:32:35 +00:00
Régis Hanol	0843e3e6ce	FIX: add support for sub-sub-categories in base_importer Also delegates 'post_already_imported?' and 'user_already_imported?' to the base importer.	2020-02-05 10:40:28 +01:00
Jarek Radosz	8a82ceb3bc	FIX: Improve downsize_uploads (#8409 ) With this change the script: * Actually removes original large-sized images * Doesn't save processed files if their size has increased * Prevents inconsistent state	2020-01-27 03:31:11 +01:00
Gerhard Schlager	ab07b945c2	Merge pull request #8736 from gschlager/rename_reply_id_column REFACTOR: Rename `post_replies.reply_id` column to `post_replies.reply_post_id`	2020-01-17 17:24:49 +01:00
Gerhard Schlager	e474cda321	REFACTOR: Restoring of backups and migration of uploads to S3	2020-01-14 11:41:35 +01:00
Sam Saffron	710eafdd35	FIX: ensure we consistently pick the same topic for bench We pick the first topic with 30 responses as our bench topic. Previously we simply picked the last topic, but hand no guarantee on ordering. This also attempts to correct previous runs of the bench.	2020-01-08 16:33:45 +11:00

1 2 3 4 5 ...

940 Commits